Scraping business directory

Discussion in 'General Programming Chat' started by Ocke86, Jan 23, 2014.

  1. Ocke86

    Ocke86 Newbie

    Joined:
    Jul 16, 2013
    Messages:
    41
    Likes Received:
    3
    Hello

    I need some help. I need to collect business information from websites. Is this possible on business directory websites? The sites are in Swedish and Norwegian, so I am guessing the tool I use must be multi-language capable?

    Does anyone know a good program for this?

    Appreciate the help
     
  2. thisismymp3

    thisismymp3 Power Member

    Joined:
    Jan 6, 2010
    Messages:
    762
    Likes Received:
    290
    Yeah, it's called something-hell. I forgot the first part of the name, though. It's in someone's signature.
     
  3. SaulGoodman

    SaulGoodman Registered Member

    Joined:
    Oct 8, 2013
    Messages:
    64
    Likes Received:
    38
    Location:
    BHW
    WebHarvy or Yellabot might be useful, but beware when scraping yellow pages. I know for a fact that some sites like yp.com include a small amount of fake information and fake business listings, so they can prove that you scraped their site if you do it on a large scale and they find out...
     
  4. Chris22

    Chris22 Regular Member

    Joined:
    Sep 29, 2010
    Messages:
    400
    Likes Received:
    1,059
  5. divok

    divok Senior Member

    Joined:
    Jul 21, 2010
    Messages:
    1,015
    Likes Received:
    634
    Location:
    http://twitter.com/divok
  6. reapV

    reapV Registered Member

    Joined:
    Jan 27, 2014
    Messages:
    56
    Likes Received:
    10
    Build a small generic scraper in Perl or Python and run it against both language versions of the site. That way you don't depend on any third-party services, and you have more flexibility.
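
    A minimal sketch of such a scraper in Python, using only the standard library. The markup it expects (a div with class "listing" holding spans with class "name" and "phone") is hypothetical, as are the names ListingParser, parse_listings and fetch; adjust them to whatever the real directory pages use. Decoding the page with its declared charset (falling back to UTF-8) is what keeps Swedish and Norwegian characters such as å, ä, ö, æ and ø intact.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class ListingParser(HTMLParser):
    """Collect fields from <div class="listing"> blocks.

    The class names below are assumptions for illustration, not the
    markup of any particular directory site.
    """

    FIELDS = ("name", "phone")

    def __init__(self):
        super().__init__()
        self.listings = []    # finished records
        self._current = None  # record being filled, None outside a listing
        self._field = None    # field whose text is currently being read

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if "listing" in classes:
            self._current = {}
        elif self._current is not None:
            for field in self.FIELDS:
                if field in classes:
                    self._field = field
                    self._current.setdefault(field, "")

    def handle_data(self, data):
        # Only keep text that sits inside a known field of a listing.
        if self._current is not None and self._field:
            self._current[self._field] += data

    def handle_endtag(self, tag):
        if self._field:
            # Closing tag of a field: tidy the collected text.
            self._current[self._field] = self._current[self._field].strip()
            self._field = None
        elif self._current is not None and tag == "div":
            # Closing tag of the listing block itself.
            self.listings.append(self._current)
            self._current = None


def parse_listings(html):
    """Return a list of dicts, one per listing found in the HTML text."""
    parser = ListingParser()
    parser.feed(html)
    parser.close()
    return parser.listings


def fetch(url):
    """Download a page and decode it with the charset the server declares."""
    with urlopen(url) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


SAMPLE_HTML = """
<div class="listing"><span class="name">Bageri Åkesson</span><span class="phone">08-123 45 67</span></div>
<div class="listing"><span class="name">Rørlegger Østby</span><span class="phone">22 33 44 55</span></div>
"""

if __name__ == "__main__":
    for row in parse_listings(SAMPLE_HTML):
        print(row["name"], row["phone"])
```

    To run it against a live site you would call parse_listings(fetch(url)) once per page, for each language version of the directory. Keeping the field names in one tuple means only the class names need changing when the two versions use different markup.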