Recommend a Solid Scraper for Alibaba

Discussion in 'Black Hat SEO Tools' started by smoney, Sep 9, 2013.

  1. smoney

    smoney Newbie

    Joined:
    Sep 9, 2013
    Messages:
    2
    Likes Received:
    0
    I need to build a database of sellers on Alibaba and Tradekey, possibly including photos of their products, but I'm more interested in consolidating and categorizing their contact information. Has anyone here tried scraping these sites? Do they block bots, and can anyone recommend a good tool? Once I have a good tool, I will probably run several instances from multiple IPs at different times, and I realize that I might have to limit how quickly I do this. I plan to import all of the data into a MySQL database. Any advice, suggestions, or information would be appreciated.

    [First post, not my first project.]
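
    A minimal sketch of the two mechanical pieces described above, limiting request speed and loading records into a database without duplicates. Everything in it is an assumption for illustration: the `Throttle` class, the `sellers` table, and its columns are hypothetical names, and the standard-library `sqlite3` module stands in for MySQL so the snippet runs without a server. It does no fetching or parsing.

```python
import sqlite3
import time


class Throttle:
    """Enforce a minimum delay between consecutive requests (hypothetical helper)."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # seconds between requests
        self._clock = clock               # injectable so it can be tested
        self._sleep = sleep
        self._last = None                 # time of the previous request

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def open_db(path=":memory:"):
    """Create the (assumed) sellers table; sqlite3 is a stand-in for MySQL."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sellers ("
        " source TEXT NOT NULL,"
        " profile_url TEXT NOT NULL,"
        " name TEXT,"
        " category TEXT,"
        " UNIQUE (source, profile_url))"
    )
    return conn


def save_seller(conn, source, profile_url, name, category):
    """Insert one record; return False if that (source, url) pair already exists."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO sellers (source, profile_url, name, category)"
        " VALUES (?, ?, ?, ?)",
        (source, profile_url, name, category),
    )
    conn.commit()
    return cur.rowcount == 1
```

    In use, `throttle.wait()` would be called before each request, and `save_seller()` after each record is parsed. The unique key on `(source, profile_url)` is one way to keep several instances from storing the same seller twice.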
     
  2. bartosimpsonio

    bartosimpsonio Jr. VIP Premium Member

    Joined:
    Mar 21, 2013
    Messages:
    8,911
    Likes Received:
    7,515
    Occupation:
    ZLinky2Buy SEO Services
    You'll probably need to hire a professional for that. I see you're new to the forum, so there are two sections here which may help you: Want to Buy and Hire a Freelancer. Good luck.
     
  3. smoney

    smoney Newbie

    Joined:
    Sep 9, 2013
    Messages:
    2
    Likes Received:
    0
    I'm really just looking for advice, experience, and recommendations on software and approaches for scraping such large sites. I can handle all of the technical aspects myself. I know PHP/MySQL and JavaScript and am familiar with Python and Ruby on Rails, but I'd rather not spend time coding if I can avoid it. This project will take long enough, and I have an idea I want to build as quickly as possible in order to be first to market. I'm flying solo, though, and need as much of this data as possible to get started.
     
    Last edited: Sep 9, 2013