1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Scraping Tool - Must Have?

Discussion in 'Black Hat SEO Tools' started by tixpf, Apr 26, 2014.

  1. tixpf

    tixpf Regular Member

    Joined:
    Dec 1, 2013
    Messages:
    295
    Likes Received:
    114
    Hi there,
    I bought GSA SER, CB and Indexer a couple of days ago and so far I'm satisfied. The last few days I let SER spent most of its time scraping relevant URLs for T1. Now I'm almost done and was reading on how to properly scrape for 'spammy' URLs to boost my tiers and the more I read the more I got the impression that SER's built in scraper is complete shit (or at least a very, very light version of a scraper). A friend of mine who I think knows a thing or two about SEO and automated link building told me the same.

    I couldn't really find a thread or discussion regarding this very topic (SER scraper vs. Scrapebox) which only makes me believe more that they're not even remotely comparable, because Scrapebox is way ahead.
    So I'd like to know from you guys if I should invest in Scrapebox, too. As of right now, I could afford it, but I'd rather not spend money on even more software, if it's not absolutely neccessary.

    Bonus question:
    I'm using SER scraper with public proxies, which is working just fine for me and I only use 10-20 proxies for submissions. If I would actually buy Scrapebox (or any other scraping tool) would I have to drastically increase the amount of proxies I use and do the proxies needed for Scrapebox/GScraper differ from the ones I'm using with SER (I use buyproxies.org semi-dedicated proxies)?


    Thanks in advance.
     
  2. tixpf

    tixpf Regular Member

    Joined:
    Dec 1, 2013
    Messages:
    295
    Likes Received:
    114
    Nevermind. Took me a lot of research, but I was able to figure out most of the answers myself.
     
  3. JasonS

    JasonS Jr. VIP Jr. VIP

    Joined:
    Sep 15, 2012
    Messages:
    2,989
    Likes Received:
    912
    Home Page:
    For Scraping: Gscraper

    Do not use private proxies for scraping, they'll get banned in no time by Google. So always public proxies for link harvesting/scraping.
     
  4. youmakeme

    youmakeme Power Member

    Joined:
    Dec 3, 2013
    Messages:
    605
    Likes Received:
    316
    I fully agree with Jason. GScraper is the best scrapping tool with their AA list subscription
     
  5. tixpf

    tixpf Regular Member

    Joined:
    Dec 1, 2013
    Messages:
    295
    Likes Received:
    114
    I kinda figured. Where do I get anonymous private proxies from? The proxy subscription from GScraper seems to be complete garbage.
     
  6. tixpf

    tixpf Regular Member

    Joined:
    Dec 1, 2013
    Messages:
    295
    Likes Received:
    114
    Could we fire this thread up again?
    I'm still fuzzy on the whole scraping thing..

    Right now I don't know if I need a scraping tool at all (small/medium sized projects), what software needs what proxies and how many, etc. It'd be great if you guys could elaborate a little. Give me details.
     
  7. jinsandy

    jinsandy Regular Member

    Joined:
    Jun 17, 2008
    Messages:
    374
    Likes Received:
    81
    to add to this question: what exactly are we scraping? content, links, emails?
     
  8. tixpf

    tixpf Regular Member

    Joined:
    Dec 1, 2013
    Messages:
    295
    Likes Received:
    114
    My question is about scraping URLs.

    edit: I let SER scrape over night using my private proxies. The result is quite astonishing to say the least. I already scraped ~20k URLs, which took me 3 days using public proxies. Unfortunately, 2 of them got banned, but this isn't neccessarily due to the scraping, because 2 of those proxies have already been BLed by Google when I initially checked them after my purchase.

    Seriously, I have no idea how to approach this. Scrapebox, GScraper, public, private proxies, how many, etc., etc.
     
    Last edited: May 2, 2014
  9. keval007

    keval007 Junior Member

    Joined:
    Jun 12, 2012
    Messages:
    145
    Likes Received:
    26
    Occupation:
    Web Scraper & PHP Developer
    I uses Scrapebox with Provate proxies and it working quite good for me. May be you need to try different footprints.
     
  10. ZennoBlaster

    ZennoBlaster Senior Member

    Joined:
    Jan 17, 2014
    Messages:
    1,025
    Likes Received:
    306
    No issues with SB and private proxies here. There are some footprints that you need to use carefully, for example: allintitle. Also, scrape Yahoo, Bing! and others if you experience a soft ban from G.
     
  11. tixpf

    tixpf Regular Member

    Joined:
    Dec 1, 2013
    Messages:
    295
    Likes Received:
    114
    Seriously guys, if you don't feel like actually reading what I've asked don't bother commenting. Especially not if it's one line of very vague information anyone could've come up with.

    If you want to be of any help then tell me:
    - Which Scraper: SER built in / Scrapebox / Gscraper?
    - Why that particular scraper? Pros / Cons?
    - Important features about the scraper (heavy proxy burning, faster/slower than the others, more/less results)
    - etc.

    I need a comparison between those scraping tools and which tool should be used for what purpose. I only do light-medium scraping and I have 30 private proxies.