Scraping Tool - Must Have?

tixpf

Regular Member
Joined
Dec 1, 2013
Messages
295
Reaction score
115
Hi there,
I bought GSA SER, CB and Indexer a couple of days ago and so far I'm satisfied. The last few days I let SER spent most of its time scraping relevant URLs for T1. Now I'm almost done and was reading on how to properly scrape for 'spammy' URLs to boost my tiers and the more I read the more I got the impression that SER's built in scraper is complete shit (or at least a very, very light version of a scraper). A friend of mine who I think knows a thing or two about SEO and automated link building told me the same.

I couldn't really find a thread or discussion regarding this very topic (SER scraper vs. Scrapebox) which only makes me believe more that they're not even remotely comparable, because Scrapebox is way ahead.
So I'd like to know from you guys if I should invest in Scrapebox, too. As of right now, I could afford it, but I'd rather not spend money on even more software, if it's not absolutely neccessary.

Bonus question:
I'm using SER scraper with public proxies, which is working just fine for me and I only use 10-20 proxies for submissions. If I would actually buy Scrapebox (or any other scraping tool) would I have to drastically increase the amount of proxies I use and do the proxies needed for Scrapebox/GScraper differ from the ones I'm using with SER (I use buyproxies.org semi-dedicated proxies)?


Thanks in advance.
 
Nevermind. Took me a lot of research, but I was able to figure out most of the answers myself.
 
I fully agree with Jason. GScraper is the best scrapping tool with their AA list subscription
 
For Scraping: Gscraper

Do not use private proxies for scraping, they'll get banned in no time by Google. So always public proxies for link harvesting/scraping.

I kinda figured. Where do I get anonymous private proxies from? The proxy subscription from GScraper seems to be complete garbage.
 
Could we fire this thread up again?
I'm still fuzzy on the whole scraping thing..

Right now I don't know if I need a scraping tool at all (small/medium sized projects), what software needs what proxies and how many, etc. It'd be great if you guys could elaborate a little. Give me details.
 
to add to this question: what exactly are we scraping? content, links, emails?
 
to add to this question: what exactly are we scraping? content, links, emails?

My question is about scraping URLs.

edit: I let SER scrape over night using my private proxies. The result is quite astonishing to say the least. I already scraped ~20k URLs, which took me 3 days using public proxies. Unfortunately, 2 of them got banned, but this isn't neccessarily due to the scraping, because 2 of those proxies have already been BLed by Google when I initially checked them after my purchase.

Seriously, I have no idea how to approach this. Scrapebox, GScraper, public, private proxies, how many, etc., etc.
 
Last edited:
I uses Scrapebox with Provate proxies and it working quite good for me. May be you need to try different footprints.
 
I uses Scrapebox with Provate proxies and it working quite good for me. May be you need to try different footprints.

No issues with SB and private proxies here. There are some footprints that you need to use carefully, for example: allintitle. Also, scrape Yahoo, Bing! and others if you experience a soft ban from G.
 
I uses Scrapebox with Provate proxies and it working quite good for me. May be you need to try different footprints.

No issues with SB and private proxies here. There are some footprints that you need to use carefully, for example: allintitle. Also, scrape Yahoo, Bing! and others if you experience a soft ban from G.

Seriously guys, if you don't feel like actually reading what I've asked don't bother commenting. Especially not if it's one line of very vague information anyone could've come up with.

If you want to be of any help then tell me:
- Which Scraper: SER built in / Scrapebox / Gscraper?
- Why that particular scraper? Pros / Cons?
- Important features about the scraper (heavy proxy burning, faster/slower than the others, more/less results)
- etc.

I need a comparison between those scraping tools and which tool should be used for what purpose. I only do light-medium scraping and I have 30 private proxies.
 
Back
Top