Scrapebox doesn't harvest a single URL! :(

Monie

Regular Member
Joined
Mar 22, 2014
Messages
210
Reaction score
42
I recently purchased a copy of Scrapebox, and I've been trying for many days to run a successful harvest. With proxies enabled, it doesn't harvest a single URL. Sometimes it reports the scrape as completed without actually harvesting any URLs (none of the keywords are completed), and sometimes it shows a 403 error.

I use No Hands Proxies to scrape public proxies, and they work really well with GSA SER. I also downloaded the free version of Gscraper to test the proxies and it could harvest URLs at 20-25 URLs per second. So obviously it isn't a problem with proxies.

I have been in contact with Scrapebox support, and they haven't been able to identify the problem yet. They only reply about once every 24 hours to each of my emails, so correspondence with them has been very slow.

Things I've done to try and troubleshoot are: test server connection, adjust maximum number of proxy retries, adjust connections, etc. Nothing works. :(

Help me out here guys!
 
I had the same problem a couple of weeks ago: Scrapebox wouldn't scrape from Google. Private proxies solved the problem for me.
 
Public proxies die fast, and Scrapebox doesn't automatically refresh the list; you have to do it manually.

Get private/shared proxies and you will be fine.
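Since public proxies die quickly, one way to refresh a list by hand is to re-check it right before each harvest and keep only the entries that still answer. Below is a minimal sketch of that idea, not how Scrapebox or No Hands Proxies do it. The names `tcp_alive` and `filter_live` are made up for illustration, and a successful TCP connect only shows the proxy port is open, not that a search engine will accept requests through it.

```python
import socket
from concurrent.futures import ThreadPoolExecutor


def tcp_alive(proxy, timeout=5.0):
    """Return True if a TCP connection to a 'host:port' proxy can be opened."""
    host, _, port = proxy.rpartition(":")
    if not host:
        # No "host:port" shape at all, so treat it as dead.
        return False
    try:
        with socket.create_connection((host, int(port)), timeout=timeout):
            return True
    except (OSError, ValueError):
        # OSError: refused / timed out / unresolvable; ValueError: bad port.
        return False


def filter_live(proxies, check=tcp_alive, max_workers=50):
    """Check proxies concurrently and return the live ones in input order."""
    proxies = list(proxies)
    if not proxies:
        return []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(check, proxies))
    return [p for p, ok in zip(proxies, results) if ok]
```

Running `filter_live` on the exported proxy list every hour or so and re-importing the result would approximate the manual refresh described above.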
 
If you're doing it repeatedly you will need a decent list of proxies. Private proxies usually work best; get a monthly private proxy subscription that gives you new ones each month.
 
You can search and buy fresh & private proxies on this forum for ScrapeBox.
 
But the proxies work GREAT with every other software I use (Gscraper, GSA SER). The proxies pass the proxy test in Scrapebox as well. Even if the proxies were bad it should at least be able to harvest a few URLs. But it doesn't scrape a SINGLE one from the four search engines. All evidence points to the fact that it is a problem specific to Scrapebox (maybe with how it handles public proxies).
 
Don't use Scrapebox if you want to scrape. You can scrape millions with Gscraper in one night. I've been there struggling with Scrapebox in the past just like you. It's great for many things like link extraction, but its scraping abilities just can't be compared with Gscraper or Hrefer. Just my two cents.
 
I have given Scrapebox support access to my VPS to solve the problem. If they cannot solve it, I will probably ask for a refund and get Gscraper instead.
 
One more question about Gscraper: Scrapebox has a one million URL limit right? Does Gscraper have a similar limit?
 
Don't sell your Scrapebox. It's super useful for lots of other stuff. Yeah, Scrapebox does crash beyond 1 million URLs.

No problems with Gscraper. I can scrape 3 million in one night.
 
The 1 million URL limit applies when you import a list into Scrapebox.
For scraping there is no limit, as long as your proxies are live.

I just tested the scraping module and it's working on Google, Yahoo and Bing.
See if you have the updated version. Sometimes the problem is with the simple things.
Try and scrape without proxies. If it works, then the problem is with the proxies and the solution is to get private proxies.
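The with/without-proxies test can be scripted outside Scrapebox too, which also shows whether the 403 comes from the search engine rejecting the proxy or from the proxy being dead. This is a rough sketch using only the Python standard library. `fetch_status` and `diagnose` are names invented here, the target URL is whatever you pass in, and nothing in it reflects how Scrapebox itself tests proxies.

```python
import http.client
import urllib.error
import urllib.request


def fetch_status(url, proxy=None, timeout=10.0):
    """Return the HTTP status for url, optionally via a 'host:port' proxy.

    Returns None when the connection itself fails (dead proxy, timeout).
    """
    if proxy:
        mapping = {"http": "http://" + proxy, "https": "http://" + proxy}
    else:
        mapping = {}  # empty mapping: ignore any system-wide proxy settings
    opener = urllib.request.build_opener(urllib.request.ProxyHandler(mapping))
    req = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    try:
        with opener.open(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code  # e.g. 403 when the request is refused
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        return None


def diagnose(url, proxies, fetch=fetch_status):
    """Fetch url directly and through each proxy, then group the proxies."""
    direct = fetch(url, None)
    via = {p: fetch(url, p) for p in proxies}
    return {
        "direct": direct,
        "ok": [p for p, s in via.items() if s == 200],
        "blocked": [p for p, s in via.items() if s in (403, 429)],
        "dead": [p for p, s in via.items() if s is None],
        "other": [p for p, s in via.items() if s not in (200, 403, 429, None)],
    }
```

If `direct` is 200 and almost every proxy ends up under `blocked`, the proxies are reachable but refused by the site, which would fit the 403 errors reported in the first post.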
 
I've never had Scrapebox crash when scraping. I've scraped millions of URLs in one go.

I think that is your experience with Scrapebox, but it's unfair to generalize and say that Scrapebox crashes beyond 1 million URLs, considering that most of the members here who use Scrapebox have had a positive experience with it.
 
I used to scrape more than 1 million with SB without any problem.

Just check your proxies. You will need to pay for proxies; the free ones you find on forums will only work for 1-2 hours.
 

Oh wow, thanks for the link W130SN, I didn't know about this software; sounds like a steal for 27 bucks!

OP, what I do know is that Gscraper needs a whole bunch of working proxies. Private proxies wouldn't work either; they can burn out too fast.

I'm not sure whether No Hands proxies would be a good solution for massive scraping, but I do know you would need a lot of public proxies, in the thousands. You can find some sellers that supply working public proxies to subscribers on a regular basis in the 'buy proxies' forum.
 
But the proxies scraped from No Hands Proxies work perfectly in both Gscraper and GSA SER. I am getting 14 LPM just using these public proxies. This shows that No Hands Proxies is working perfectly and there is no problem with the proxies.

When I use the same exact proxies with SB however, it doesn't work.

However when I scrape without proxies, it works.

All this leads me to conclude that it is a problem with SB and public proxies.

Successful scrapebox users - are you using public or private proxies?

@DannyZhang I am getting 3000-6000 URLs per minute on Gscraper with these proxies. However since it is the free version it doesn't scrape more than 100,000 URLs.

The problem is with Scrapebox. All the other software works perfectly with No Hands Proxies. I have the latest version of SB, 1.16.4.
 

Heh. Tell me what's that speed compared to scrapebox. :) If you know what you are doing you can even get up to 30k/min average.

Regarding scrapebox not working with proxies, perhaps someone more experienced with sb scraping can help you.
 
My apologies. I never meant to discredit Scrapebox. I just wanted to clarify that the problem comes when you IMPORT more than 1 million URLs. Usually you need to import in order to remove duplicate URLs.

However, Scrapebox has an addon, DupRemove, which works like a charm if you are dealing with large files (think 1 million and above).
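For anyone who wants to dedupe a big harvest file without importing it at all, the same job can be done with a short script that streams the file line by line. This is only a sketch of the general approach and says nothing about how DupRemove works internally. `dedupe_urls` is a made-up name, and it assumes one URL per line and exact-match duplicates.

```python
import hashlib


def dedupe_urls(src_path, dst_path):
    """Copy unique, non-empty lines from src_path to dst_path.

    Keeps the first occurrence of each URL and returns how many were kept.
    Only a 16-byte digest per URL is held in memory, not the URL itself.
    """
    seen = set()
    kept = 0
    with open(src_path, "r", encoding="utf-8", errors="replace") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            url = line.strip()
            if not url:
                continue  # skip blank lines
            digest = hashlib.blake2b(url.encode("utf-8"), digest_size=16).digest()
            if digest in seen:
                continue  # already written once
            seen.add(digest)
            dst.write(url + "\n")
            kept += 1
    return kept
```

Because the file is never loaded whole, memory use grows with the number of unique URLs rather than with the file size.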
 