ScrapeBox the ULTIMATE SERP Scraper & Auto Blog Commenter with PRStorm Mode

Status
Not open for further replies.
Hi,

I tried this URL to purchase Scrapebox: http://www.scrapebox.com/bhw. There is a MasterCard option there, but my country is not listed. My MasterCard is issued by Payoneer, so should I select USA as the country, since I received the card from the USA?

Looking forward to hearing from you.

Regards,
AdorableSeo

Honestly I don't know. I mean you could try it; it will either work or it won't. If it charges the money then you are set. If not, then you would want to find someone that has a PayPal account. Then they can buy it for you and Scrapebox support can change the license over into your name.

Can we ping the comments that we post via the comment poster? Is that what the ping function is for?

You could use RSS ping, I have a video on it, but the regular ping mode is only for driving traffic; it will do nothing for your links. Also you can simply build some backlinks, like blog comments etc., to your links and that will help index them.

https://www.youtube.com/watch?v=z3SzAFBEKew

Alright... not sure if this is a Scrapebox issue, but have tried everything I can think of (new/different Moz accounts, with/without proxies, restarted scrapebox, opened other instances, etc) but all the Page Authority checker addon has been giving me these last 3 days is: External links = 0. See screenshot.
View attachment 79552

It has always worked flawlessly for me in the past, but these last 3 days hasn't worked no matter what I did. Any idea what's going on?

I should also note: When I manually paste these urls in Moz search bar, they all work fine.

I posted in your other thread, but here is a copy and paste from Scrapebox support about it.

"Moz have added https://www.incapsula.com to their server which is a firewall and it's blocking legitimate requests from some users. This was done after they lowered their limit from 1 million to just 25,000 lookups. I would suggest emailing them to see if your IP or API key can be whitelisted and not be blocked by their firewall."


I have been using Scrapebox for some time and it is doing good work so far. The only problem I am having is that it is damn slow when it comes to email grabbing. No matter which options I choose or which sites I try, it is even slower than all the other email scrapers I have used. I am running it on a VPS so it cannot be a bandwidth issue. Is there a way to increase its speed?
It works quite fast for me. Are you using the grab emails by crawling sites function or the grab mails from harvested url list?

How many connections are you running? Can you give more details about your setup? I would need to see as much detail as possible in order to shed some light, as speed optimization is probably an issue with your setup.

Are you using proxies? Does it go faster without proxies?
 
I have tried both options of email grabbing/crawling. The speed is more or less the same with both options. I had set email grabbing to 10 connections, but then I increased it to 500 connections, to no effect at all. It scraped 3000 emails in 3 days, which was a shock for me.

When you say it scraped 3,000 emails in 3 days, it's likely that most of your URLs are just completely random and don't have any emails on them. Or if you are crawling one site with 500 connections, you could be blocked from accessing the site, so you aren't getting any emails because of that.

You need to ensure your URLs are targeted and have emails published on the pages. I just ran a quick test and got a few hundred emails in a couple of minutes, so there doesn't seem to be anything wrong with the feature.
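
For anyone who wants to check whether their URLs actually publish emails before blaming the grabber, here is a minimal Python sketch. It is not how Scrapebox does it internally. The regex is deliberately simplified, and `extract_emails`, `pages_with_emails` and the `{url: page_text}` input are hypothetical names made up for this illustration.

```python
import re

# Simplified pattern for illustration only; real address formats are broader.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(page_text):
    """Return the unique email addresses in page_text, lowercased, in first-seen order."""
    seen = []
    for match in EMAIL_RE.findall(page_text):
        email = match.lower()
        if email not in seen:
            seen.append(email)
    return seen


def pages_with_emails(pages):
    """Given a hypothetical {url: page_text} dict, keep only URLs that publish an email."""
    found = {}
    for url, text in pages.items():
        emails = extract_emails(text)
        if emails:
            found[url] = emails
    return found
```

If most of a list comes back empty from a check like this, the list is the bottleneck rather than the connection count.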
 
I'm getting this issue with vanity name checker where it adds the web 2.0 domain name onto the end of the urls I import. Not sure what I'm doing wrong or if its the software? I can't remember it doing this last time I used the feature which was a few months ago now.

bhwsb.png
 
When you say it scraped 3,000 emails in 3 days, it's likely that most of your URLs are just completely random and don't have any emails on them. Or if you are crawling one site with 500 connections, you could be blocked from accessing the site, so you aren't getting any emails because of that.

You need to ensure your URLs are targeted and have emails published on the pages. I just ran a quick test and got a few hundred emails in a couple of minutes, so there doesn't seem to be anything wrong with the feature.

I have tested craigslist links as well but still the speed is a problem and sometimes it does not scrape emails from craigslist pages at all.
 
Lately (as in, the past few days) I've been getting nothing but 503 errors when trying to scrape from Google. Anyone happen to know what would cause this and what I can do to fix it? I thought it might be something to do with the proxies, but here are a few things:

- All of my proxies (private ones of course) test out just fine in the proxy tester
- I can use any of my proxies within a browser to do a Google search no problem (no captchas pop up or anything)
- Scraping with an old version of SB (1.16.6) works just fine with my proxies as well

I'm not sure what to make of it.

Update: When I try to scrape without proxies I get a 200 (it connects) but then the connection times out.
Update #2: Doing a Google search in a browser with my proxies IS giving me a captcha now. So I guess that's the problem. Also I can scrape without proxies no problem when I tether my phone, so the connection timing out must be a network adapter/router issue.
Update #3: I reset my router to factory settings and now I can scrape without proxies, so that's good. Proxies are still cucked though. I am curious now though: is 503 the error everyone gets whenever they're getting captchas for Google searches? I've done a lot of searching on this subject but no one else seems to have made that connection.
 
@Sweetfunny
Alright... not sure if this is a Scrapebox issue, but have tried everything I can think of (new/different Moz accounts, with/without proxies, restarted scrapebox, opened other instances, etc) but all the Page Authority checker addon has been giving me these last 3 days is: External links = 0. See screenshot.
View attachment 79552

It has always worked flawlessly for me in the past, but these last 3 days hasn't worked no matter what I did. Any idea what's going on?

I should also note: When I manually paste these urls in Moz search bar, they all work fine.

So I opened a support ticket with Moz about getting my account and IP whitelisted (because of above explained issue), and today Ian from Moz got back to me, saying:
Your API credentials look all good on my end and I just ran a successful query with them. We do not Whitelist Keys or IPs, sorry. If you are using a 3rd party tool I recommend reaching out to them for support with their tool, or if it is Scrapebox have them reach out to me directly so we can get this fixed for everyone.

Ian
IanW [at] ***.***
 
Please Note: @Sweetfunny


Alright... not sure if this is a Scrapebox issue, but have tried everything I can think of (new/different Moz accounts, with/without proxies, restarted scrapebox, opened other instances, etc) but all the Page Authority checker addon has been giving me these last 3 days is:

Having exactly the same issue here with the Page Authority addon. Set up a new Moz account etc. and it's still not working??
 
Having the same issue with the Page Authority addon as others (again).
I contacted Moz and this is their reply:

On Tue, May 3, 2016 at 11:03 PM, Ian from Moz wrote:
This sounds like it is an issue with your 3rd party tool. We have a lot of people write in with issues relating to ScrapeBox. Since SB uses proxies and allows for multiple API keys and rotates them (against our Terms of Use), it is very prone to being blocked by our backend service. If you are having troubles I recommend reaching out to them and having them contact us to get their tool in line with our TOUs.

--
Ian from Moz

 
Trying to build a custom scraper for images from user twitter accounts, but I can't really wrap my head around it despite having reviewed the custom scraper tool tutorial excessively. It seems it's still over my head, although I have had success on some other sites.

Any thoughts?

I also want to mention that SB is single-handedly the best investment I ever made in online endeavors.
 
Scrapebox already has an image scraper built in that can scrape from twitter, and the custom data grabber is only going to save off the image url, not the image.

here is a vid on the image scraper
https://www.youtube.com/watch?v=zicEH7ohseU

So for everyone here the issue is definitely moz and not scrapebox. The reason is you can simulate the exact same request, byte for byte, in a browser, in programs like netpeak, in scrapebox etc., and all of them get randomly blocked. I've also talked with other people that are using other tools and having the same issue.

Further, moz has no way of knowing if a request is from scrapebox or a browser; there is no footprint, so they can't block scrapebox specifically. Scrapebox does offer proxies as noted, but other tools that don't, and that are 100% in line with moz's TOU, are also being blocked. Moz is most likely just trying to weed out free traffic.

Also some users have hit me up and I told them to contact moz, and they say they talked with someone at moz that whitelisted their IP and then all was well, while other users get a response from moz that moz doesn't whitelist IPs. There seems to be a "grab an excuse from the bucket" game going on at moz support, where they pick an excuse and hand it to you hoping you will go away. That's my personal opinion anyway.

I would just keep at them and let them know many tools aren't working, tools in line with their TOU etc... you don't even need to tell them its scrapebox. Hopefully they will dial back the firewall like before.



At the end of the day moz is running a business, and while they are giving away free access to their stuff, I'm sure if they could stop giving away free access and still make just as much money, they would happily do that. I can't fault them for wanting to minimize free traffic and setting low limits. However, they are indeed blocking all sorts of legitimate requests, and it has nothing to do with their TOU; it's to do with saving money on resources. Again, my opinion.

Lately (as in, the past few days) I've been getting nothing but 503 errors when trying to scrape from Google. Anyone happen to know what would cause this and what I can do to fix it? I thought it might be something to do with the proxies, but here are a few things:

- All of my proxies (private ones of course) test out just fine in the proxy tester
- I can use any of my proxies within a browser to do a Google search no problem (no captchas pop up or anything)
- Scraping with an old version of SB (1.16.6) works just fine with my proxies as well

I'm not sure what to make of it.

Update: When I try to scrape without proxies I get a 200 (it connects) but then the connection times out.
Update #2: Doing a Google search in a browser with my proxies IS giving me a captcha now. So I guess that's the problem. Also I can scrape without proxies no problem when I tether my phone, so the connection timing out must be a network adapter/router issue.
Update #3: I reset my router to factory settings and now I can scrape without proxies, so that's good. Proxies are still cucked though. I am curious now though: is 503 the error everyone gets whenever they're getting captchas for Google searches? I've done a lot of searching on this subject but no one else seems to have made that connection.

When you reset your router your ISP probably assigned you a new IP, which is why it works (I'm guessing), but a 503 or 302 from Google means your proxies are blocked. If you wait 24-48 hours your proxies will probably be unblocked; if not, then you probably need new proxies.

I have a helpful video on this here:
https://www.youtube.com/watch?v=GadX5AXiW34
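
As a rough illustration of that rule of thumb, here is a small Python sketch that sorts test results by status code. It only encodes the forum advice above (503 or 302 from Google while scraping is treated as blocked) and is not an official Google or Scrapebox rule. The function names and the `{proxy: status_code}` input are hypothetical and would come from your own test run.

```python
def classify_google_response(status_code):
    """Classify a search response using the 503/302 rule of thumb from this thread."""
    if status_code == 200:
        return "ok"
    if status_code in (302, 503):
        # Assumption from the post above: these codes mean the IP is being challenged or blocked.
        return "blocked"
    return "other"


def blocked_proxies(results):
    """results is a hypothetical {proxy: status_code} dict; return the blocked proxies, sorted."""
    return sorted(
        proxy
        for proxy, code in results.items()
        if classify_google_response(code) == "blocked"
    )
```

Anything that lands in the blocked list would be a candidate for resting 24-48 hours before retesting.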

I have tested craigslist links as well but still the speed is a problem and sometimes it does not scrape emails from craigslist pages at all.

Craigslist isn't cut and dried; you need to generate the email pages. I have a video:
https://www.youtube.com/watch?v=3gjewPDf4rM


I'm getting this issue with vanity name checker where it adds the web 2.0 domain name onto the end of the urls I import. Not sure what I'm doing wrong or if its the software? I can't remember it doing this last time I used the feature which was a few months ago now.

View attachment 79618

Those aren't urls you entered. A url needs a protocol such as http:// in front. So it needs to be http://domain.com, not domain.com

When you enter this in a browser it assumes you want http and does that for you, but the vanity checker needs you to have that so it can auto distinguish between a keyword and a url.
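
If you have a big list of bare domains, a few lines of Python can add the protocol before you import it. This is just a sketch that runs outside Scrapebox. `ensure_scheme` and `prepare_url_list` are hypothetical helper names, and it assumes every line in the list is meant to be a url, since it would put http:// in front of keywords too.

```python
def ensure_scheme(entry, default_scheme="http"):
    """Prepend a protocol to entry unless it already has one."""
    entry = entry.strip()
    if "://" in entry:
        return entry
    return f"{default_scheme}://{entry}"


def prepare_url_list(lines):
    """Apply ensure_scheme to every non-empty line of a list that should contain only urls."""
    return [ensure_scheme(line) for line in lines if line.strip()]
```

For example, `prepare_url_list(["domain.com", "http://domain.com"])` leaves the second entry alone and fixes the first.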
 
... So for everyone here the issue is definitely moz and not scrapebox. The reason is you can simulate the exact same request, byte for byte in a browser, in programs like netpeak, in scrapebox etc... and all of them get randomly blocked. Ive also talked with other people that are using other tools and having the same issue...

I get what you're saying. Are you saying IPs get randomly blocked, or that requests are randomly blocked? I fear it's the IP if anything... as even with 1,000 requests I can't get a single request to show properly.

We definitely need an alternative to Moz now.
 
Those aren't urls you entered. A url needs a protocol such as http:// in front. So it needs to be http://domain.com, not domain.com

When you enter this in a browser it assumes you want http and does that for you, but the vanity checker needs you to have that so it can auto distinguish between a keyword and a url.

Thanks, works perfect now.
 
Hi loopline

I finally got my Scrapebox and want to learn this software from the beginning to an advanced level.

Where can I start learning? I know you have YouTube tutorials.

Would that be enough for everything?
 
Agentk007 told me I should share my post here concerning Moz's problems with SB;

I've signed up just to reply to this, I usually lurk and this has been a bothersome issue. I've been using SB for a while and I've run into this issue recently.

I tried contacting both ScrapeBox and Moz about this. Moz's replies have been irritating. SB support told me to contact Moz, so I did.

In the first reply from Moz, I was told that Moz does not whitelist keys or IPs. And when I furthered my query asking about SB, they told me SB is "breaking their Terms of Use" and that SB doesn't seem to be willing to communicate with Moz to fix the issue. They claim that their API is "working as it should" and the only users having issues are the ones using SB. Then they've gone and suggested a list of tools that works with Moz. Nice way to slide in your business partners. Sounds to me like they're trying to limit browser requests and favoring their partners.

This is quite irritating as neither PageRank (Which likely will never work again) nor Page Authority functions for me recently. Could anybody help me with this?

I'll keep you guys updated on the situation.
 
Hey I was wondering what the differences are between vanity name checker and alive check? Correct me if I'm wrong but my current understanding is that they are the same, but VNC searches only specific websites and alive check can search all the websites I put into footprints? Thanks for the help!
 
I get what you're saying. Are you saying IPs get randomly blocked, or that requests are randomly blocked? I fear it's the IP if anything... as even with 1,000 requests I can't get a single request to show properly.

We definitely need an alternative to Moz now.

Well, you can take a given IP and create a request that is byte for byte the same in, say, Scrapebox, Netpeak and a browser, and with the same IP the first request will fail, the 2nd will succeed, the 3rd will fail, etc. All quite random, but it seems moz has lifted the firewall for the moment, as it's now working for everyone I have talked to and working all the time in all tests.

Thanks, works perfect now.

Great. You're welcome.

I am talking about the email pages. I started scraping Craigslist after watching the tutorials.

Can you post a few urls that have mails on them that scrapebox isn't scraping?

Hi loopline

I finally got my Scrapebox and want to learn this software from the beginning to an advanced level.

Where can I start learning? I know you have YouTube tutorials.

Would that be enough for everything?

Yes, the youtube channel will keep you busy for quite some time. I have over 30 hours of video and the majority of it is still relevant today. A few really old videos are no longer relevant, or features have been upgraded and replaced/renamed, but you can start here:
https://www.youtube.com/user/looplinescrapebox/videos

and this playlist is more in order to get started as well
https://www.youtube.com/playlist?list=PLd2GyqDU6SSwBGVROdFI9jS4WPTODl-vB


Agentk007 told me I should share my post here concerning Moz's problems with SB;



I'll keep you guys updated on the situation.

Yes, as noted above it would seem they are trying to limit free traffic and drive paid revenue. I mean I can't blame them there, but their responses are either random or the latest one I saw was like a literal copy and paste response from the 1 person and several people were posting it word for word as their response.

It's not related to TOU. Scrapebox doesn't in and of itself violate any TOU; you could use scrapebox with no proxy and 1 account, or even a paid account, and it still gets blocked, or did. All is working again at the moment.

Hey I was wondering what the differences are between vanity name checker and alive check? Correct me if I'm wrong but my current understanding is that they are the same, but VNC searches only specific websites and alive check can search all the websites I put into footprints? Thanks for the help!

They are very different. The alive checker uses HEAD and GET requests to see if a page responds, i.e. is alive.

The vanity checker can help generate lists of urls for given platforms or work with loaded lists, but it's not necessarily looking for an alive or dead status. It's looking to see if a page meets X criteria, and you can build in and customize those criteria.

So the vanity checker checks whether various criteria are met and gives a qualified answer based on 1 or multiple criteria. The alive checker is just telling you if the page is alive or dead.
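
To make the difference concrete, here is a rough Python sketch of the two kinds of check. It is not Scrapebox's actual implementation. `fetch`, `is_alive`, `meets_criteria` and the `must_contain` / `must_not_contain` parameters are hypothetical names, and treating any 2xx/3xx status as "alive" is an assumption made for this example.

```python
import urllib.error
import urllib.request


def fetch(url, timeout=10):
    """Return (status_code, body_text); status_code is None if the request fails outright."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status, resp.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404 or 503.
        return err.code, ""
    except (urllib.error.URLError, OSError, ValueError):
        # DNS failure, refused connection, timeout, or a malformed url.
        return None, ""


def is_alive(status_code):
    """Alive-style check: only asks whether the page responded successfully."""
    return status_code is not None and 200 <= status_code < 400


def meets_criteria(body_text, must_contain=(), must_not_contain=()):
    """Vanity-style check: asks whether the page content matches customizable criteria."""
    text = body_text.lower()
    has_all = all(s.lower() in text for s in must_contain)
    has_none = not any(s.lower() in text for s in must_not_contain)
    return has_all and has_none
```

A page can be alive and still fail the criteria, which is why the two tools give different answers for the same url.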
 
Hello Sweetfunny and loopline.
I am facing a small issue with the link checker addon. Yesterday I provided a list of 150k urls and started to extract internal links with 200 connections. After a few hours, when I logged back into my VPS, the addon window was closed (the scrapebox main window was still open). I checked the addon data folder and found a partially processed links file. I assume that at some point the link checker addon crashed.
Any pointers on how to fix this?

I know this is not a scrapebox support thread, but I felt like asking here as BHW has been a home for all of us for a long time. :)
 