1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Seriously frustrated with scrapebox

Discussion in 'Black Hat SEO Tools' started by zacatictac, Nov 28, 2013.

  1. zacatictac

    zacatictac Power Member

    Joined:
    May 2, 2010
    Messages:
    597
    Likes Received:
    754
    Occupation:
    SEO
    Location:
    Metaverse
    So I recently bought scrapebox to harvest urls to use with GSA. However scrapebox is practically worthless to me for scraping because I can't seem to ever find any non google banned proxies. I have 10 private proxies but I do not want to use them for the amount of scraping I want to do i(iuse the for posting with GSA). I have been trying for the last 4 hours to get some decent, google passed proxies and getting 1 or 2 from lists of thousands that take forever to test. I even bought a subscription to proxylist.co for their 400 proxy list plan. Almost every single proxy from their lists fails or is non google passable. I have tried searching and reading posts and still cant seem to find anything that works or decent proxy sources. Sorry for the rant, let me get to the point: Can anyone help me out and give me some working tips to get a good amount of proxies for scraping? Or recommend me a reliable supplier of non private proxy lists for a reasonable price? I know proxygo has a great service but I just cant afford that payment right now. Any help would be appreciated!
     
  2. mrankin

    mrankin Jr. VIP Jr. VIP Premium Member

    Joined:
    Oct 17, 2008
    Messages:
    1,215
    Likes Received:
    571
    Location:
    Australia
    Home Page:
    You really have to go with private proxies for Google scraping - Google temp bans proxies after about 15 searches in a couple of minutes. It will depend on how much scraping you're going to do to determine how many proxies you'll need. Mine start at 10 for $9 per month for shared proxies which should be fine for scraping. If you're scraping thousands of search results you're going to want 50 at a minimum.
     
  3. innozemec

    innozemec Jr. VIP Jr. VIP

    Joined:
    Aug 19, 2011
    Messages:
    5,288
    Likes Received:
    1,799
    Location:
    www.Indexification.com
    Home Page:
  4. Scritty

    Scritty Elite Member Premium Member

    Joined:
    May 1, 2010
    Messages:
    2,807
    Likes Received:
    4,496
    Occupation:
    Affiliate Marketer
    Location:
    UK
    Home Page:
    I use 200 private proxies and run 5 threads with them. That's overkill of course - but I can run 24/7
    Scraping is essential for my business, though as I buy so many proxies I get them for a dollar each (they are good - 7 nationalities, USA, Aus, Can, UK, France, Germany and Italian)

    10 should be enough to run one thread slowly.
    Also - don't scrape from just Google. Scrapebox now has about 30 SE's most of the sites you find on the other SE's will also beindexed on Google. 90% plus.
    So spread the risk in terms of proxies and SE's used.

    Scritty
     
  5. Conor

    Conor Jr. VIP Jr. VIP

    Joined:
    Nov 7, 2012
    Messages:
    3,358
    Likes Received:
    5,419
    Gender:
    Male
    Location:
    South Africa
    Home Page:
    Hint: Gscraper scrapes a LOT faster than scrapebox, for me at least. And using $5 public proxies from thebigproxylist.com (They have a discount BST here), you should be good to go.

    There's even a free version of Gscraper on the official site: http://www.gscraper.com/
     
  6. miedy

    miedy Senior Member

    Joined:
    May 17, 2012
    Messages:
    1,007
    Likes Received:
    463
    edit - same here -
     
  7. alternatesword

    alternatesword Jr. VIP Jr. VIP

    Joined:
    Aug 25, 2012
    Messages:
    2,319
    Likes Received:
    483
    Location:
    scabbard
    Home Page:
    No need to check google passed, just keep anonymous proxies which are medium/fast in speed and then try scraping. Some proxies failed in proxy check will also work for scraping google.
     
    • Thanks Thanks x 1
  8. Scritty

    Scritty Elite Member Premium Member

    Joined:
    May 1, 2010
    Messages:
    2,807
    Likes Received:
    4,496
    Occupation:
    Affiliate Marketer
    Location:
    UK
    Home Page:
    Not found that to be the case at all at least not in week long scrapes
    The opposite in fact. Found Gscrapper very much slower than scrapebox.

    For 8 figure scraping sessions time is not the issue. Proxy burn out is.
    I've found that, for example (hypothetically) say with a VPS or server, it's better to have one thread making one call to Google every 5 minutes just left on overnight than burning out after making 25 requests in 20 minutes and having a 4 hour Google ban, and then subsequently getting banned even quicker the next time as the proxy is then on Google's black list and will burn out ofter 10 requests the next time you use it in a tool.
    It does depend on how much you want to do, but when you are looking to scrape say 50 million URL's a week, keeping proxies alive and running 24/7 has a slight priority over scraping just a few tens of thousands of URL's in 30 minutes but then having your proxies banned and subsequently black listed.

    That's one reason I don't buy from proxy companies that offer "fresh" proxies every month. No such thing as a "fresh" proxy - it's just a proxy someone else has likely been abusing for weeks before you get to have your turn with it. Rather keep the same 200 and know exactly how to use them so they never get a ban - and they never do.

    Scritty
     
    • Thanks Thanks x 2
  9. Mr_Cool

    Mr_Cool Junior Member

    Joined:
    Aug 17, 2012
    Messages:
    110
    Likes Received:
    39
    Scritty, I like the cut of your jib :cool:
     
  10. Nightly

    Nightly Regular Member

    Joined:
    Oct 18, 2013
    Messages:
    292
    Likes Received:
    79
    Sounds to me you are not frustrated with Scrapebox, but frustrated with yourself for not being equipped with necessary tools.

    With private proxies, I go for a 1:15-1:20 ratio when using Google, and may even put a delay If I'm feeling "sketched out". I go a bit less for other SE's because If I can beat Google, chances are I can beat everyone else. Get around 50 private proxies and go from there (should be less than $100/m). Where do you get your proxies from Scritty? I pay for 500 Private Proxies a month at $1.59 each (all USA). Getting them at a buck each would save me an easy couple hundred a month.
     
  11. extremeboy

    extremeboy Jr. VIP Jr. VIP

    Joined:
    Jul 8, 2010
    Messages:
    2,992
    Likes Received:
    647
    Occupation:
    World Best RANK Tracker SERPCloud.com
    Home Page:
    Get dedicated private proxies couple hundreds and use the same ones as you know how you were using previously are good;)
     
  12. Skullator

    Skullator Junior Member

    Joined:
    Jul 21, 2013
    Messages:
    105
    Likes Received:
    76
    Yes as Scritty and other have mentioned.. it is absolutely worth your money to go ahead and invest in at least 10 private proxys for all your scraping needs..
    This way you can pull down some good sized lists and thanks to proxy rotation you won't catch too many temp bans.
    Also public proxy's are pretty terrible for speed etc. The sooner you get them out of your life the better, just my 2c. Cheers.
     
  13. xenergy81

    xenergy81 Junior Member

    Joined:
    Jul 6, 2009
    Messages:
    105
    Likes Received:
    6
    Occupation:
    Full time Online Marketer & Software Engineer
    Location:
    IPv4
    I got the same issues as you in the first place, but then I tried gscraper and use their additional service for unlimitted proxy service. I'm able to get thousands footprint from google in a very short time, and it's very stable as well.
    I own scrapebox as well, but I'm now hardly look back since using gscraper and its additional service.