1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Scrapebox stopped harvesting.

Discussion in 'Black Hat SEO Tools' started by Ampix0, May 19, 2012.

  1. Ampix0

    Ampix0 Power Member

    Joined:
    Jan 10, 2012
    Messages:
    525
    Likes Received:
    60
    Home Page:
    I have google passed fast proxies. and using the custom footprint "site:.edu intext"leave a reply"" and 16 keywords (I also tried many other edu footprints) and im coming up completely dry. It says 0 bandwidth to anything but yahoo and then comes up empty.

    It just stopped working. idk what happened i had a large list of EDU sites i had posted to earlier and I didnt get on any this time around. I also successfully posted to 300 and got 1

    Edit: Wow. just took it off custom footprint. moved it back to wordpress. nothing still.

    Edit: This seems odd. if I close scrapebox and open it again, I can get about 100 and then it stops working again.
     
    Last edited: May 19, 2012
  2. Ampix0

    Ampix0 Power Member

    Joined:
    Jan 10, 2012
    Messages:
    525
    Likes Received:
    60
    Home Page:
    Picture to aid my assistance lol [​IMG]
     
  3. EXtraHand

    EXtraHand Junior Member

    Joined:
    Jan 26, 2012
    Messages:
    111
    Likes Received:
    62
    You must have more keywords, there's aren't that many EDU Blogs
     
  4. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    701
    Likes Received:
    1,713
    Location:
    Guangdong, China
    Home Page:
    Remove the + symbol in your keyword as those are no longer used. Also, try adding a few hundred keywords instead of just 16. Finally, under settings uncheck 'Use multi-threaded harvester' and see what the results box outputs. You can get a clearer picture of the issue (should for example your proxies be blocked but still showing as unblocked in the proxy manager).
     
    Last edited: May 19, 2012
  5. madoctopus

    madoctopus Supreme Member

    Joined:
    Apr 4, 2010
    Messages:
    1,249
    Likes Received:
    3,498
    Occupation:
    Full time IM
    All replies that involve adding more keywords are crap. No offense but that is not a solution to his problem. From what I can tell SB is shit when it comes to working with queries that contain quotes. Remove the quotes and it will probably work. Yeas I know that it defeats the initial purpose but personally I found no way to make it work with quotes. I have queries like

    Code:
    "some keyword" AROUND "other keyword" +("powered by X" AROUND "some keyword")
    And they scrape no results. SB devs should look into this because being limited to scraping without advanced queries basically takes all the power from you.
     
    Last edited: May 19, 2012
  6. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    701
    Likes Received:
    1,713
    Location:
    Guangdong, China
    Home Page:
    How is suggesting adding additional keywords a crap idea? There are many search queries paired with particular keywords that will return ZERO results. Queries with advanced operators work just fine (for example: "name" "website" "1..100 responses"). I harvest upwards of 50 million URL's/day using footprints that contain keywords in quotes and advanced operators, I fail to see where your problem lies. The footprint you provided is unnecessarily complicated, and limiting in results. Keep in mind Google removed the + search operator nearly half a year ago, you should instead be using the AND operator.
     
    • Thanks Thanks x 1
    Last edited: May 19, 2012
  7. Scritty

    Scritty Elite Member Premium Member

    Joined:
    May 1, 2010
    Messages:
    2,807
    Likes Received:
    4,496
    Occupation:
    Affiliate Marketer
    Location:
    UK
    Home Page:
    The + no longer works (in fact it fecks up the searches)
    As GG says ncheck multithread harvester and see if it will run "on base settings".
    Also with 16 keywords - theres a chance it's found zero blogs with that specific footprint. Think in terms of hundreds or better thousands of keywords.
    Make sure proxies aren't banned (Google bans quicker than ever now) It will ban a proxy in under 10 searches unless they are spread over many minutes. How many do you have? I'm not safe unless I have 10+ proxies per thread (anly 18 months or so ago you could reverse that ratio) 100 proxies - 8 threads is how I roll with Google these days.


    Scritty

    [EDIT] Just seen - 13 proxies - should be fine for one thread. Unless you burnt them out with an earier "test" run or something, or they are public (waste of time - even proxy goblin, proxy multiply elite proxies are often knackered in the time it takes for you to copy and paste them across) . But other than those issues - your proxy ratio looks ok.
     
  8. Ampix0

    Ampix0 Power Member

    Joined:
    Jan 10, 2012
    Messages:
    525
    Likes Received:
    60
    Home Page:
    These are actually some free private ones I had gotten. Thought they are slow. I had some other privates ones but they died :( lol
     
  9. jb2008

    jb2008 Senior Member

    Joined:
    Jul 15, 2010
    Messages:
    1,158
    Likes Received:
    972
    Occupation:
    Scraping, Harvesting in the Corn Fields
    Location:
    On my VPS servers
    1. scrapebox is not as good as hrefer and even with good proxies it often doesn't harvest on G.
    2. stop using your special operator (inurl:) not only will it not work, you will screw the proxies for everyone else

    Find other ways of getting what you want without special operators. Change to hrefer. I am pretty confident it's a combination of your proxies (G softbans lightning fast these days) and SB itself
     
  10. madoctopus

    madoctopus Supreme Member

    Joined:
    Apr 4, 2010
    Messages:
    1,249
    Likes Received:
    3,498
    Occupation:
    Full time IM
    Fact is SB will many times return zero results when a search in browser or with my custom scraper for same keyword will return tens of thousands results. Adding more keywords won't magically make SB scrape what it couldn't scrape in the first place.
     
  11. turoc

    turoc Junior Member

    Joined:
    Oct 28, 2009
    Messages:
    117
    Likes Received:
    33
    test your footprint in Google first and see what it returns. If it returns results in Google then it will return results in SB most of the time. Most of the time poor results in SB are because of a badly constructed footprint or a footprint that generates no results as there are no matches.

    Invest in decent proxies too. You can use private proxies to harvest if you maintin a ratio of 1:5 (one connection for every 5 proxies - in other words if you have 10 private proxies, set your max harvester connections for Google to 2).