1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Is this good what I'm doing? [Scrapebox harvesting]

Discussion in 'Black Hat SEO Tools' started by Sedoc94, May 25, 2012.

  1. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    I just bought SB 2 days ago and what I am currently doing is:

    - Scrape about 200 keywords. (should I do more?)
    - Harvest all the Wordpress URLs from those keywords.
    - Trim all harvested URLs to root, and remove duplicates.
    - Check PR of the domains, and delete everthing from N/A to PR3.
    - Harvest all Wordpress URLs from these domains.
    - Check URL PR, and delete everything from N/A to PR3.
    - Check for outbound links, and delete every URL with over 40.
    - Post comments on all the URLs that are left.
    - Do a linkcheck on all succeeded posts, and save the all the succeeded linkchecks. (to make an auto approve list)

    Or should I just first comment on only 1 blog of each domain, and after I checked if they're auto approve, harvest all the blogs from the auto approved blogs from the domains I posted on?

    Scrapebox is new to me. I don't even know if what I do is good or wrong, but is this what I'm doing going into the right direction?

    Please any help / tips are very much welcome!

    Thanks a lot!
     
    Last edited: May 25, 2012
  2. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    It's better if you only post once to a domain at a time, but if you're just looking for autoapprove, then that would work. It's just you're sending like 20 comments to each domain at times and the admin may get pissed.
     
  3. Scritty

    Scritty Elite Member Premium Member

    Joined:
    May 1, 2010
    Messages:
    2,807
    Likes Received:
    4,496
    Occupation:
    Affiliate Marketer
    Location:
    UK
    Home Page:
    More keywords is good (think thousands)
    I wouldn't trim to root.
    Go through blog checker and click to shuffle the list once or twice then check all and remove the crud
    Harvest more than WP. SB can post to 6 or 7 blogs (Moveable Type, Blogengine as well as a couple of others you need to find your own footprionts for, Joomla etc)
    I would not use scrapebox for posting - I would buy Scrapejet and put the lists you harvest in there. It's much more stable (it's not browser based)
    Good work checking OBL - that's best practice
    Keep your auto approve list to yourself (unless you want your OBL check of 40 to be meaningless in about 3 days as they all get spammed to buggery)

    ADD

    Rotate keywords (use Googles keyword tool to give you a list of 20-30 keywords relavent to the main word you want to rank for and turn that into one big spin).
    Make a GOOD custom comment with tens of thousands of variations that are on topic

    Scritty
     
  4. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    I have a problem though. If I have a list of thousands of keywords, how would I let it harvest links untill it's complete?

    Most of the time I harvest some public proxies, and most of them are slow. So I harvest about 200 good (slow) working proxies, but if harvesting returns hundreds of thousands results the public proxies are more than likely going to die mid-run. Which means the results are not optimal cuz it's not all I could get from those keywords.

    How do you harvest so many results with proxies that actually work through the whole process till complete?
    I don't think about using my private proxies because I have only 10 and they will all be blocked by the search engines quickly, so thats a waste.
     
  5. Scritty

    Scritty Elite Member Premium Member

    Joined:
    May 1, 2010
    Messages:
    2,807
    Likes Received:
    4,496
    Occupation:
    Affiliate Marketer
    Location:
    UK
    Home Page:
    Idea would be to harvest with Yahoo rather than google and use 1 thread only. Yes - it will take all night to go through 3000 keywords, but I often get 300,000 URL's per thread per 24 hours. THis is where a VPS comes in handy. TUrn it on and then go and do something else for a day.

    Yahoo 999 errors are not as bad as Google bans and Yahoo these days returns over 90% of the results Google does with only about 50% of the risk.
    There are no guarantees here - proxies can go at any time, but worth a go.

    Scritty
     
  6. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    I've decided to single-threaded harvest (Wordpress) from Google, because Yahoo blacklists my proxies even with only 1 connection.
    For the harvest I'm using 1781 keywords and 10 of my private proxies.

    It's doing this already for an hour and already got over 100k results!

    When it completes harvesting, what do you suggest me to do?

    "Go through blog checker and click to shuffle the list once or twice then check all and remove the crud."
    Do you mean the DóFollow/NoFollow Check (addon)? I should only keep the DóFollows?
     
    Last edited: May 25, 2012
  7. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    Definitely harvest the serps with 1 connection only spread over as many private proxies as you can. I've noticed bing works pretty well. I can run 5 connections and get through large lists (they are the same results as Yahoo).

    Also, you can comment scrape to find relevant comments to use when working with large lists.
     
  8. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    Ah thanks for the tip. But the thing for me with Bing and AOL is that they don't even work for me.. -.-

    I added 1 API to Bing, and it's still not working. Always 0 results, usually ends with an error.

    AOL ends with 0 results and an error aswell, I have no idea why they're not working.
     
  9. ProAffiliate01

    ProAffiliate01 Junior Member

    Joined:
    Feb 27, 2012
    Messages:
    147
    Likes Received:
    10
    Occupation:
    Web specialist
    Location:
    Denmark
    I think this is good advice. I would say you are doing fine but do not make too many comments on one blog. You might want to stick to one or two comments per blog for now.
     
  10. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    Why is that? ;o
     
  11. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    Don't use an api with bing. Also, AOL bans your ip after 2 queries it seems, so I don't even bother with them anymore.

    Loopline has a tool for scrubbing large urls lists. http://scrapeboxmarketplace.com/free-tools/scrapebox-classroom-domain-cleaner
     
  12. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    Is it really so necessary to only comment a max of 4 blogs per domain?

    I know the moderator might think I'm spammy and such and be more cautious, but aren't there tons of people who just blast to them all and have no mercy? (with good results)

    What do you personally do?

    Btw: Scrapebox been harvesting for about 3 hours now and I got 500k Google results!! ;o
    And it looks like I'm gonna end up with 1,5 million results when it's done..

    I have no idea how I'm going to handle this big ass list ;o
     
    Last edited: May 25, 2012
  13. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    SB once you hit 1mill stop the harvesting. SB won't handle lists longer than 1mill.
     
  14. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    I will let the harvesting run until it completes.

    If I end up with more than 1 million results, I export them and process them into chunks.
     
  15. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    SB won't let you do that. Even if you harvest 2 million, as soon as you click "OK" when completed, it truncates the list to 1mill.
     
  16. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    701
    Likes Received:
    1,713
    Location:
    Guangdong, China
    Home Page:
    This is incorrect. Your harvested URL's are saved inside the 'Harvester_Sessions' folder inside your main Scrapebox folder. Anything over 1 million URLs will be found here. The harvest will be broken up into multiple files of 1 million URLs each (about 75MB files).
     
    • Thanks Thanks x 2
  17. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    doh, ignore me.
     
  18. jstorm

    jstorm Registered Member

    Joined:
    Apr 25, 2012
    Messages:
    76
    Likes Received:
    13
    Occupation:
    Web Developer
    Location:
    USA
    Trying to get the hang of using SB, too - appreciate many of the tips/suggestions but am feeling a bit confused, too. :confused: I haven't tried the auto posting at all yet - just harvesting some links with small lists of keywords in hopes of seeing how this all works, but wonder, when you all get these huge lists harvested and cleaned up, then what? mass posting or ... ?? Thanks.
     
  19. utuxia

    utuxia BANNED BANNED

    Joined:
    Feb 14, 2011
    Messages:
    673
    Likes Received:
    111
    I've been using SB for about a month and I highly recommend you play with sb on a dummy domain you don't care about. I've done a lot of bad things and picked up a lot of good tips. The more you use the software the better you get at it. Don't blast your money site until you know what you're doing.
     
  20. Sedoc94

    Sedoc94 Registered Member

    Joined:
    Dec 17, 2009
    Messages:
    83
    Likes Received:
    5
    I'm currently posting to my harvested URLs and the Success/Failed ratio is about 10:7.

    Is this normal? How many do you guys normally get?
    I'm using 65 connections with the fast poster and 10 private proxies.

    Before posting do you run a Malware and Phising filter to filter out the bad sites?
    And do you start Slow commenter after you've done posting on a list of 150k+ domains?
    (I'm looking for AA blogs)

    Also, if you want to have some AA blogs with a nice PR, do you first check the PR. remove all low ones and than you go posting to see if they auto approve? Or in opposite direction?

    Thanks guys for all the answers! This is really helping me out! :)
     
    Last edited: May 26, 2012