1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Scrapebox - What type of blogs are best to scrape

Discussion in 'Black Hat SEO Tools' started by younga2, Mar 13, 2011.

  1. younga2

    younga2 Junior Member

    Joined:
    Jun 26, 2010
    Messages:
    115
    Likes Received:
    17
    Location:
    In the money
    I scraped a good few Urls with scrapebox lastnight trying to put together an auto-approved list. I used the custom footprint of "powered by wordpress". After scraping and filtering dups' I started posting. I got just over 2000 successful posts. I ran the link checker to look for auto approve and here are my results:

    Found: 5

    WTF!!

    Im guessing its because they were wordpress blogs so heres my question..

    What blog type is best for auto approve lists?
     
  2. ahiddenman

    ahiddenman Elite Member

    Joined:
    Dec 11, 2010
    Messages:
    2,647
    Likes Received:
    2,087
    Location:
    204.15.23.255
    Yeah some of them may be awaiting moderation or approval.

    I use the .edu footprint and wordpress selected and i usually get a good amount of approved blogs.
     
  3. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    I would say you should wait some week to get them indexed by google or you should use some indexing - pinging tool to index them quicker .
     
  4. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    May be you are not following the "method" to find auto-approved blog pages ;)

    Try these methods..

    http://www.blackhatworld.com/blackhat-seo/black-hat-seo/259001-sb-auto-approving-blogs.html

    http://www.blackhatworld.com/blackh...ow-find-high-pagerank-auto-approve-blogs.html

    http://www.blackhatworld.com/blackh...easily-find-auto-approve-blogs-scrapebox.html

    http://www.blackhatworld.com/blackh...ge-ranking-auto-approve-blogs-commenting.html

    http://www.blackhatworld.com/blackhat-seo/link-building/261054-scrapebox-help.html

    http://www.blackhatworld.com/blackh...apebox-methods-finding-autoapprove-blogs.html

    http://www.blackhatworld.com/blackh...ding-your-competitors-auto-approve-blogs.html


    Regarding best blog platform to scrape-I would say Drupal is the BEST because most of the time you need to register first before posting comments. This makes them less attractive to spammmers. Also links in comments are do-follow by default and more chance to get HIGH PR pages! I really ENJOY harvesting Drupal blogs :)
     
    • Thanks Thanks x 6
  5. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    Chris, I think you misunderstood. OPs concern is finding auto-approved blog pages, not related to G00gle index problem.
     
  6. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    What footprint you are using?
     
  7. uditbhansali

    uditbhansali Regular Member

    Joined:
    Aug 16, 2010
    Messages:
    486
    Likes Received:
    283
    Occupation:
    Ask your mom
    According to me go for blog engine. You will find a lot of auto approved urls + ********.
     
  8. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    -:) Yes , thank you cyberzilla , You are right , I read that phrase bad , I thought he said after 2000 succesfull postings , he found only 5 visible posts = backlinks , but he thought something else - auto approved blogs.

    btw thanks that you mentioned Drupal blog , can you share with us the name of the tool or how are you registrating to this CMS and footprint you are using for finding them ? Everything with scrapebox?
     
    • Thanks Thanks x 1
    Last edited: Mar 13, 2011
  9. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    Some basic footprints for Drupal and other blog platforms are posted here. Really a nice thread.

    Code:
    http://www.netbuilders.org/promoting/using-footprints-find-niche-specific-edu-gov-blog-pages-21864.html
    Do not restrict yourself to those basic footprints. Take a closer look at the comment section, you can find some nice footprints to harvest. There are ways to find drupal blogs where registration is not required to post comments. Use this footprint

    When you combine this footprint with the comment posted date, you can find those auto-approved blogs easily! ;)
     
    • Thanks Thanks x 4
  10. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    You were right , this is very nice thread , thank you !! , I think it definitely worth it if you post that "Footprint" thread here on BHW for the others , really very useful with Pros and Cons , very nice thread !
    Didn't see nothing similar here on BHW till now.
     
  11. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    No! we already have so many threads discussing about footprints for various blogs platforms. Try this in Google --> site:blackhatworld.com "footprints" "wordpress"
     
  12. JenniferMartin

    JenniferMartin Regular Member

    Joined:
    Jan 29, 2011
    Messages:
    391
    Likes Received:
    699

    You need to learn hard to find out auto approve links, harvesting does not means that you have got auto approved lists, i am sure that out of those 2000 links, mostly all of them are "moderated".

    Here is few advise to scrape autoapprove links.

    1) Harvest about 1 million wordpress, blog engine, movable urls with some good list of keywords, specially related to your niche

    2) Now remove the duplicate urls

    3) Split your scraped url list, about 20,000 urls each.

    4) Run Blog Aanalyzer on these splited url lists.

    5) You will get mostly urls with 404 Error, closed for comments and having spam protected sites.

    6) Delete all the "bad blogs", you will get about 5000 - 6000 urls left in each list

    7) Now its the time to "check" each list for auto approve list, best thing is run the fast poster with some spinned comments and linked to already "sandboxed" websites or site that does not exists, say gxrnmontozq.com .

    8) You will get around 40% - 50% sucessful post, but these are not the real figure as mostly they will be moderated sites :)

    9) Save your sucessful post urls as well as your failed urls.

    10) Now check these sucessful as well as failed urls for your sandboxed site url , you will get about 10% real auto approve urls from sucessful list while abount 0.5 - 1% auto approve urls in your failed to post list


    I hope these point will help you in search some good list of auto approve urls.

    Seach BHW for "Ultimate Scrapbox Adavantage" this will help you a lot about having some good foot prints and how to find do follow, high pr sites etc.
     
    • Thanks Thanks x 5
  13. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    I know them all -:) I am extracting every day using footprints for one year , I am using footprints a lot , testing , trying etc . I welcome every new info about footprints because good and very accurate data you can get only using good footprint (best using operators ) .
    For example if I use 2 operator footprints like(site:..... inurl:.....) it gives me no results (proxies refused by engines - it works if I use 1 operator like site:... + "quotes" etc . but the thread you have found is different , they are telling you there , how many results you will get , if you use this footprint , it is really very useful , because you need normally test it , but in that thread he test that for us and posted results with his review Pros and Cons.
     
    Last edited: Mar 13, 2011