1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Any interest in this type of content bot?

Discussion in 'Black Hat SEO Tools' started by General Lee, Nov 19, 2012.

  1. General Lee

    General Lee Regular Member

    Joined:
    Jan 26, 2012
    Messages:
    230
    Likes Received:
    1,013
    Location:
    Confederate States
    I've been working on a bot that basically scrapes content from various sources that isn't indexed, be it expired, deindexed or not indexed for various other reasons. The point being is that the content is 100% orginal in the eyes of a search engine.

    For example if I type in "cigars" it'll generate 1000's of 1000's of words of content on this targeted keyword.

    I've actually built several sites here recently using this content and all of my web 2.0s have orginal content as well. The bot still needs work filtering out which content is worth while and which content is crap.
     
  2. imserious

    imserious Senior Member

    Joined:
    Mar 27, 2009
    Messages:
    946
    Likes Received:
    560
    Everyone will be interested but it will work only if you restrict access to a few
     
  3. Riders On The Storm

    Riders On The Storm Senior Member

    Joined:
    Feb 27, 2012
    Messages:
    1,150
    Likes Received:
    489
    interesting. waiting for it.
     
  4. alternatesword

    alternatesword Jr. VIP Jr. VIP

    Joined:
    Aug 25, 2012
    Messages:
    2,323
    Likes Received:
    484
    Location:
    scabbard
    Home Page:
    Some times google doesn't index crap content. If that can be filtered out during the scrape then it is 100% needed one.
     
  5. hpv222

    hpv222 Power Member

    Joined:
    Feb 8, 2010
    Messages:
    736
    Likes Received:
    274
    it would be awesome as long as it scrapes whole posts / articles. If it scrapes random words and sentences, then it would be a bit less awesome ;) I guess, you could always set a filter (minimum number of words) when scraping
     
  6. hpv222

    hpv222 Power Member

    Joined:
    Feb 8, 2010
    Messages:
    736
    Likes Received:
    274
    btw the webarchive site and the co.cc domains might be good sources, but then again, the co.cc domains are pretty much built on already scraped, used, reused, and abused content which is the reason G dropped them
     
  7. .:mAestro:.

    .:mAestro:. Regular Member

    Joined:
    Apr 17, 2010
    Messages:
    384
    Likes Received:
    104
    Occupation:
    Life
    This.
     
  8. losille

    losille Junior Member

    Joined:
    Feb 22, 2011
    Messages:
    109
    Likes Received:
    95
    That would be great for one of my sites. I am trying to collect historical/background information.
     
  9. 9to5destroyer

    9to5destroyer Jr. VIP Jr. VIP Premium Member

    Joined:
    Nov 14, 2011
    Messages:
    355
    Likes Received:
    205
    I think the problem will be (if its the methods im thinking of) is that it will become saturated very quick as people will be using it for website content and tier1 so if you have heavy users in the same niche it could be a problem. The other thing is once you release a public bot the methods will be more in the public eye hense more people will do it. If your going to do it I would keep it private and restrict it to several people or keep it to yourself knowing you will always have unique content
     
    Last edited: Mar 16, 2013
  10. Ambassy

    Ambassy BANNED BANNED

    Joined:
    Apr 13, 2011
    Messages:
    642
    Likes Received:
    163
    IMO a problem would be the fact that lots of the content isn't being indexed because it is of a crappy spun quality.