1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.
  2. Hey Guest Last month we upgraded BlackHatWorld.com to a new platform - . If you notice anything that requires attention please start a new thread here.
    Dismiss Notice

Content gen theory

Discussion in 'Cloaking and Content Generators' started by after_shox, Jan 6, 2007.

  1. after_shox

    after_shox Newbie

    Joined:
    Jan 3, 2007
    Messages:
    23
    Likes Received:
    3
    hi, im looking to get started in computer generated content creation. I downloaded rssgm and YACG but is there a program out there that will make content then just export it to a csv file?

    Ie i allready know how to program so i dont need anything that actually makes the pages as once i have it in the database i can do that myself. Then i can make my own 100% orginal pages.

    Also im still looking for a bit more theory on how computer generated content is made. Ie i know from what i know so far you can scrape other sites in a simalar niche as your own and then use markov etc to re-arange some words so that you dont get flagged as copyright. Im a right in thinking this is the main method? I havnt got my hands on any scrapers yet but im assuming they are quite complex ie they gotta decipher between text/content and the site design/nav on the site scraped. Any feedback appreciated. P
     
  2. Diamond Damien

    Diamond Damien Owner BlackHatWorld Staff Member

    Joined:
    Oct 27, 2005
    Messages:
    55,516
    Likes Received:
    12,122
    Home Page:
    There are some article re-writer programs or content twisters which will twist around the content say in a given article that you already have and spit it out into a text document. You can specify what % change you would like to change it to from say 10% change to over 100% change. I know this is different than scrapping programs which scrape sites but I was just giving you another idea.

    You are right in assuming scrapping related sites within a similar niche. Also some people use wikipedia which is a gold mine of free info...another good place for free content is public domain works.
     
  3. after_shox

    after_shox Newbie

    Joined:
    Jan 3, 2007
    Messages:
    23
    Likes Received:
    3
    Thanks, my inital feeling is that the more i can do myself instead of buying off the shelf tools the better. Ie i was looking at SEC but its $159, i know if i know the process behind it i will be able to create something cheaper and better for myself. P
     
  4. jcooper66

    jcooper66 Newbie

    Joined:
    Mar 15, 2007
    Messages:
    20
    Likes Received:
    0
    Dave can you give names and or links to these programs. I am interested. What is SEC ?
     
  5. instantleads

    instantleads Newbie

    Joined:
    Feb 5, 2007
    Messages:
    20
    Likes Received:
    3
    Occupation:
    Software developer, part time hacker
    Location:
    Earth
    Home Page:
    PM me for a copy of the basic Markov engine. It will edit any text to make it unique and readable, and yes it then just dumps it into a text file for you to place where you like.