How do search engines recognize unique content?

Discussion in 'Blogging' started by fung1990, Mar 25, 2010.

  1. fung1990

    fung1990 Power Member

    Joined:
    Dec 21, 2009
    Messages:
    579
    Likes Received:
    51
    How do search engines recognize unique content?
    I'm sure lots of people want to know about this.
    It would give me an idea for making a "super unique" plugin.

    And you'll get it.
     
  2. al8xandru

    al8xandru Newbie

    Joined:
    Mar 23, 2010
    Messages:
    14
    Likes Received:
    2
    Home Page:
    I am just speculating here, but I think it compares only parts of your content. And as far as I know, even a different page layout and new tags in your content can make some spiders recognize your content as unique...
    That's my 2 cents. Wish you luck with your plugin.
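
To make "compares only parts of your content" concrete, here is a minimal sketch of one well-known way to compare documents piece by piece: word shingles scored with Jaccard similarity. This is an assumption for illustration, not a confirmed description of what any search engine does. The function names and the shingle size `k=3` are made up for the example.

```python
def shingles(text: str, k: int = 3) -> set:
    """Return the set of overlapping k-word windows ("shingles") in text."""
    words = text.lower().split()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str, k: int = 3) -> float:
    """Share of shingles the two texts have in common (0.0 to 1.0)."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two texts too short to shingle are treated as identical
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    one = "the quick brown fox jumps"
    two = "the quick brown fox sleeps"
    # Two of the four distinct shingles are shared.
    print(jaccard(one, two))
```

A score near 1.0 means the two texts share most of their word windows; swapping a single word only changes the few shingles that contain it.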
     
  3. Kid Shaleen

    Kid Shaleen Regular Member

    Joined:
    Oct 29, 2009
    Messages:
    250
    Likes Received:
    63
    I've been looking over Google's patents on this topic (and plan on writing about them in more detail when I have a better notion of how they all fit together).

    But the use of hashes seems to be one of their top methods.

    As I understand it, they won't try to look at and compare words. Rather, they'll create a hash based on things like words per sentence, sentences per paragraph, paragraphs per article, sentences per article, etc.

    This way, an article spun merely by replacing one word with another would have roughly the same (identical?) hash value.

    That's why I really want to see article spinning software that can delete sentences and paragraphs, change sentence-ending punctuation to semicolons (which changes the number of sentences and the sentence length), and shuffle parts of different articles together.

    The days of simply spinning words are over.
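
A minimal sketch of the kind of structural hash described above, assuming the fingerprint is built only from word counts per sentence, grouped by paragraph. This illustrates the poster's reading of the idea and is not Google's actual method. The function name `structural_fingerprint` and the choice of SHA-1 are assumptions made for the example.

```python
import hashlib
import re


def structural_fingerprint(text: str) -> str:
    """Hash the 'shape' of a text: words per sentence, grouped by paragraph."""
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    shape = []
    for paragraph in paragraphs:
        # Naive sentence split on ., ! and ? (an assumption; real text is messier).
        sentences = [s for s in re.split(r"[.!?]+", paragraph) if s.strip()]
        shape.append(",".join(str(len(s.split())) for s in sentences))
    # The words themselves never enter the hash, only the counts.
    return hashlib.sha1(";".join(shape).encode("utf-8")).hexdigest()


original = "The cat sat on the mat. It was warm.\n\nDogs bark loudly."
# Word-for-word synonym swap: same sentence and paragraph structure.
spun = "The dog lay on the rug. It was hot.\n\nCats meow softly."
# Sentence-ending period changed to a semicolon: two sentences become one.
merged = "The cat sat on the mat; it was warm.\n\nDogs bark loudly."

if __name__ == "__main__":
    print(structural_fingerprint(original) == structural_fingerprint(spun))
    print(structural_fingerprint(original) == structural_fingerprint(merged))
```

Under these assumptions the word-swapped version hashes the same as the original, while the version with a period turned into a semicolon does not, which is the point made above about word-only spinning.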
     
  4. al8xandru

    al8xandru Newbie

    Joined:
    Mar 23, 2010
    Messages:
    14
    Likes Received:
    2
    Home Page:
    I can make a script to shuffle an article's words/sentences/paragraphs like there is no tomorrow, but shuffling them in such a way that the result still makes sense is another thing...
     
  5. Tseng

    Tseng Regular Member

    Joined:
    Mar 16, 2010
    Messages:
    289
    Likes Received:
    33
    Occupation:
    lampin
    Location:
    outerspace
    I have a respinner that uses a database of synonyms, adjectives, and adverbs, all customizable, and it all comes out human readable. It's not 100% flawless, but it's the best spinner I've used personally so far.

    I also hear TBS is, well, the best spinner, if you're willing to shell out the cash. PM me if you want to talk spinners; I may be able to help you out.
     
  6. fung1990

    fung1990 Power Member

    Joined:
    Dec 21, 2009
    Messages:
    579
    Likes Received:
    51
    How do I check if an article is unique?
    Is there any online tool?
     
  7. fung1990

    fung1990 Power Member

    Joined:
    Dec 21, 2009
    Messages:
    579
    Likes Received:
    51
    This page has 328 words matching your text, as highlighted below by Copyscape. Each matching block is highlighted with a different color.
     
  8. casius

    casius Junior Member

    Joined:
    Apr 18, 2010
    Messages:
    186
    Likes Received:
    20
    Home Page:
    Hello,

    Webmasters can instruct spiders not to crawl certain files or directories through the standard robots.txt file in the root directory of the domain. In addition, a page can be explicitly excluded from a search engine's database by using a robots meta tag. When a search engine visits a site, the robots.txt located in the root directory is the first file crawled. The robots.txt file is then parsed, and it tells the robot which pages are not to be crawled.
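
As a rough illustration of the robots.txt check described above, here is a small sketch using Python's standard `urllib.robotparser`. The rules, the `ExampleBot` user agent, and the example.com URLs are all made up for the example. A real crawler would fetch the file from the site root instead of using an inline string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, as it might sit in a site's root directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /drafts/secret.html
"""


def allowed(url: str, agent: str = "ExampleBot") -> bool:
    """Return True if the rules above let the given agent crawl the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())  # parse the rules line by line
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    print(allowed("http://example.com/index.html"))
    print(allowed("http://example.com/private/page.html"))
```

Excluding a single page from the index is a separate mechanism: it is usually done in the page itself with a tag such as `<meta name="robots" content="noindex">`.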