1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

How does google determines duplicate content

Discussion in 'White Hat SEO' started by nixnash, Dec 13, 2010.

  1. nixnash

    nixnash Power Member

    Joined:
    Oct 26, 2009
    Messages:
    581
    Likes Received:
    204
    Occupation:
    Student
    Location:
    BHW
    Hey Guys ,
    We usually use copyscape to check if the content is duplicate or not.

    What do you all think what google uses to decide about duplicate content.


    Code:
    http://googlewebmastercentral.blogspot.com/2008/09/demystifying-duplicate-content-penalty.html
    Google clearly mentions there is no such thing as duplicate content and its okay as long as it provides valuble imformation to the user.

    Looking forward to your views

    NIx
     
  2. steelballs

    steelballs BANNED BANNED

    Joined:
    Dec 5, 2008
    Messages:
    1,832
    Likes Received:
    4,562
    Why do so many people keep looking at duplicate content it is fraught with problems when you use it and many do - I guess because they are too lazy to either write or buy unique content from a third party supplier

    Google have their algorithms to check data
     
  3. houston27

    houston27 Registered Member

    Joined:
    Nov 26, 2010
    Messages:
    93
    Likes Received:
    73
    Location:
    Miami, FL, USA
    Home Page:
    Obviously the modified date is worthless as you pointed out.

    Google has a very vast Server and excellent programmer they have. Google has a record of what they've crawled, and I'd imagine that includes a timestamp, so more than likely the originator of the content is the one that they first recorded.

    None of this has much to do with site's outranking you however, if CNN copies you, you'd better believe they'll outrank you.
     
  4. GreyWolf

    GreyWolf Executive VIP Jr. VIP

    Joined:
    Aug 17, 2009
    Messages:
    1,930
    Likes Received:
    5,388
    Gender:
    Male
    Occupation:
    Artist / Craftsman
    Location:
    sitting at my PC
    Most people nowadays have a misunderstanding of what the problems really are with duplicate content. Unique content is usually very beneficial for your websites and articles, but isn't always an absolute requirement.

    Google doesn't really care about duplicate content unless you're building a bunch of duplicate landing pages on a single website. That's where the idea of a duplicate content penalty comes from and it's similar to keyword stuffing.

    The only real problem regarding duplicate content among different websites is that it creates more competition in the index for the same content. The first instance found will have a slight advantage, but that advantage can quickly disappear when other sites do a better job optimizing for other factors.

    Article writers have to worry about duplicate content for a different reason. Most article directories will only accept original content. Programs like copyscape were created to identify duplicate content for people doing directory submissions.

    Somehow over time the idea of a duplicate content penalty for having duplicate landing pages, and the idea of needing to have unique content for article directories merged into a single idea that duplicate content is bad and there is a duplicate content penalty for any kind of scraped content. If that were really the case then it would be virtually impossible for content aggregators and auto blogs to rank well in the serps. In fact it would be almost impossible to get an ecommerce site to rank well if the product is sourced from a wholesaler since the bulk of your stores content is the same as every other online retailer for the same products.

    There's a couple other threads right now discussing this topic in even more depth. If you want to understand more about duplicate content, when it's ok to use it, and when it's best to be unique then you should read these threads as well.
     
  5. dannyhw

    dannyhw Senior Member

    Joined:
    Jul 16, 2008
    Messages:
    980
    Likes Received:
    462
    Occupation:
    Software Engineer
    Location:
    New York City Burbs
    I'm still on the fence about whether or not when one copy of the content ranks, the other actually gets bumped down.

    I saw some google forum where a guy was complaining to them that someone with a google proxy app had gotten a copy of his whole site indexed, outranked him and he claimed that he got a major bump down the list.
     
  6. dannyhw

    dannyhw Senior Member

    Joined:
    Jul 16, 2008
    Messages:
    980
    Likes Received:
    462
    Occupation:
    Software Engineer
    Location:
    New York City Burbs
    I'm still on the fence about whether or not when one copy of the content ranks, the other actually gets bumped down.

    I saw some google forum where a guy was complaining to them that someone with a google proxy app had gotten a copy of his whole site indexed, outranked him and he claimed that he got a major bump down the list.