1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Copyscape Clone

Discussion in 'BlackHat Lounge' started by surajprakash31, Jan 5, 2009.

  1. surajprakash31

    surajprakash31 Regular Member

    Joined:
    Oct 7, 2008
    Messages:
    261
    Likes Received:
    459
    Home Page:
    I'm thinking of making a Copyscape Clone with the same features and functionality....has anyone ever thought of this before?? Is it possible to make a site like Copyscape? I've seen clones of almost every site from Google to Rapidshare but never even heard of a Copyscape clone....can anyone share his views on this??
     
  2. bigman

    bigman Jr. VIP Jr. VIP Premium Member

    Joined:
    Feb 23, 2008
    Messages:
    278
    Likes Received:
    225
    if you can make a copyscape clone...you can really make it big since they are literally a monopoly as of now...

    plus their pay-per-use business model sucks at a time when everyone is trying to convert to an free-for-use but ad-supported model...

    i've got absolutely no idea of how difficult or easy the task might be bit if you do pull it off and can offer a free service as against copyscape's paid service, i honestly think it has the potential of being something really big...
     
  3. surajprakash31

    surajprakash31 Regular Member

    Joined:
    Oct 7, 2008
    Messages:
    261
    Likes Received:
    459
    Home Page:
    no other ideas?!?!
     
  4. Marko

    Marko Newbie

    Joined:
    Oct 10, 2010
    Messages:
    0
    Likes Received:
    0
    If you can make it... share it with the users here :D I mean if you are good and want to help... if you are here just for the money sell it, you will make sales.

    As I see you can do this, then DO it. Don't ask should you do it. Start coding. Take action. Then decide what will you do after you got it done.
     
  5. ghprod

    ghprod Regular Member

    Joined:
    Mar 18, 2009
    Messages:
    230
    Likes Received:
    40
    Home Page:
    any progress from OP?
     
  6. dewshare

    dewshare Newbie

    Joined:
    Feb 4, 2011
    Messages:
    0
    Likes Received:
    0
    Yes I am PHP OOP Programmar, I am willing to make such a script just like copyscape, if you have idea of its working and know some thing about its way of working them i will do code for you to make this possible

    thanks
     
  7. TheEditor

    TheEditor Regular Member

    Joined:
    Aug 20, 2007
    Messages:
    425
    Likes Received:
    206
    I'm looking for a Copyscape clone as well. I don't intend to compete with them. Anyone know of something similar? Open source?
     
  8. Zak_A

    Zak_A Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 16, 2008
    Messages:
    809
    Likes Received:
    873
    Gender:
    Male
    Occupation:
    WP designer & developer
    Location:
    Western Europe
    The simple statement "copyscape clone" is so ironic, I love the idea ! :D

    I guess it's absolutely possible to create such a thing (they did, so why couldn't you too ?).

    Some other programs that work in the same manner also exist : I'm thinking of some plagia checkers that are used by almost all universities (and the companies that run these softwares sell them for hundreds or thousands).
    These softwares most probably use a similar algorithm (even maybe the same), the only difference with copyscape is that they are based on huge databases of books and academic works, while copyscape is based on content that is published on the web.


    Considering the current situation of this industry (plagia checking) I can only think of two options :

    - Either this requires a huge investment (the algorithm may require lots of programming hard work and/or if needs a huge database that is hard/expensive to gather and/or require huge and expensive servers to run). This option would explain while there is only a few companies that dominate this market, and why they charge so much for it.
    - The 2nd option is that no one even tried to recreate the algorithm because everyone assume option #1 is true, and companies take advantage of this popular false assumption to silently dominate the market and to charge outrageous prices for the softwares.

    Ok I know I didn't really answer OP's main question, but that's my view of it :)

    Bottom line is : the idea is great and is worth thinking about. If you find a way to create such a piece of software by yourself (thus demonstrating option#2), then you'll be sitting on a potential goldmine !

    (I just realized this post is 2 years old, but it doesn't matter, the topic is still worth being dup up IMO :) )
     
  9. TheEditor

    TheEditor Regular Member

    Joined:
    Aug 20, 2007
    Messages:
    425
    Likes Received:
    206
    I'm not even looking for a complete clone. I'd just like a tool that gave a similarity score for potential matches in the document repository. The kind of thing that would allow you to ask a question like "was this written by Shakespeare?".
     
  10. altschule

    altschule Regular Member

    Joined:
    Sep 1, 2010
    Messages:
    282
    Likes Received:
    185
    Location:
    Sector 9
    I figure you make a crawler, which would definitely need a lot of resources, then create your way of checking the content against your crawled data.
     
  11. TheEditor

    TheEditor Regular Member

    Joined:
    Aug 20, 2007
    Messages:
    425
    Likes Received:
    206
    Actually, I don't need the crawler. I'm not testing against new content. The clone would be testing against a repository that never changes.