WP-Obscure Content Obfuscator

Discussion in 'Blogging' started by denight, Apr 11, 2009.

  1. denight

    denight Registered Member

    Joined:
    Apr 7, 2009
    Messages:
    58
    Likes Received:
    26
    Well, I got tired and decided to write a small WordPress plugin that basically rewrites posts with some obfuscated content in them.

    hxxp://wordpress.org/extend/plugins/wp-obscure/

    You can find it in the WordPress plugin search page inside WP, too... It's called WP-Obscure.

    It's very basic and very new. Let's call it a rough draft... anyway, since it's mainly meant to be a blackhat tool, I thought I'd post it here and get some constructive feedback (or just to be told I suck ass, ;))...

    My main goal is to create the highest percentage of unique content without having to rewrite an article with synonyms, bad grammar, etc.

    This is also my first post, so... hey :).

    dN
     
    • Thanks Thanks x 6
  2. iglow

    iglow Elite Member

    Joined:
    Feb 20, 2009
    Messages:
    2,081
    Likes Received:
    856
    Home Page:
    You could write a bit about how it works, what it changes, etc.
     
  3. pspdropship

    pspdropship Newbie

    Joined:
    Sep 11, 2007
    Messages:
    34
    Likes Received:
    38
    That's a cool concept actually: not rewriting it, but obscuring it. So if I understand it right, the content isn't rewritten (which reduces readability); instead, extra content is added to make the whole piece somehow unique, though it takes readers longer to get the point, right?

    If it performs like this, then I think it'll be a great tool, since lots of bloggers writing their own content have a hard time getting to the point anyway...
     
  4. pspdropship

    pspdropship Newbie

    Joined:
    Sep 11, 2007
    Messages:
    34
    Likes Received:
    38
    Sorry, I just read the description and it doesn't work the way I described above. Now I'm wondering whether this will cause search engines to deindex you if they figure out what you are doing?
     
  5. denight

    denight Registered Member

    Joined:
    Apr 7, 2009
    Messages:
    58
    Likes Received:
    26
    You would get deindexed (and most of the time, just penalized) for 100% duplicate content much faster than you would for changing a few characters and adding a few html tags - for all a search engine would know, the changes WP-Obscure makes could just be automatic changes made by a WP theme, or a user for that matter.

    I constantly copy and paste snips of articles for my main blog (completely whitehat), and don't see any penalties because of it.

    In fact, I've seen auto blogs that rank better than the original content creators because they have both original and duplicate content on them. That's what this script does - it combines both.

    This script doesn't stuff keywords, it doesn't cloak pages, and it doesn't do anything against Google's (for example) TOS.

    At any rate, it'll take a good amount of time (I've seen some take 6 months before) to get properly ranked, and it will of course rely on other SEO techniques such as backlinks and keyword optimization.
     
  6. denight

    denight Registered Member

    Joined:
    Apr 7, 2009
    Messages:
    58
    Likes Received:
    26
    Here's an example of what this script does...

    HTML code without WP-Obscure:




    HTML code with WP-Obscure:


    Code:
    Why do people wear diamond watches?  If you have ever seen&#32&#115&#111mebody wearing a diamond w<input id="stats" type="hidden" />atch then you will proba&#98&#108&#121 realize that there was little point
    in even bothe&#114&#105&#110<input id="tracker" type="hidden" />g to ask the question.  Simply taking one look at&#32&#116&#104e watch on their wrist woul<input type="hidden" />d be enough to make 
    it &#109&#111&#114e than evident why anybody would be proud to wear &#115&#117&#99h<input id="stats" type="hidden" /> a beautiful piece of jewelry.  Diamond watches &#104&#97&#118e gained tremendous 
    populari<input type="hidden" />ty in recent years for&#32&#116&#119o major reasons.
    
    <br /><br />The first reason why diamond watch&#101&#115&#32ha<input id="tracker" type="hidden" />ve become so popular in 
    recent years is as a dir&#101&#99&#116 result of the hip hop or bli<input id="apps" type="hidden" />ng culture which has &#98&#101&#99ome a tremendously successful fashion style.  Mos&#116&#32&#110oti
    <input type="hidden" />ceably led by those hip hop or rap stars such a&#115&#32&#69minem, 50 Cent and P Diddy, th<s></s>e hip hop culture is&#32&#111&#110e that celebrates diamonds and all that sparkles 
    a&#110&#100&#32glit<input id="stats" type="hidden" />ters.  Naturally children and young people ar&#101&#32&#107een to emulate their celebritie<input id="phpint" type="hidden" />s, and as a result
    ,&#32&#98&#108ing jewelry, including diamond watches, has become&#32&#97&#32disti<input id="tracker" type="hidden" />nctive fashion statement.
    
    <br /><br />The second reason is&#32&#116&#104at an increasing
    number of desig<input id="apps" type="hidden" />ner watches have c&#111&#109&#101 onto the market, with different brand names now b&#101&#105&#110g rela<input id="counter" type="hidden" />ted to
    distinctive styles that have given th&#101&#32&#99lassic diamond watch a modern twi<input id="apps" type="hidden" />st. There are sev&#101&#110&#32main names to look out for if you are considering
    &#112&#117&#114chasing<input id="apps" type="hidden" /> a diamond watch either for yourself or as &#97&#32&#103ift.
    The trick is, the output to an actual web browser will look identical to that of the original content.

    By the way, this trick is not new. You may have heard the term email obfuscation: an email address is converted to ASCII character codes (HTML entities) so that harvesters can't read it, but it still displays correctly to a reader.
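
A minimal sketch of the email obfuscation idea mentioned in this post, in Python. The function name `obfuscate_email` and the sample address are made up for illustration, and this is not WP-Obscure's own code; it only shows that numeric character references render back to the same text.

```python
import html


def obfuscate_email(address: str) -> str:
    """Encode every character as a decimal HTML character reference."""
    return "".join(f"&#{ord(ch)};" for ch in address)


# Hypothetical address, used only for the demonstration.
encoded = obfuscate_email("a@b.c")
print(encoded)                 # the markup a harvester would see in the source
print(html.unescape(encoded))  # what a browser would display to the reader
```

A browser decodes the references when rendering, which is why the visible text stays unchanged. Any tool that decodes entities can recover the text just as easily.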

    Also... I made a quick webpage to show you as a real-world example:

    Normal Content: hxxp://psxdns.com/wpo-example/index.php?style=normal
    WP-Obscured Content: hxxp://psxdns.com/wpo-example/index.php?style=obscure

    You can run these two URLs through a duplicate content checker such as hxxp://duplicatecontent.net/ - this example comes out to be 76.04% duplicate content (therefore 23.96%, or about 24%, unique content).

    Like I said, I'm considering this current version a rough draft - there are going to be many updates and we can get that percentage down, but I'm going to need ideas on doing that.
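
For a rough local check of the same kind of figure, here is a small Python sketch. How duplicatecontent.net actually scores pages is not stated in this thread, so `difflib.SequenceMatcher` is only a stand-in assumption, and the function names are hypothetical.

```python
from difflib import SequenceMatcher


def duplicate_percent(original: str, variant: str) -> float:
    """Similarity of two strings as a percentage (difflib ratio, not the site's metric)."""
    matcher = SequenceMatcher(None, original, variant, autojunk=False)
    return 100.0 * matcher.ratio()


def unique_percent(duplicate: float) -> float:
    """The unique share is simply what is left over from the duplicate share."""
    return 100.0 - duplicate


# The 76.04% figure is the one reported in the post.
print(round(unique_percent(76.04), 2))
```

Feeding the raw source of the two example pages into `duplicate_percent` gives a comparable number, though it will not necessarily match the checker's result.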

    Thanks,
    dN
     
    Last edited: Apr 11, 2009
  7. mollah

    mollah Power Member

    Joined:
    Feb 11, 2009
    Messages:
    674
    Likes Received:
    548
    interesting idea. in to follow this thread.
     
  8. Viltedali

    Viltedali Regular Member

    Joined:
    Feb 10, 2008
    Messages:
    305
    Likes Received:
    32
    Location:
    Midwest-US
    I believe the phpb@y script uses obfuscation on the links to eb@y to hide their destination.
     
  9. shylesson

    shylesson Power Member

    Joined:
    Jan 10, 2008
    Messages:
    665
    Likes Received:
    2,090
    Location:
    ‹^›‹(•¿•)›‹^›
    How does this differ from uniquifier?
     
  10. ijake

    ijake Registered Member

    Joined:
    Aug 19, 2008
    Messages:
    73
    Likes Received:
    7
    Occupation:
    Web Developer
    Location:
    UK
    Thanks, would probably work better with an article with loads of non-characters (eg * , . > < etc.)
     
  11. cchance

    cchance Junior Member

    Joined:
    Apr 11, 2007
    Messages:
    149
    Likes Received:
    25
    The obfuscation makes some sense, but one thing: drop the hidden inputs and HTML tags inside the text. They won't do anything, because Google strips HTML tags when it looks at the actual word content, or that's what I believe, so "hello friend" and "hello<input type>friend" are most likely the same in my opinion. ASCII-encoding random characters and stuff like that would make more sense I think, but even then it's a definite question...
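
The point about tag stripping can be sketched in Python. This is an assumption about how a text extractor might behave, not a description of what Google actually does; the class and function names are made up, and only the standard-library `html.parser` module is used.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Collect text nodes only; tags are dropped and character references decoded."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)
        self.parts: list[str] = []

    def handle_data(self, data: str) -> None:
        self.parts.append(data)


def visible_text(markup: str) -> str:
    parser = TextOnly()
    parser.feed(markup)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


# Fragments taken from the obfuscated sample earlier in the thread.
print(visible_text('w<input id="stats" type="hidden" />atch'))
print(visible_text("seen&#32&#115&#111mebody wearing"))
```

If an indexer normalizes markup this way, both the hidden inputs and the numeric references collapse back to the original words before any comparison happens.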

    what i'd suggest is something like writeagain, but using w00t letters

    replace certain words with alternative letters, replace some e's with their Unicode equivalent (even if the accent shouldn't be there it would still read the same to most people), replace a few i's with |'s, stuff like that
     
  12. denight

    denight Registered Member

    Joined:
    Apr 7, 2009
    Messages:
    58
    Likes Received:
    26
    Interesting, I'll have to take that into consideration... what I'm messing with right now is moving away from converting ~3 characters into ASCII every ~50, and leaning more towards converting a letter or two in all common words (and, the, who, what... etc) - this way, your keywords will remain untouched but the article will be pretty obfuscated...
     
  13. pspdropship

    pspdropship Newbie

    Joined:
    Sep 11, 2007
    Messages:
    34
    Likes Received:
    38
    looking forward to updates, keep up the good work!
     
  14. venusweb

    venusweb Newbie

    Joined:
    Apr 20, 2009
    Messages:
    5
    Likes Received:
    1
    Occupation:
    Freelance Software Engineer
    Location:
    New Delhi, India
    I don't feel this is the right way to do it.
    There are other small plugins available in BHW
    and on the net to do this job of rewriting the complete article in a WordPress blog and they are good too.
    I have seen results of up to 70 to 80% UNIQUE articles from them
    (I will not give the name of the plugin as it is against the rules of BHW but I can tell you it is good and selling in BHW) :)
    What the people see is what is indexed by Google and all SEs. If the keyword is lost then the page will lose its ranking, this is not what we want, is it?
    The person reading the article who gets the code along with the article will get totally confused and WILL leave the site immediately for sure.
    BUT still, best of luck with your adventure of finding a better free way.
     
  15. denight

    denight Registered Member

    Joined:
    Apr 7, 2009
    Messages:
    58
    Likes Received:
    26
    What are you talking about, "get totally confused and will leave the site..."?

    1.) Either you didn't check out the sample and see that when a user views a page, it is verbatim what the original article is, to the viewer's eyes.

    2.) Or you mean that when the person reading the article decides to view the source code of the page, they get totally confused... - Who does that? Who views the source code of a page when they're just reading articles? The common user is not going to be viewing any source code.

    Either way, like I said in my previous post, I'm working on a method to keep keywords intact.
     
  16. iglow

    iglow Elite Member

    Joined:
    Feb 20, 2009
    Messages:
    2,081
    Likes Received:
    856
    Home Page:
    Changing something to ASCII characters would be a good idea for upgrades. But mainly the code thing is a good idea.
    And to the person above - it is not 'not allowed' to say names of scripts.
     
  17. dw_n_cinti

    dw_n_cinti Regular Member

    Joined:
    Dec 23, 2009
    Messages:
    265
    Likes Received:
    131
    Location:
    Cincinnati
    One thing I would suggest is a way to make sure that your main keywords are not disturbed by ascii text or html. You do not want to hurt any SEO.

    DW