My Own Better-Than-Autoblog Script

Discussion in 'Cloaking and Content Generators' started by wkrappen91, Feb 6, 2011.

  1. wkrappen91

    wkrappen91 Power Member

    Joined:
    Sep 9, 2010
    Messages:
    588
    Likes Received:
    720
    Location:
    127.0.0.1
    Ok, so I am learning PHP and MySQL at the moment, and I had this idea for how to put it to use.

    http://what.online-casinotest.com/blackhatworld.php

    My idea was based on the site wn.com, but on a smaller scale.
    Basically what it does:
    Someone enters a keyword->
    It checks in the DB if the keyword has been entered before->
    If so, the user gets redirected to the ready page.
    If not, the script pulls:

    Wikipedia article
    ezine Article
    3 Youtube videos
    3 Yahoo! Answers Q&As
    And a flickr image

    related to the keyword entered.
    After that, all of it is packed up into a new file and the user gets sent there.
    In the sidebar of each page there are 3 other random pages pulled from the DB and interlinked.
    If you enter a new keyword on the page you landed on, the same thing happens again.
    Now, since those pages are actual .php files, they can be indexed by the BIG G.
    I know it's all duplicate content, but I think if it gets big enough (let's say 100,000 pages) there will be plenty of random hits to it.
    This is just a beta stage on a slow server, so be patient. It might take 10 seconds if you enter a KW that has not been entered before...
    But if you enter it again, shit is instant:p
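
The lookup flow described in this post (check the DB, reuse the page if the keyword is known, build it otherwise) could be sketched roughly as follows. This is Python rather than the PHP the script is written in, and the `pages` table, `make_db`, `get_or_create_page` and the `build_page` callback are all hypothetical names made up for illustration, not the poster's actual code.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a hypothetical keyword -> page table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE pages (keyword TEXT PRIMARY KEY, path TEXT NOT NULL)"
    )
    return conn


def get_or_create_page(conn, keyword, build_page):
    """Return (path, was_cached); build_page runs only for unseen keywords."""
    key = keyword.strip().lower()  # normalise so repeat queries hit the cache
    row = conn.execute(
        "SELECT path FROM pages WHERE keyword = ?", (key,)
    ).fetchone()
    if row is not None:
        return row[0], True  # seen before: instant redirect target
    path = build_page(key)  # slow part: pull the sources, write the file
    conn.execute("INSERT INTO pages (keyword, path) VALUES (?, ?)", (key, path))
    conn.commit()
    return path, False
```

The slow scraping step sits entirely behind `build_page`, which is why a repeated keyword comes back instantly while a new one takes seconds.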

    ToDo:
    Better Design
    Adsense
    Sitemap (not too hard, just have it loop through the DB and echo all the files)
    index.php
    some tweaking on the script (image fails 1 out of 10 times, wiki fails 5 out of 10 times)
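
For the sitemap item on the list above, "loop through the DB and echo all the files" might look something like this sketch. It is Python instead of PHP, and it assumes a hypothetical `pages` table with a `path` column; the real schema is not shown in the thread.

```python
import sqlite3
from xml.sax.saxutils import escape


def build_sitemap(conn, base_url):
    """Return sitemap XML listing every stored page path under base_url."""
    rows = conn.execute("SELECT path FROM pages ORDER BY path").fetchall()
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    for (path,) in rows:
        # escape() protects &, < and > so the XML stays well-formed
        loc = escape(base_url.rstrip("/") + "/" + path)
        lines.append("  <url><loc>" + loc + "</loc></url>")
    lines.append("</urlset>")
    return "\n".join(lines)
```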


    http://what.online-casinotest.com/blackhatworld.php


    Please Guys!
    Let me know what you think, and what i should change.
    I hope you like it
     
  2. blackhit

    blackhit Super Moderator Staff Member Jr. VIP Premium Member

    Joined:
    Jan 28, 2008
    Messages:
    2,406
    Likes Received:
    4,275
    Location:
    Dark Side Of The Moon
    Server down?
     
  3. wkrappen91

    wkrappen91 Power Member

    nope
    just got a domain + hosting
    so took it off of the subdomain sucker

    http://whattoknowabout.net/indexx.php
    Few notes:
    Index.php is not done yet, so you gotta use the indexx for now:p
    I chose the domain name since I figured I would get a ton of long-tail hits once it gets going (what to know about indians, what to know about faceremoval, and so on). 100,000 keywords or so.
    The design of the inner pages is almost final, but might undergo some more changes.
    The sitemap is located at: http://whattoknowabout.net/sitemap/sitemap.php and it automatically lists all current pages, ready for Google to crawl.
    Please don't click my ads to death, as there is no privacy policy and stuff in place yet.
    Tell me what you think:p
     
  4. Autumn

    Autumn Elite Member

    Joined:
    Nov 18, 2010
    Messages:
    2,197
    Likes Received:
    3,044
    Occupation:
    I figure out ways to make money online and then au
    Location:
    Spamville
    Not a bad idea. Somewhat slow and some errors visible. That template is awful and you need to work on your ad placement.

    I would try and include some more text sources too, rather than just the usual suspects.

    Are you multithreading your fetches in the background?
     
  5. wkrappen91

    wkrappen91 Power Member

    Thanks:p
    I know it's slow, but once you enter the keyword again (or someone else did before) it's instant. I kind of liked the simple template, and I figured the ad placement wasn't 100% (or even 10%:p)
    I'm working on getting more sources (forums, twitter, news, amazon/shopping.com).
    No multithreading, as I'm new to PHP and have no clue how to do it...
    The image scraping is the weak spot right now, as it fails way too much...
    I'm working on the Google API, but the pictures turn out really small...
     
  6. Autumn

    Autumn Elite Member

    Have a look at the curl_multi family of functions.
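
To illustrate the idea of running all the fetches at once rather than one after another, here is a rough sketch. It uses a Python thread pool as a stand-in for PHP's curl_multi (which multiplexes transfers in one process rather than using threads), and `fetch`, `fetch_all` and the source names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch(url, timeout=10):
    """Default fetcher: download one URL and return the body as bytes."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read()


def fetch_all(urls, fetcher=fetch, max_workers=8):
    """Fetch all sources concurrently; returns name -> body (None on failure)."""

    def safe(url):
        try:
            return fetcher(url)
        except Exception:
            return None  # one flaky source should not sink the whole page

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map keeps results in the same order as the input URLs
        bodies = list(pool.map(safe, urls.values()))
    return dict(zip(urls.keys(), bodies))
```

With this shape the total wait is roughly that of the slowest source instead of the sum of all of them, and a failing image or wiki request just yields `None` for that slot.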

    flickr is always a bitch to work with and is notoriously unreliable.

    The basic structure of the template is OK but the implementation doesn't look very professional to my eye, it looks like spam. You need to get your ads front and centre and above the fold more. If you read up about adsense hotspots you will see that putting your vertical block on the left will probably get you a better CTR.

    I would put your image links to other pages at the bottom where they are less likely to be clicked.

    Congrats on getting it up and running though!
     
  7. reinie

    reinie Elite Member

    Joined:
    Jan 16, 2009
    Messages:
    1,577
    Likes Received:
    1,040
    Hey, I like it, it looks like a good idea. Just spend some time building it up, tweaking, tweaking and once again, tweaking...
     
  8. wkrappen91

    wkrappen91 Power Member

    Hm, I'm not using cURL at all at the moment, but I will have a look at it.
    Damn, there is a lot to learn:D I was so happy I had it figured out as far as I have, but I know that it's WAY too slow to be good right now.
    I'm really planning on kicking out flickr. The pictures are really unrelated too. You don't even get a straight shot of any product or so...
    Aight, I'll put them down, and try to pump up the ad placement.
    Looking at the template now... I don't like it anymore:D:D
    But yeah, it was kind of more a test of my non-existent skills, but then I liked it too much to not put it online.
     
  9. wkrappen91

    wkrappen91 Power Member

    Wow:(
    That's nice:(
    Working my ass off learning cURL to multithread the requests (I needed multithreading inside multithreading too:( )
    I even made a 2-page construct where the first page does something, saves it into a .txt file and posts the name of the .txt file to the next page, which reads it and continues the work.
    Everything is nice, smooth and kind of fast (a lot faster than before).
    And now...
    999 error from Yahoo due to too many requests:(:(
    That blows!
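
One common way to react to a "too many requests" response is to slow down and retry with growing pauses. The sketch below shows that pattern in Python; `RateLimited`, `fetch_with_backoff` and the delay values are assumptions for illustration, and nothing here is specific to how Yahoo actually signals or enforces its limits.

```python
import time


class RateLimited(Exception):
    """Hypothetical error a fetcher raises when the remote side says 'slow down'."""


def fetch_with_backoff(fetcher, url, retries=4, base_delay=1.0, sleep=time.sleep):
    """Call fetcher(url); on RateLimited, wait 1s, 2s, 4s, ... before retrying."""
    delay = base_delay
    for attempt in range(retries + 1):
        try:
            return fetcher(url)
        except RateLimited:
            if attempt == retries:
                raise  # out of retries: let the caller decide what to do
            sleep(delay)
            delay *= 2  # double the pause after every refusal
```

Backing off only lowers the request rate; if the source keeps refusing, caching results or dropping that source is probably the more reliable fix.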
     
  10. BlackCat67

    BlackCat67 Junior Member

    Joined:
    Nov 17, 2010
    Messages:
    127
    Likes Received:
    24
    Location:
    Texas
    Good work, I agree the template/design could use some work. However the script works great aside from a few errors from time to time. I would love to have a copy of the script.
     
  11. Autumn

    Autumn Elite Member

    Welcome to the wonderful world of scraping. :D Unfortunately that's just a limitation of this kind of site design. Imagine what it's going to be like when you've got dozens / hundreds of surfers on the site at the same time...
     
  12. wkrappen91

    wkrappen91 Power Member

    UPDATE:

    whattoknowabout.net
    Double multithreading is now in place:)
    index.php posts the keyword to a.php
    a.php performs the first set of 4 requests, saves all the shit to a .txt file, and calls
    b.php
    which grabs the .txt file, performs the second set of 4 requests and puts out the result.

    Obviously BETA right now:
    no database, no file that is created, just the pure output in b.php.
    Check it out (don't enter queries with spaces just yet:p I still need to figure out which URLs need "+" and which need "%20").
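
On the "+" versus "%20" question: "+" for a space is the form-encoding convention and is only meaningful in the query string, while "%20" is plain percent-encoding and is the safe choice inside a URL path. A small Python sketch of the two encodings, where `build_url` and the example URL templates are hypothetical:

```python
from urllib.parse import quote, quote_plus


def build_url(template, keyword, plus_for_space):
    """Fill {q} in template with the keyword, encoded as '+' or '%20' style."""
    if plus_for_space:
        encoded = quote_plus(keyword)  # form style: spaces become '+'
    else:
        encoded = quote(keyword, safe="")  # percent style: spaces become '%20'
    return template.format(q=encoded)
```

For example, `build_url("http://search.example/?q={q}", "online casino", True)` gives a query with `online+casino`, while passing `False` with a path-style template gives `online%20casino`.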
     
  13. Red942

    Red942 Newbie

    Joined:
    Jan 18, 2011
    Messages:
    25
    Likes Received:
    0
    Great work man :)
     
  14. galus

    galus Registered Member

    Joined:
    Mar 28, 2009
    Messages:
    98
    Likes Received:
    19
    Location:
    Taipe
    Thanks for your nice script. I'd like to offer a wish list for your future plans.
    1. Make it possible to save flickr images to a folder on our own domain.
    2. Scrape images from Google image search and let the owner filter them by size.
    e.g.
    Code:
    http://unmth.com/Google-Images/index.php
    Regards.
    G.
     
  15. dizz

    dizz Elite Member

    Joined:
    May 19, 2009
    Messages:
    2,068
    Likes Received:
    1,785
    Occupation:
    This... AND MORE!! :D
    Location:
    Texas
    I just got a 404 message, that's not good. I want to see it. PM me when it is up.
     
  16. wkrappen91

    wkrappen91 Power Member

    What do you mean, thanks for the nice script?:)
    I hope you weren't able to leech it or something?
    Why would I waste space on my hosting for flickr images? It makes no sense.
    Less space used, less traffic used, and their server is probably faster than mine:p

    Just go to http://whattoknowabout.net/ and enter a (one word) keyword.
    The indexx and all the inner pages are deleted due to the change to multithreading.
    The generated pages are not static as of now (like they used to be); they are just a preview and still need to be processed and saved to .php, entered into the MySQL database and so on.
    Also I need a design, but I think I have an idea. Not sure if I can pull it off... I'm more a coder than a drawer...
    Hope you like it
     
  17. Autumn

    Autumn Elite Member

    No no no!

    Download the images, use imagemagick to slightly resize them, optimize them and remove the metadata - this is to make them unique in googlebot's eyes.

    Give them keyword filenames eg. your-keyword-here.jpg.
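
Turning a free-form keyword into a filename of the `your-keyword-here.jpg` shape could be sketched like this in Python; `keyword_filename` and the `"image"` fallback are hypothetical choices for illustration.

```python
import re


def keyword_filename(keyword, ext="jpg"):
    """Lowercase the keyword and join its alphanumeric runs with hyphens."""
    slug = re.sub(r"[^a-z0-9]+", "-", keyword.lower()).strip("-")
    return (slug or "image") + "." + ext  # fall back if nothing usable remains
```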

    Link your thumbs to the big pic, or put the big pic on its own page and surround it with niche keywords (and ads of course).

    You will start getting google image search traffic = profit. :)
     
  18. wkrappen91

    wkrappen91 Power Member

    Aight, good idea:)
    I will get to that.
    What do you think about the multithreading?
    Is the speed better that way?
     
  19. galus

    galus Registered Member

    Yes, you are right.
    I also saw some good traffic from image search.
    To the OP: I just admire your effort, I'm not here for tricks or leeching.
    Good day.
     
  20. wkrappen91

    wkrappen91 Power Member

    Ah i was kind of scared for a moment there
    But thanks in that case:)
    I will get to the image-traffic thing as soon as the rest stands on its feet.