[REQ HELP] How to disallow Archive.org

Discussion in 'Forum Suggestions & Feedback' started by l0l0, Jan 16, 2009.

  1. l0l0

    l0l0 Registered Member

    Joined:
    Dec 17, 2008
    Messages:
    90
    Likes Received:
    38
    Hi guys,

    I searched Google for how to block Archive.org from my websites.
    I have read somewhere that Archive.org is closely related to Alexa. Is this true?
    So, what happens when I block Archive.org?
    Will this affect my Alexa ranking?

    Anyway, how can I disallow Archive.org (via .htaccess)?

    Thanks in advance for your help:)
     
  2. xhpdx

    xhpdx Regular Member

    Joined:
    Sep 21, 2008
    Messages:
    331
    Likes Received:
    2,160
    Occupation:
    Coder
    Location:
    EU
    You can do it with robots.txt (replace /Folder/ with the path you want to block, or use Disallow: / to block the whole site):
    Code:
    User-agent: ia_archiver
    Disallow: /Folder/
    With .htaccess it should look like this:
    Code:
    RewriteEngine On
    # If the User-Agent starts with ia_archiver, the rule below returns 403 Forbidden
    RewriteCond %{HTTP_USER_AGENT} ^ia_archiver
    RewriteRule ^.* - [F,L] 
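
If you want to sanity-check the robots.txt rules before uploading them, here is a minimal sketch using Python's standard urllib.robotparser module. The helper name is_allowed and the example.com URLs are just placeholders for illustration, and /Folder/ is the sample path from the post above. Note this only shows how a standards-following parser reads the rules; it does not prove how any particular crawler behaves.

```python
from urllib.robotparser import RobotFileParser

# Rules as posted above; "/Folder/" is the sample path from the post.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /Folder/
"""

# Variant that blocks the whole site for the same crawler.
ROBOTS_TXT_WHOLE_SITE = """\
User-agent: ia_archiver
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given rules let user_agent fetch url.

    Hypothetical helper: it parses the rules from a string instead of
    downloading them, so it can be run offline.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    checks = [
        ("ia_archiver", "http://example.com/Folder/page.html"),
        ("ia_archiver", "http://example.com/index.html"),
        ("Googlebot", "http://example.com/Folder/page.html"),
    ]
    for agent, url in checks:
        print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

With the posted rules only URLs under /Folder/ are off limits for ia_archiver, and other crawlers are not affected. Swap in ROBOTS_TXT_WHOLE_SITE to check the "block everything" variant.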
    
     
  3. l0l0

    l0l0 Registered Member

    Thanks for your fast reply, xhpdx :)
    Yes, I made a mistake; it has to be done with Robot.txt, of course.
    I'm going to try this, thanks again :)
     
  4. shahfil

    shahfil Newbie

    Joined:
    Nov 16, 2008
    Messages:
    5
    Likes Received:
    4
    btw, Robot.txt != robots.txt
     
  5. xhpdx

    xhpdx Regular Member

    Yes, make sure the file is named robots.txt, not robot.txt. Crawlers only request /robots.txt, so a file with any other name will be ignored.