
Creepy Google never deletes anything, even if you do

Discussion in 'BlackHat Lounge' started by bartosimpsonio, Jul 17, 2014.

  1. bartosimpsonio

    bartosimpsonio Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 21, 2013
    Messages:
    8,875
    Likes Received:
    7,477
    Occupation:
    ZLinky2Buy SEO Services
    Location:
    ⇩⇩⇩⇩⇩⇩⇩⇩⇩⇩⇩⇩
    Well, today I got this helpful message from Google. The only problem: I worked on this site years ago, have not been its webmaster since, and "deleted" it from an old Google Webmaster Stools account years ago. If I access that GWT account *right now*, the website is not there.

    Looks like Google's system forgot to "pretend they deleted it" this time, sending me friendly reminders that the site was down.

    Google never deletes anything, it's creepy and it's gonna leak more and more, eventually.


    ------------------------------------------


    http://example.com/: Googlebot can't access your site

    Over the last 24 hours, Googlebot encountered 351 errors while attempting to access your robots.txt. To ensure that we didn't crawl any pages listed in that file, we postponed our crawl. Your site's overall robots.txt error rate is 73.3%.

    You can see more details about these errors in Webmaster Tools.

    Recommended action
    If the site error rate is 100%:

    Using a web browser, attempt to access http://example.com/robots.txt. If you are able to access it from your browser, then your site may be configured to deny access to googlebot. Check the configuration of your firewall and site to ensure that you are not denying access to googlebot.
    If your robots.txt is a static page, verify that your web service has proper permissions to access the file.
    If your robots.txt is dynamically generated, verify that the scripts that generate the robots.txt are properly configured and have permission to run. Check the logs for your website to see if your scripts are failing, and if so attempt to diagnose the cause of the failure.

    If the site error rate is less than 100%:

    Using Webmaster Tools, find a day with a high error rate and examine the logs for your web server for that day. Look for errors accessing robots.txt in the logs for that day and fix the causes of those errors.
    The most likely explanation is that your site is overloaded. Contact your hosting provider and discuss reconfiguring your web server or adding more resources to your website.
    If your site redirects to another hostname, another possible explanation is that a URL on your site is redirecting to a hostname whose serving of its robots.txt file is exhibiting one or more of these issues.

    After you think you've fixed the problem, use Fetch as Google to fetch http://example.com/robots.txt to verify that Googlebot can properly access your site.

    Learn more in our Help Center.

    Got feedback? Leave it here. Be sure to include this message ID: [WMT-XXXXX]
    Google Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 | Unsubscribe.
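
For anyone who wants to run the check the message describes by hand, here is a minimal sketch, not Google's actual logic. It requests robots.txt with a Googlebot-style User-Agent and computes an error rate over a batch of results. The helper names (`robots_status`, `is_fetch_error`, `error_rate`) are made up for illustration, and the rule that only unreachable hosts and 5xx responses count as errors is an assumption. Sending the User-Agent string does not reproduce Googlebot's IP ranges, so a firewall that blocks by IP can still let this request through.

```python
import urllib.error
import urllib.request

# Publicly documented Googlebot User-Agent string; using it only imitates the header.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def robots_status(site_url, user_agent=GOOGLEBOT_UA, timeout=10):
    """Return the HTTP status for <site_url>/robots.txt, or None if unreachable."""
    url = site_url.rstrip("/") + "/robots.txt"
    req = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with a 4xx/5xx status.
        return exc.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout, and similar.
        return None


def is_fetch_error(status):
    """Assumed rule: unreachable (None) or 5xx counts as an error."""
    return status is None or status >= 500


def error_rate(statuses):
    """Percentage of fetch attempts that count as errors under the assumed rule."""
    if not statuses:
        return 0.0
    return 100.0 * sum(is_fetch_error(s) for s in statuses) / len(statuses)
```

Calling `robots_status("http://example.com")` once with the default User-Agent and once with a normal browser string shows whether the server treats the two differently, which is the firewall case the message mentions.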
     
  2. Slim.

    Slim. Junior Member

    Joined:
    Jun 7, 2012
    Messages:
    186
    Likes Received:
    93
    Occupation:
    Affiliate Manager
    Location:
    England
    I am so happy that I saw a world pre-internet. It's scary how much is saved nowadays. The internet never forgets! Makes me think of those poor Tumblr freaks posting videos to YouTube that will likely haunt them to the day they die.
     
  3. LakeForest

    LakeForest Supreme Member

    Joined:
    Nov 11, 2009
    Messages:
    1,269
    Likes Received:
    1,802
    Location:
    Location Location
    Yeah...The future generations are going to look at their grandmothers in a way like never before, directly into their holes. And with image identification getting better and better, you'll be able to access sites and they will tell you "oh hey, this is your grandma! look how smooth everything is! She must be very loving and caring, she has 6 guys around her!"



    I guess the lesson is becoming: don't use G for anything. I don't use G WMT or G Analytics or G API or G anything except AdSense, and I haven't worked on that in a while.
     
  4. bartosimpsonio

    Some people were radically against the "right to be forgotten". But I think it's 100% justified: it must be made illegal to store information for longer than a certain amount of time unless users authorize it. With cheap storage you can basically store people's lives in digital form. It's legal but it's immoral. If you screw up when you're 17 years old, you shouldn't have to bear that burden when you're mature, married and so on.

    Google, Facebook, and the others must be forced by law to delete data. Their technical capabilities are oh so cute, they're really clever, but the rule of law must be above their hard drive capacity.