
Scraping content from Wayback machine

Discussion in 'Black Hat SEO' started by forlearnx, Jan 15, 2013.

  1. forlearnx

    forlearnx Regular Member

    Joined:
    Apr 20, 2011
    Messages:
    392
    Likes Received:
    92
    Location:
    UK
    As you know, the Wayback Machine keeps whatever it has scraped from past crawls.

    Does anyone know of a way to scrape the contents of expired domains previously crawled by the Wayback Machine?
     
  2. trevormorley

    trevormorley Junior Member

    Joined:
    Feb 25, 2010
    Messages:
    170
    Likes Received:
    46
    I would like to know the answer to this as well. It would be great for grabbing content.
     
  3. CosmicSoundz

    CosmicSoundz BANNED

    Joined:
    Apr 30, 2012
    Messages:
    1,230
    Likes Received:
    1,296
    If it shows up on the Wayback Machine (web.archive.org), I can scrape it. PM me.
     
  4. MMBlack

    MMBlack Junior Member

    Joined:
    May 7, 2010
    Messages:
    125
    Likes Received:
    26
    Occupation:
    Writer
    Location:
    Northern Hemisphere
    Scraping the Wayback Machine is a little difficult to navigate, but if you're looking for something on a specific domain, it's great.
     
  5. mofoparrot

    mofoparrot Junior Member

    Joined:
    Jan 2, 2011
    Messages:
    130
    Likes Received:
    24
    Yeah, which is why I normally just use http://archivescraper.net. It gets the job done. This is very common when building PBNs, not only for the content but also for the different CSS on each site, which keeps them unique and reduces the footprint.
     
  6. Brad100

    Brad100 Jr. VIP Premium Member

    Joined:
    Nov 9, 2014
    Messages:
    1,347
    Likes Received:
    958
    Gender:
    Male
    Well, I'm sure the OP would have appreciated your help... if only you had arrived 2 years ago.

    :pat:
     
  7. mofoparrot

    mofoparrot Junior Member

    Joined:
    Jan 2, 2011
    Messages:
    130
    Likes Received:
    24
    Yeah, true, but it helps future readers who have the same problem, since even two years later this is still the most consistent way to rank. Also, most of these threads would have been archived, but since the BHW redesign they show up as non-archived. But yeah, rep given :p
     
  8. blacktrilby

    blacktrilby Power Member

    Joined:
    Dec 9, 2008
    Messages:
    512
    Likes Received:
    388
    Occupation:
    Webmaster
    Location:
    Matt Cutts Underwear
    What sort of results do you get using this?
     
  9. terrycody

    terrycody Senior Member

    Joined:
    Sep 29, 2012
    Messages:
    858
    Likes Received:
    201
    Occupation:
    marketer
    Location:
    Hell
    I don't think automatically scraping expired domain content is a good idea. Why?

    Well, I used a method from BHW that reuses expired domain content on a money site. Two months later, I have published 200 posts and so far have got 0 traffic. I won't say that reusable content is bad, just that it is hard to pick out the good pieces: nearly 99.9% of expired domain content is garbage, spun, or duplicated. You also need to run all of it through Copyscape to check whether it is reusable, which is so fucxxxx time-consuming that I don't think it pays off. I really don't know how people build PBNs from expired domains; maybe they just find the right ones. In my opinion, it is hard for a tool to automate this. It can't judge the quality of the content, so you have to do that by hand.
     
  10. mofoparrot

    mofoparrot Junior Member

    Joined:
    Jan 2, 2011
    Messages:
    130
    Likes Received:
    24
    It really depends on the niche. As long as you keep it private and don't rent links, it's completely fine. You would be amazed at how few good links you need to rank in some niches. I tend to do a few sites a week, and I have money sites that have stayed ranked for as long as I can remember since I set up the PBN in those niches.
     
  11. mofoparrot

    mofoparrot Junior Member

    Joined:
    Jan 2, 2011
    Messages:
    130
    Likes Received:
    24

    I think you are misunderstanding the purpose of this. You are not using this content on your money site. That's not to say you couldn't scrape a site and try to rebuild it (I have done this with success), but the point is to build a PLN/PBN pointing to your money site. You pick up an expired/deleting domain with a decent link profile in the niche you are trying to rank for. You then restore an old version of the site and put your link on it. Rinse and repeat.
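
For anyone who wants to see what the "find an old version of the site" step might look like without a third-party tool, here is a minimal sketch. It assumes the Wayback Machine's public CDX endpoint at web.archive.org and its JSON output, where the first row is a header. The helper names (`build_cdx_query`, `parse_cdx_rows`, `snapshot_url`) are made up for illustration and are not taken from any tool mentioned in this thread. The sketch only builds the query and parses a response; it does not download anything itself.

```python
import json
from urllib.parse import urlencode

# Assumed endpoint: the Wayback Machine's public CDX index.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(domain, limit=50):
    """Build a CDX query URL listing archived captures for a domain."""
    params = {
        "url": domain,
        "matchType": "domain",        # include subdomains and all paths
        "output": "json",             # JSON array; first row is the header
        "filter": "statuscode:200",   # keep only successful captures
        "collapse": "urlkey",         # one row per distinct URL
        "limit": str(limit),
    }
    return CDX_ENDPOINT + "?" + urlencode(params)


def parse_cdx_rows(payload):
    """Turn the JSON text returned by the CDX index into a list of dicts."""
    rows = json.loads(payload)
    if not rows:
        return []
    header, *records = rows
    return [dict(zip(header, record)) for record in records]


def snapshot_url(timestamp, original):
    """URL of one capture; the id_ suffix asks for the page as archived,
    without the Wayback toolbar and rewritten links."""
    return f"https://web.archive.org/web/{timestamp}id_/{original}"
```

To use it, fetch the URL from `build_cdx_query("example.com")` with something like `urllib.request.urlopen`, pass the response text to `parse_cdx_rows`, and build a `snapshot_url` from each row's `timestamp` and `original` fields. As terrycody points out above, you would still have to judge the quality of each page by hand.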