1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

How to transfer content from Wayback to a live site

Discussion in 'BlackHat Lounge' started by groundfalling, Feb 28, 2015.

  1. groundfalling

    groundfalling Jr. VIP Jr. VIP

    Joined:
    Jun 18, 2012
    Messages:
    327
    Likes Received:
    294
    Hi Guys,
    I need to build a copy of a dead site from wayback machine. The site has been offline for almost six months and its pages are no longer indexed in google. The site is pretty big with more than 100,000 pages. It was not a wordpress site but i need to transfer all its content to a WP site.What would you recommend would be the best way to it. I have also already downloaded it from wayback but the downloaded copy is not as per original sitemap, the content is all mixed up. So my only hope is transfer all the content directly from wayback to my new site. Is there any way out there to automate this process or manual copy is the only option? Thanks
     
  2. ezines

    ezines Power Member

    Joined:
    Jan 3, 2011
    Messages:
    719
    Likes Received:
    219
    Occupation:
    Online/Offline
    Location:
    Somewhere On Earth
    I think there is a tool/service called wayback machine ripper or downloader, archiver or something? I haven't tried it as I don't stumbled that many pages in wayback machine, so can't tell if the tool is reliable and do what it says.

    Not sure if HTTrack can do that also...
     
    • Thanks Thanks x 1
  3. Panther28

    Panther28 Jr. VIP Jr. VIP

    Joined:
    May 2, 2010
    Messages:
    2,537
    Likes Received:
    3,561
    Occupation:
    Internet.
    Location:
    Internet.
    Home Page:
    you could try one of the csv uploaders for articles, but it sounds like even with that you'll still require a great deal of manual work.
     
  4. saurabh82

    saurabh82 BANNED BANNED

    Joined:
    Sep 27, 2013
    Messages:
    1,228
    Likes Received:
    133
    i dnt think it can be automated.You have to manually copy the articles
     
  5. groundfalling

    groundfalling Jr. VIP Jr. VIP

    Joined:
    Jun 18, 2012
    Messages:
    327
    Likes Received:
    294
    This is the problem, the site is so big that it would take ages to copy it manually.
     
  6. RuthSam

    RuthSam Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 19, 2010
    Messages:
    3,812
    Likes Received:
    973
    Gender:
    Male
    Home Page:
  7. groundfalling

    groundfalling Jr. VIP Jr. VIP

    Joined:
    Jun 18, 2012
    Messages:
    327
    Likes Received:
    294
    I have already downloaded a copy of the site using HTTrack but it just dumped all the content in html pages in 1000s of sub folders. so it is almost impossible to arrange them all.
     
  8. RuthSam

    RuthSam Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 19, 2010
    Messages:
    3,812
    Likes Received:
    973
    Gender:
    Male
    Home Page:
    hmm, I didn't refer to HTTRACK it can't do what you want to do, already tested it for some time ago!