Archive.org site downloader

Discussion in 'Hire a Freelancer' started by welly_59, Jun 16, 2015.

  1. welly_59

    welly_59 Power Member

    Joined:
    Aug 30, 2011
    Messages:
    698
    Likes Received:
    258
    I have this on Freelancer already, but I'm open to offers from here if someone is capable.

    I need a script written to scrape a website from archive.org. The script should remove all archive.org tags/ads from the code and download all files into the same folders and subfolders as the original. The downloaded website should be complete, as it appears on archive.org, and ready to upload without further code modification. You can guide me through uploading it to my cPanel account.

    Example: I provide a URL like http://web.archive.org/web/20080604122556/http://thedomain.com/ to the script, and it gets ALL content on the page (including subpages). The URL structure of the site mustn't change. I need a simple web interface where I enter the starting archive.org URL.

    Each site recovery should contain all pages in HTML format, and all images the site was using should be downloaded. The URL structure should be exactly as it was on the original site, including links to images and internal and outbound links. Files passing variables (for example, ending with ?dvar=variable) should also be saved as in the original.
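
To make the requirements above concrete, here is a minimal sketch of the URL handling such a script might need. It is not a full downloader. The function names (`split_wayback_url`, `local_path`, `clean_html`) are hypothetical, and the patterns assume the usual Wayback Machine layout of `/web/<timestamp>/<original URL>`, an optional two-letter modifier such as `im_` after the timestamp, and the toolbar being wrapped in `BEGIN/END WAYBACK TOOLBAR INSERT` comments.

```python
import re
from urllib.parse import urlsplit

# Assumed shape: http://web.archive.org/web/<timestamp>[<modifier>_]/<original URL>
WAYBACK_RE = re.compile(
    r"^https?://web\.archive\.org/web/(\d{1,14})(?:[a-z]{2}_)?/(.+)$"
)
# Archive prefix as it tends to appear inside rewritten links (absolute or relative).
PREFIX_RE = re.compile(
    r"(?:https?://web\.archive\.org)?/web/\d{14}(?:[a-z]{2}_)?/"
)
# Assumed comment markers around the injected toolbar.
TOOLBAR_RE = re.compile(
    r"<!-- BEGIN WAYBACK TOOLBAR INSERT -->.*?<!-- END WAYBACK TOOLBAR INSERT -->",
    re.DOTALL,
)


def split_wayback_url(url):
    """Return (timestamp, original_url) for an archive.org snapshot URL."""
    match = WAYBACK_RE.match(url)
    if match is None:
        raise ValueError("not a Wayback Machine snapshot URL: " + url)
    return match.group(1), match.group(2)


def local_path(original_url):
    """Map an original URL to a relative file path that keeps its structure."""
    parts = urlsplit(original_url)
    path = parts.path.lstrip("/")
    if path == "" or path.endswith("/"):
        path += "index.html"  # directory-style URLs need a file name
    if parts.query:
        path += "?" + parts.query  # keep ?dvar=variable as part of the name
    return path


def clean_html(html):
    """Drop the toolbar block and strip archive prefixes from links."""
    html = TOOLBAR_RE.sub("", html)
    return PREFIX_RE.sub("", html)


if __name__ == "__main__":
    start = "http://web.archive.org/web/20080604122556/http://thedomain.com/"
    timestamp, original = split_wayback_url(start)
    print(timestamp, original, local_path(original))
```

Keeping the query string in the file name preserves the original URL structure, but `?` is not a valid file-name character on Windows, and the web server would still need a rewrite rule to serve such files. Treat that part as an open design choice.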
     
  2. pavan

    pavan Elite Member

    Joined:
    Mar 30, 2008
    Messages:
    1,819
    Likes Received:
    457
    I can do this
    add me up on skype
    Skype: PavanBHW
     
  3. MrBlue

    MrBlue Senior Member

    Joined:
    Dec 18, 2009
    Messages:
    974
    Likes Received:
    680
    Occupation:
    Web/Bot Developer
    Already have a bot for this. Hit me up on Skype: i_m_mrblue
     
  4. welly_59

    welly_59 Power Member

    Joined:
    Aug 30, 2011
    Messages:
    698
    Likes Received:
    258
    You both have a PM. No Skype, sorry, as I'm on a mobile device for the next few days.