1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Is there a tool that can extract PDF Files from a site?

Discussion in 'Black Hat SEO Tools' started by bhconsultant, Apr 17, 2012.

  1. bhconsultant

    bhconsultant Junior Member

    Joined:
    May 24, 2010
    Messages:
    117
    Likes Received:
    12
    Hi,

    I was wondering, if there is a tool that can extract all PDF Files from a website? Or at least give me a directory listing of all pdf files on a site so I can download the ones I need. I know I can do this with Google search operators, but that only work for indexed files.

    Thanks
     
  2. Ambassy

    Ambassy BANNED BANNED

    Joined:
    Apr 13, 2011
    Messages:
    642
    Likes Received:
    163
    You could try downloading the entire website with a tool like webreaper.
     
  3. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,364
    Location:
    zeta reticuli
    You can use any website copier to download and make an offline version of a site and then browse through it to find the files you need. This works only if the PDFs are made open to world.

    Code:
    http://www.httrack.com/
     
  4. poweronics

    poweronics Jr. VIP Jr. VIP Premium Member

    Joined:
    May 1, 2011
    Messages:
    3,117
    Likes Received:
    353
    Occupation:
    Freelancer
    Home Page:
    That is very helpful tool buddy. Thanks for sharing.
     
  5. bhconsultant

    bhconsultant Junior Member

    Joined:
    May 24, 2010
    Messages:
    117
    Likes Received:
    12
    Thanks for the help guys. HTTrack, doesnt seem to do the job and isnt very user friendly IMHO. I am still trying to find a woking copy of webreaper.

    Any other alternatives?
     
  6. LX911

    LX911 Regular Member

    Joined:
    Jun 28, 2011
    Messages:
    360
    Likes Received:
    36
    You can use Internet Download Manager and select 'Download all Links' ;)
     
  7. HoNeYBiRD

    HoNeYBiRD Jr. VIP Jr. VIP

    Joined:
    May 1, 2009
    Messages:
    5,913
    Likes Received:
    7,150
    Gender:
    Male
    Occupation:
    Geographer, Tourism Manager
    Location:
    Ghosted
  8. ant_des

    ant_des Registered Member

    Joined:
    Sep 14, 2008
    Messages:
    85
    Likes Received:
    63
    you can use scrapebox with this query in Google:
    filetype:pdf site:yoursite.com

    After you can use downthem all or internet download manger for dowonloading all pdf files.
     
  9. bhconsultant

    bhconsultant Junior Member

    Joined:
    May 24, 2010
    Messages:
    117
    Likes Received:
    12
    Thanks bud..IDM seems pretty effective, although it keeps popping up errors on Win 7 64 Bit, but it still does the job.