How to find PDF documents whose text is not in Google? (Image-only PDFs)

Discussion in 'Black Hat SEO' started by senuke, Aug 4, 2014.

  1. senuke

    senuke Newbie

    Joined:
    Oct 7, 2011
    Messages:
    43
    Likes Received:
    7
    Occupation:
    Search Engine Optimizer, Internet Marketing, Linux
    Location:
    Germany
    Is there a way to find PDFs whose text has not been indexed by Google?
    I am looking for the old type of PDF where the content is just a picture inside the PDF.

    Why?

    After a lot of research I found a few, ran them through my OCR, and pasted pieces of the text into Google. None
    of this content was in Google. Great, isn't it? So the picture was in Google, but not the text in the picture.
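The manual check described above (paste pieces of the OCR text into Google and see whether anything comes back) could be prepared with a small script. This is only a sketch: the function names `sample_phrases` and `exact_match_url` are made up for illustration, the OCR text is assumed to be available already as a plain string, and the script only builds quoted-phrase search URLs to open by hand. It does not query Google itself.

```python
from urllib.parse import quote_plus


def sample_phrases(ocr_text, words_per_phrase=8, max_phrases=3):
    """Pick a few non-overlapping word windows spread across the OCR text."""
    words = ocr_text.split()
    if len(words) < words_per_phrase:
        return []
    last_start = len(words) - words_per_phrase
    # Never ask for more phrases than fit without overlapping.
    count = min(max_phrases, last_start // words_per_phrase + 1)
    step = last_start // (count - 1) if count > 1 else 0
    return [
        " ".join(words[i * step : i * step + words_per_phrase])
        for i in range(count)
    ]


def exact_match_url(phrase):
    """Build a Google search URL for the phrase wrapped in double quotes."""
    return "https://www.google.com/search?q=" + quote_plus(f'"{phrase}"')


if __name__ == "__main__":
    text = "the quick brown fox jumps over the lazy dog " * 5
    for phrase in sample_phrases(text):
        print(exact_match_url(phrase))
```

If none of the quoted phrases returns a hit, the text is probably not indexed. OCR errors inside a phrase can also cause a miss, so it is worth trying more than one phrase per document.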

    I thought it would be a great help to find more of this special type of PDF to get unique content quickly and easily.

    Maybe some of the genius programmers here could even set up a search engine for that?
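Any such tool would first need a way to tell picture-only PDFs apart from normal ones. A very rough heuristic, sketched here under the assumption that the PDF's object dictionaries are stored uncompressed (common in old scans, not guaranteed in newer files that use object streams): a scanned PDF usually contains image objects but no font resources. The function name `looks_image_only` is hypothetical, and a proper PDF library would be more reliable than scanning raw bytes.

```python
import re

# Font resources normally appear as "/Font" keys or "/Type /Font" entries.
FONT_RE = re.compile(rb"/Font\b")
# Embedded pictures are image XObjects: "/Subtype /Image".
IMAGE_RE = re.compile(rb"/Subtype\s*/Image\b")


def looks_image_only(pdf_bytes):
    """Guess whether a PDF is a pure scan: has images but no fonts.

    Heuristic only. PDFs with compressed object streams can hide both
    markers, and OCR'd scans with a hidden text layer do contain fonts.
    """
    has_image = IMAGE_RE.search(pdf_bytes) is not None
    has_font = FONT_RE.search(pdf_bytes) is not None
    return has_image and not has_font


if __name__ == "__main__":
    scanned = b"%PDF-1.3\n1 0 obj << /Type /XObject /Subtype /Image >> endobj"
    print(looks_image_only(scanned))
```

For a file on disk, the bytes can be read with `open(path, "rb").read()` and passed in. Candidates that pass this filter would still need the OCR and search check before their text can be called unindexed.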

    What do you think about my idea? Is it a bad one, or one we should follow up on?

    Thanks for your replies.
     
  2. SEO Power

    SEO Power Elite Member

    Joined:
    Jul 14, 2014
    Messages:
    2,637
    Likes Received:
    680
    Occupation:
    Self employed
    Location:
    Houston, TX
    The textual content of most, if not all, PDFs is indexed by Google as long as the PDF file/URL itself is indexed. You'll need to do a deep search to find the kind of PDFs you are looking for in Google. What you are essentially looking for is PDFs whose URLs haven't been indexed.
     
  3. edvard_munch

    edvard_munch Newbie

    Joined:
    May 13, 2013
    Messages:
    17
    Likes Received:
    1
    Occupation:
    Still student
    Location:
    Croatia
    Interesting, I've been asking myself the same question.
     
  4. senuke

    senuke Newbie

    Joined:
    Oct 7, 2011
    Messages:
    43
    Likes Received:
    7
    Occupation:
    Search Engine Optimizer, Internet Marketing, Linux
    Location:
    Germany
    What I found out:

    Searching Google Images with "pdf" in the query, e.g. "ford mustang pdf", you get a lot of indexed stuff, but if you look for pictures from old magazines and newspapers
    you find a few whose text is not indexed even though the URL of the document is.

    Well, that is far from a quick way to find content. However, maybe some member has an idea.

    Thanks for reading.
     
  5. SpookSEO

    SpookSEO Senior Member

    Joined:
    Dec 15, 2012
    Messages:
    848
    Likes Received:
    180
    Occupation:
    Linkbuilder
    Location:
    London, UK
    This has also been a huge struggle for me. I guess you will need to do extensive research to reach the PDFs you want. The safest approach is probably to rely on what Google has already indexed.