1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

GET: URL Text Scraper for OSX (Java)

Discussion in 'Black Hat SEO Tools' started by JunglePocket, Nov 2, 2011.

  1. JunglePocket

    JunglePocket Registered Member

    Joined:
    Jan 2, 2008
    Messages:
    86
    Likes Received:
    109
    I had a simple tool made which extracts text from a list of URLs and saves into one .txt file. It's programmed in Java as I wanted to use it on my Mac, but not sure if it works on Windows.

    Instructions:

    There are 3 files in the folder: HTMLParser.jar, urls.txt and output.txt

    1) Open urls.txt and paste in the urls you want to scrape.
    2) Click HTMLParser.jar and OSX should open it with Jar Launcher
    3) It will ask you to select what file to use for the input URLs (choose urls.txt) and also what file to save to (choose output.txt)
    3) Wait a few minutes depending on the number of urls you are scraping.
    4) View extracted text in output.txt (make sure this file is empty before running the application.)

    There is no GUI, so you cannot easily see when it has finished scraping, but if you keep an eye on the size of the output.txt file in Finder, you should be able to determine when it's finished.

    http://www.mediafire.com/?ykgc0mwr495od5c

    http://www.virustotal.com/file-scan...fd82936baa10da7aad3b10d18040179ebc-1320225585
     
    Last edited: Nov 2, 2011