Stripping just the test from an entire blog

Discussion in 'Associated Content & Writing Articles' started by poodeedy, Jan 7, 2010.

  1. poodeedy

    poodeedy Registered Member

    Joined:
    Nov 4, 2008
    Messages:
    69
    Likes Received:
    15
    Location:
    NJ
    Any good tools for this? I have a few scrapers and the good old http track, however what I am looking for us a tool that can grab all of the text from a webpage but not include all of the information such as graphics or videos or any other code. Text and text only.

    Just looking to run a tool on a site, download all text, organized by directory. Tried a few demos but all stunk like a witches taint.
     
  2. warrior skunk

    warrior skunk Newbie

    Joined:
    Sep 1, 2009
    Messages:
    45
    Likes Received:
    25
    Imacros has a pretty good extraction tool. I suppose you could use it to extract all the text within the body. I have a feeling thats not what you want though because it will take all the sidebars and menu link text.

    You could also tell it to grab all the text from inside a div... say the div that has the post in it. If thats what you are looking for.