Stripping just the text from an entire blog

Discussion in 'Associated Content & Writing Articles' started by poodeedy, Jan 7, 2010.

  1. poodeedy

    poodeedy Registered Member

    Nov 4, 2008
    Any good tools for this? I have a few scrapers and good old HTTrack, but what I am looking for is a tool that grabs all of the text from a webpage and leaves out everything else: graphics, videos, and any other code. Text and text only.

    Just looking to run a tool on a site and download all the text, organized by directory. I tried a few demos, but they all stunk like a witch's taint.
  2. warrior skunk

    warrior skunk Newbie

    Sep 1, 2009
    iMacros has a pretty good extraction tool. I suppose you could use it to extract all the text within the body. I have a feeling that's not what you want, though, because it will also pick up all the sidebar and menu link text.

    You could also tell it to grab all the text from inside a single div, say the div that has the post in it, if that's what you are looking for.
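
For anyone who would rather script it, here is a rough sketch of the same "text from one div" idea in plain Python. This is not how iMacros does it, just a standard-library stand-in. The names `TextExtractor`, `extract_text`, and `text_path_for`, the `post` id, and the `site_text` output folder are all made up for illustration, and the depth counting assumes reasonably well-formed HTML (every non-void tag gets closed).

```python
from html.parser import HTMLParser
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Tags whose contents are code, not readable text.
SKIP_TAGS = {"script", "style", "noscript"}
# Void elements never have a closing tag, so they must not affect depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class TextExtractor(HTMLParser):
    """Collect visible text, optionally only from inside one element id."""

    def __init__(self, target_id=None):
        super().__init__(convert_charrefs=True)
        self.target_id = target_id  # hypothetical id of the post container
        self.depth = 0              # > 0 while inside the target element
        self.skip_depth = 0         # > 0 while inside script/style/noscript
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        if self.depth > 0:
            self.depth += 1
        elif self.target_id is not None and dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        if self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        in_scope = self.target_id is None or self.depth > 0
        if in_scope and self.skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html, target_id=None):
    """Return text from the whole page, or only from the element with target_id."""
    parser = TextExtractor(target_id)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


def text_path_for(url, root="site_text"):
    """Map a page URL to a .txt path that mirrors the site's directories."""
    parsed = urlparse(url)
    path = parsed.path
    if not path or path.endswith("/"):
        path += "index"  # directory-style URLs get an index file
    return str(PurePosixPath(root, parsed.netloc, path.lstrip("/")).with_suffix(".txt"))
```

Calling `extract_text(page_html)` with no id takes everything, sidebars and menus included, while `extract_text(page_html, "post")` keeps only what sits inside that one container. `text_path_for` only works out where each page's text would be saved so the output stays organized by directory; fetching the pages and writing the files is left out.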