Scrape another wordpress site?

Discussion in 'Blogging' started by wizzkidd, Dec 22, 2011.

  1. wizzkidd

    wizzkidd Junior Member

    Joined:
    Jan 1, 2011
    Messages:
    138
    Likes Received:
    14
    Does anyone know of a tool that can help me scrape content from another WordPress site?

    Most importantly, I'm after the image in each post and the post title. I don't mind if it's not auto-posted to my WordPress site, because I can use a bulk import script if need be.

    Any ideas?

    WK
     
  2. wizzkidd

    wizzkidd Junior Member

    I was hoping I could use a plugin, but if not, I'll use a simple PHP script.

    I think what I need to do is scrape the HTML between a START and an END block of code. Then I should be able to work with it from there. My limited PHP knowledge should get me through it.
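
    The START/END idea can be sketched in a few lines. This is only a rough illustration in Python rather than PHP, and the marker strings and sample HTML are made up for the example; on a real site you would pick two snippets of markup that reliably surround the post body.

```python
def extract_between(html, start_marker, end_marker):
    """Return the text between the first start_marker and the next end_marker.

    Returns None if either marker is missing.
    """
    start = html.find(start_marker)
    if start == -1:
        return None
    start += len(start_marker)  # skip past the START marker itself
    end = html.find(end_marker, start)
    if end == -1:
        return None
    return html[start:end]


# Hypothetical page source; a real page would be downloaded first.
sample = (
    '<html><body><!-- START --><h1>Post title</h1>'
    '<img src="pic.jpg"><!-- END --></body></html>'
)
block = extract_between(sample, "<!-- START -->", "<!-- END -->")
print(block)
```

    Plain string searching like this breaks as soon as the theme changes its markup, so it is best treated as a one-off tool for grabbing older posts.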

    Anyone out there?
     
  3. Malachute

    Malachute Junior Member

    Joined:
    Oct 21, 2009
    Messages:
    124
    Likes Received:
    414
    Most WordPress scraping plugins are meant to pull the latest content from a blog through its RSS feed. If you want to use the start and end blocks of code, use Yahoo Pipes. Here is a pipe example from my ebook. Autoblogging plugins like wprobot and autoblogged are good choices.

    I'd suggest writing a small script to scrape the older content first, then using a plugin to scrape future content automatically. That way you don't have to keep editing the script, which can be a huge time sink.
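
    For the RSS route, here is a minimal sketch of pulling the post title and first image out of a feed, using only the Python standard library. The feed text below is invented for the example, and the function name is just illustrative. It assumes the feed carries the full post HTML in `content:encoded` (as WordPress feeds commonly do) and falls back to `description` otherwise.

```python
import re
import xml.etree.ElementTree as ET

# Namespace of the RSS "content" module, which defines content:encoded.
CONTENT_NS = "{http://purl.org/rss/1.0/modules/content/}"
# Simple pattern for the src of an <img> tag; fine for a sketch, not a full HTML parser.
IMG_SRC = re.compile(r'<img[^>]+src=["\']([^"\']+)["\']', re.IGNORECASE)


def titles_and_images(rss_xml):
    """Return a list of (title, first_image_url_or_None) for each feed item."""
    root = ET.fromstring(rss_xml)
    results = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        body = (
            item.findtext(CONTENT_NS + "encoded")
            or item.findtext("description")
            or ""
        )
        match = IMG_SRC.search(body)
        results.append((title, match.group(1) if match else None))
    return results


# Made-up feed standing in for a downloaded one.
SAMPLE_FEED = """<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Example blog</title>
    <item>
      <title>First post</title>
      <content:encoded><![CDATA[<p><img src="http://example.com/a.jpg" alt=""> Hello</p>]]></content:encoded>
    </item>
    <item>
      <title>Second post</title>
      <description>No image here</description>
    </item>
  </channel>
</rss>"""

for title, image in titles_and_images(SAMPLE_FEED):
    print(title, image)
```

    A feed normally lists only the most recent posts, which is why older content still needs a separate one-off script.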