Is there any tool to scrape content off wordpress blogs?

 

Results 1 to 5 of 5
I want to 'clone' a wordpress blog site. The website is huge - 2200 pages ...
  1. #1
    daserpent's Avatar
    daserpent is offline Power Member
    Join Date
    May 2010
    Posts
    748
    Thanks
    212
    Thanked 454 Times in 174 Posts

    Default Is there any tool to scrape content off wordpress blogs?

    I want to 'clone' a wordpress blog site. The website is huge - 2200 pages in google.

    Is there any tool that would scrape the content as it is? Maybe use category specific rss feeds to scrape and put them in the right categories on my domain?




  2. #2
    thesuvo's Avatar
    thesuvo is offline Registered Member
    Join Date
    Feb 2010
    Location
    India
    Posts
    82
    Thanks
    5
    Thanked 18 Times in 8 Posts

    Default Re: Is there any tool to scrape content off wordpress blogs?

    I'm also looking for the same tool Bro

  3. #3
    stimpo321's Avatar
    stimpo321 is offline Registered Member
    Join Date
    Aug 2010
    Location
    Worcs UK
    Posts
    69
    Thanks
    14
    Thanked 6 Times in 6 Posts

    Default Re: Is there any tool to scrape content off wordpress blogs?

    Would wp robot do that?

  4. #4
    DustinX is offline Junior Member
    Join Date
    Nov 2009
    Posts
    104
    Thanks
    43
    Thanked 30 Times in 25 Posts

    Default Re: Is there any tool to scrape content off wordpress blogs?

    Well.. there are a few things I can think of. I downloaded this off of BHW I think, at the time it was completely free but I'm not sure which version it is. It's called full text RSS, it's from here: http://fivefilters.org/content-only/

    Here is the mediafire link to the version I have uploaded on my server, I don't know rules about sharing links or whatever as I've never done it but here is the mediafire link & virus total..

    So anyways, it takes whatever RSS feed and puts it into full post format.. fully preserved in formatting I believe. What I did was use backlink energizer which posts content from URL's and put in the URL that the RSS tool generated, but the problem is I think it only does up to 30 posts... can't remember. At least it's a start for a possible solution

    Other thing I can think of is using iMacros which the full version is available in the download section somewhere.. it can scrape the entire site.. you could use scrapebox or something to get all of the URL's and plug it in and scrape away.. then you can use a macro also to post it to your own blog. Can put all of the scraped HTML files into a folder and then open each individual URL on your pc like C:\blackhat\user\pages\1.html etc and post it onto your blog that way. Sorry this is kind of mangled lol, didn't sleep well last night

    Other than that I can't really think of an easy way to do it. There is probably an easier way of doing it. Another thing just thought of is you could scrape all of the URL's of the site, and then put them in the format of site:blahblah.com/URL or whatever and then use those as the keywords in the autoblog tool on your wordpress thing and have it do them all right awqy. Not sure how it would work out but hopefully it can give you some ideas on where to start

    Good luck!

  5. The Following User Says Thank You to DustinX For This Useful Post:

    daserpent (08-04-2011)

  6. #5
    kokoloko75's Avatar
    kokoloko75 is offline Elite Member
    Join Date
    Jan 2011
    Location
    Paris (France)
    Age
    25
    Posts
    1,627
    Thanks
    566
    Thanked 1,852 Times in 594 Posts

    Default Re: Is there any tool to scrape content off wordpress blogs?

    Yes, use RSS feed and WP-Robot or WP-o-Matic.
    Also, look at my old thread to create RSS feed from non-RSS website :
    Code:
    http://www.blackhatworld.com/blackhat-seo/blogging/285876-guide-any-content-jacking.html
    Beny

  7. The Following User Says Thank You to kokoloko75 For This Useful Post:

    daserpent (08-04-2011)


Similar Threads

  1. Replies: 24
    Last Post: 08-22-2009, 03:59 PM
  2. is getting wordpress MU blogs considered to be spam?
    By supermanfun in forum White Hat SEO
    Replies: 2
    Last Post: 11-27-2008, 11:30 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  




BlackHatWorld on Twitter BlackHatWorld on FaceBook


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108