[Guide] Generate rss feed and grab the content from any website!

alus1onZ

Regular Member
Joined
Apr 30, 2009
Messages
364
Reaction score
212
Before I star explaining the method I want to say that this is my first posted guide/method and I'm sorry if it isn't in the right section.

So here is how simple it is to get the content from any website by making a custom rss feed.

1. Grab an account at h**p://feed43*c0m (if you don't like it just search google for similar websites, but I found this one best)

2. Log in to ur feed43 account and click on the Create new feed

3. At "Adress:" fill in the url that u want to grab content from it and press Reload.

(I will make a custom feed for the recently added articles on bukisa.com)

4. U will now see the source code of the site. We need the recently added articles so we r gonna search for "recently added" in the source.

So i see that it starts with "<div id="recently_added">" and ends with "<div id="top_rated">"

When u find this u fill it in "Step 2. Define extraction rules" like so:

Code:
<div id="recently_added">{%}<div id="top_rated">
where the string before {%} is start string and after {%} is end string

5. Defining the Search pattern:

We need 3 things for the custom rss feed: 1. Link ; 2. Title ; 3. Content

The pattern should start from <div class="recently_added_img">

The link should be defined from <a href=" to "> .

The title should be defined from <h3><a href="http://www.bukisa.com/articles/ to "> , but the "165649_how-to-train-climbing-vine-on-brick-wall" and etc. , should not be seen from the feed otherwise it will be confused

The content should be defined from <p> to </p>

So we put {%} instead of "to" and {*} for skiping content. As a final result the search pattern looks like this:

Code:
<div class="recently_added_img">{*}<a href="{%}">{*}<h3><a href="http://www.bukisa.com/articles/{*}">{%}</a></h3>{*}<p>{%}</p>
Now press Extract!

6. RSS item properties

Here we will use the already defined patterns from step 5

When the extraction is done right u will be able to see the link, title and content as Items ( {%1}, {%2} and {%3} )

For "Item Title Template*:" fill in {%2} - this is the extracted title we've just got

For "Item Link Template*:" fill in {%1}

For "Item Content Template*:" fill in {%3}

Now press "Preview" and enjoy!





Here is the final result h**p://feed43*c0m/bhw.xml



Hit the
post_thanks.gif
if u like the guide :drinking2
 
Last edited:
I'm bumping the thread to be seen from more people. I think the guide will be very helpful for many of u guys/ladies.
 
Short and sweet, very nice. Why do I find myself trying to overengineer stuff, like using a spaghetti junction of Yahoo! Pipes, when an approach like this is just as good or better? Must be oldtimers...
 
Thanks, nice share.
I always using pipes yahoo to customize my own content.
 
Didn't work
Code:
 Global pattern didn't match anything
that's what I get
 
Well done. I've been using feed43 for about a year to fuel some efforts and I couldn't be happier with them.
 
Nice guide! Also Feedity is another site similar to feed43. I loled at the Yahoo Pipes comment above, So many times I have baffled myself buried in pipes up to my neck.
 
I am trying to extract feed from
Code:
[URL]http://www.moneycontrol.com/news/economy-5.html[/URL]

<title>{%}{*}title="{%}"{*}
<a href="{%}">{*}<a href="http://www.moneycontrol.com/video/economy/{*}">{%}</a>

in feed43 i get the error.
Item pattern should have at least one {%} macro defined

Can some one Please Help!

Thanks in Advance.:)
moz-screenshot.png
 
Last edited:
Www*feedbuner*c0m can burn feeds automatically no need to put html
 
Back
Top