Scrapebox, Parsehub and.... What?

webhell · Mar 24, 2020

I know how to use scrapebox to get the meta data from a list of websites. I know how to use Parsehub to get specific details from similar pages. I can use spin software to retrieve text from sites and save into separate files.

But i cant find a tool that searches a list of random websites and returns just "text" from the page (e.g. all chunks of text that have more than 50 characters before running into html code) in a txt/csv format.

Anyone know one?

loopline · Mar 25, 2020

webhell said:
I know how to use scrapebox to get the meta data from a list of websites. I know how to use Parsehub to get specific details from similar pages. I can use spin software to retrieve text from sites and save into separate files.

But i cant find a tool that searches a list of random websites and returns just "text" from the page (e.g. all chunks of text that have more than 50 characters before running into html code) in a txt/csv format.

Anyone know one?

You can try the article scraper addon/plugin that is in scrapebox. You could just scrape between the Body tags, as the addon removes html anyway. You could tinker with it, but it won't specifically scrape text that is in chunks of 50 characters or more.

webhell · Mar 25, 2020

Thanks loopline. But is that a premium addon or something? I cant see it in the regular list.

LeadCloak · Mar 25, 2020

webhell said:
Thanks loopline. But is that a premium addon or something? I cant see it in the regular list.

you can find it here. Article scraper is at bottom.

loopline · Mar 25, 2020

webhell said:
Thanks loopline. But is that a premium addon or something? I cant see it in the regular list.

There are both. The addon, is in the addons list and it just scrapes the articles/data.

Then there is the premium plugin which also scrapes the articles but further it includes integration with popular spinners, has its own inbulit spinner, can save off what the article actually would look like , it has a translator (uses google or bing translate), can upload articles to wordpress sites and more.

Video for the article scraping part of both the plugin and addon, the article scraping in the video is showing it in the plugin but its the exact same in the addon that comes included free with scrapebox.

And then for all the rest of the plugin - note that the article scraping tab in this below video is no longer like it was, it was updated to be like in the above video

webhell · Mar 28, 2020

Hi loopline. Thank you for the help. But i guess ive not explained very well. The scrapebox article scraper seemed to generate a bunch of snippets from pages and exported bulk chunks into separate files.

For example, i entered 8 urls to test it, it created 30 fields or snippets of text under two title fields, and exported everything into four files.

What im looking for is an article scraper that will visit the url in the list, scrape the text on that page, then export that data. Either one file with the url and the text scraped from it in csv - or even separate files for each entry would be fine then i'll txt merge them into one.

loopline · Mar 28, 2020

webhell said:
Hi loopline. Thank you for the help. But i guess ive not explained very well. The scrapebox article scraper seemed to generate a bunch of snippets from pages and exported bulk chunks into separate files.

For example, i entered 8 urls to test it, it created 30 fields or snippets of text under two title fields, and exported everything into four files.

What im looking for is an article scraper that will visit the url in the list, scrape the text on that page, then export that data. Either one file with the url and the text scraped from it in csv - or even separate files for each entry would be fine then i'll txt merge them into one.

I don't realy follow what your saying as far as what it exported, but the point of the article scraper is to export 1 file per "article". So generally speaking 1 title and 1 article per txt file. Im not sure how you set it up of course as it can be customized.

you could try the custom data scraper that is part of the email scraper plugin, and it will export what you tell it to into excel, however its not really specifically designed for extracting articles and shuting all that into 1 excel sell/csv segment.

So Im not sure scrapebox will do what you want and/or perhaps not the way you want it with that setup.

Scrapebox, Parsehub and.... What?

webhell

Junior Member

loopline

Elite Member

webhell

Junior Member

LeadCloak

Power Member

loopline

Elite Member

webhell

Junior Member

loopline

Elite Member

Main Menu

Marketplace

Making Money

BlackHat World