1. This website uses cookies to improve service and provide a tailored user experience. By using this site, you agree to this use. See our Cookie Policy.
    Dismiss Notice

Scrapebox, Parsehub and.... What?

Discussion in 'Black Hat SEO Tools' started by webhell, Mar 24, 2020.

  1. webhell

    webhell Junior Member

    Joined:
    Apr 25, 2013
    Messages:
    102
    Likes Received:
    25
    I know how to use scrapebox to get the meta data from a list of websites. I know how to use Parsehub to get specific details from similar pages. I can use spin software to retrieve text from sites and save into separate files.

    But i cant find a tool that searches a list of random websites and returns just "text" from the page (e.g. all chunks of text that have more than 50 characters before running into html code) in a txt/csv format.

    Anyone know one?
     
  2. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    5,302
    Likes Received:
    2,909
    Gender:
    Male
    Home Page:
    You can try the article scraper addon/plugin that is in scrapebox. You could just scrape between the Body tags, as the addon removes html anyway. You could tinker with it, but it won't specifically scrape text that is in chunks of 50 characters or more.
     
    • Thanks Thanks x 1
  3. webhell

    webhell Junior Member

    Joined:
    Apr 25, 2013
    Messages:
    102
    Likes Received:
    25
    Thanks loopline. But is that a premium addon or something? I cant see it in the regular list.
     
  4. LeadCloak

    LeadCloak Jr. VIP Jr. VIP

    Joined:
    May 15, 2018
    Messages:
    525
    Likes Received:
    218
    Gender:
    Male
    you can find it here. Article scraper is at bottom.
     
  5. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    5,302
    Likes Received:
    2,909
    Gender:
    Male
    Home Page:
    There are both. The addon, is in the addons list and it just scrapes the articles/data.

    Then there is the premium plugin which also scrapes the articles but further it includes integration with popular spinners, has its own inbulit spinner, can save off what the article actually would look like , it has a translator (uses google or bing translate), can upload articles to wordpress sites and more.

    Video for the article scraping part of both the plugin and addon, the article scraping in the video is showing it in the plugin but its the exact same in the addon that comes included free with scrapebox.



    And then for all the rest of the plugin - note that the article scraping tab in this below video is no longer like it was, it was updated to be like in the above video
     
  6. webhell

    webhell Junior Member

    Joined:
    Apr 25, 2013
    Messages:
    102
    Likes Received:
    25
    Hi loopline. Thank you for the help. But i guess ive not explained very well. The scrapebox article scraper seemed to generate a bunch of snippets from pages and exported bulk chunks into separate files.

    For example, i entered 8 urls to test it, it created 30 fields or snippets of text under two title fields, and exported everything into four files.

    What im looking for is an article scraper that will visit the url in the list, scrape the text on that page, then export that data. Either one file with the url and the text scraped from it in csv - or even separate files for each entry would be fine then i'll txt merge them into one.
     
  7. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    5,302
    Likes Received:
    2,909
    Gender:
    Male
    Home Page:
    I don't realy follow what your saying as far as what it exported, but the point of the article scraper is to export 1 file per "article". So generally speaking 1 title and 1 article per txt file. Im not sure how you set it up of course as it can be customized.

    you could try the custom data scraper that is part of the email scraper plugin, and it will export what you tell it to into excel, however its not really specifically designed for extracting articles and shuting all that into 1 excel sell/csv segment.

    So Im not sure scrapebox will do what you want and/or perhaps not the way you want it with that setup.