1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Website Scraping

Discussion in 'Black Hat SEO' started by Winston Wolf, Apr 13, 2010.

  1. Winston Wolf

    Winston Wolf Newbie

    Joined:
    May 10, 2009
    Messages:
    37
    Likes Received:
    2
    Hey guys,

    Does anyone know technically what type of website information you are allowed to scrape? I know that unique content is probably a bad idea.

    But what if you are scraping technical information? Say for example you see these websites that list thousands of drivers like this -

    http://www.driverfiles.net/
    http://www.nodevice.com/
    http://www.opendrivers.com/

    I assume they probably scraped that data. Is that sort of thing allowed since it is just general information?

    Thanks.
     
  2. aftershock2020

    aftershock2020 Senior Member

    Joined:
    Oct 19, 2007
    Messages:
    981
    Likes Received:
    477
    If it is given away as public information like that, just cover your own arse with posting where the source file came from, giving them a backlink from yoursite in your copy/offering of the profile about the download and you can actually give yourself authority ranking with the SE. Google is huge on source referencing.

    As I've said over and over...SE are in the ' information ' business. The more cross-referencing you can and will do, the better it will flow in return. Most people miss that the major authority links are loaded with information but have just as many offsite references and backlinks as they have incoming.

    They just seo better to stay on top of it as the leading source to find the information and visitors remember that, as a later returning factor.

    Edit: Furthermore, giving the source like that protects you as well if the downloads are bad or expire...Be sure and make it a common practice to test these download sources at random throughout their sites to make sure they are legit...do your homework and make sure they are someone you want to be attached to from an seo and business standpoint yourself, just not for the purpose of just getting the content.

    Food for thought.
     
    • Thanks Thanks x 1
    Last edited: Apr 13, 2010
  3. Winston Wolf

    Winston Wolf Newbie

    Joined:
    May 10, 2009
    Messages:
    37
    Likes Received:
    2
    Thanks for the response, that is a good idea.

    But in this case I would just need the file name, not the file itself. Like one of the websites listed above they don't link to the actual file just a software program (Driver Detective).

    So I wonder if it would need to be sourced at all if it is just the file name being scraped.