
The Best Web Data Extractor you know

Discussion in 'Black Hat SEO' started by chris456, Dec 21, 2010.

  1. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    I would like to ask for your help: how do I find the best domain finder (searching by "keyword" in the domain name), and once you have a list of URLs for your niche, which web data extractor do you use?

    I am looking for five things:

    1. The best software, method, or database dump for finding active nightclub and strip-club domains with a keyword such as "nightclub", "stripclub", or "club" in the domain name. The keyword "club" is a problem, because it also matches football, hockey, and golf clubs, so I am also looking for a program that can filter those out. Domains containing these keywords are very likely to be the sites I am looking for.

    So far I have extracted many active domains containing nightclub, stripclub, or cabaret from domaintools_com, but they have limits, so I can't grab them all. A desktop finder, preferably free, would be more convenient.

    2. The best software or method for finding every nightclub or strip-club website (by meta tags, title, Google, etc.). The software should be very accurate (no hotels, brothels, or adult pages).

    3. Your favorite program for extracting or scraping information from them:
    Country, City, Tel, Fax, Email, Opening Hours.

    4. Software that takes a list of clubs I already have, visits only individual pages such as /contact.htm, /contact.php, /about.html, /about.php, and extracts the data from them (from the body, for example, or ideally by searching first for labels such as Address:<...>, Location:<....>, Opening Hours:<..>, Tel:<..>, Phone<......>, etc.).
    Maybe I am dreaming, but if something like this exists, where else could I ask about it if not on this forum?
    It would certainly save me a lot of time.

    5. The best way to upload this data to WordPress.
    (I would like to do that via CSV, MySQL, Excel, etc.)
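
    A minimal sketch of how item 4 could work, assuming the contact page HTML has already been downloaded by some other tool. It uses only the Python standard library. The names `CANDIDATE_PATHS`, `LABELS`, `candidate_urls`, and `extract_fields` are hypothetical, and the label patterns are guesses based on the examples in the post, not a tested rule set.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Paths to try for each club; taken from the examples in the post (assumption).
CANDIDATE_PATHS = ["/contact.htm", "/contact.php", "/about.html", "/about.php"]

# Field name -> regex for the label that introduces it (assumed patterns).
LABELS = {
    "address": r"Address|Location",
    "hours": r"Opening Hours",
    "tel": r"Tel(?:ephone)?\.?|Phone",
    "fax": r"Fax",
    "email": r"E-?mail",
}

# Tags that start a new line of visible text.
BLOCK_TAGS = {"p", "br", "div", "li", "tr", "h1", "h2", "h3", "h4", "address"}


class _TextExtractor(HTMLParser):
    """Collects visible text, breaking lines at block-level tags."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0  # depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")
        elif tag in ("td", "th"):
            self.parts.append(" ")

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1
        elif tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def candidate_urls(base_url):
    """Build the list of contact/about URLs to try for one site."""
    return [urljoin(base_url, path) for path in CANDIDATE_PATHS]


def html_to_lines(html):
    """Return non-empty lines of visible text with whitespace collapsed."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    lines = (" ".join(line.split()) for line in "".join(parser.parts).split("\n"))
    return [line for line in lines if line]


def extract_fields(html):
    """Return the first value found for each labelled field, e.g. 'Tel: ...'."""
    fields = {}
    for line in html_to_lines(html):
        for name, pattern in LABELS.items():
            match = re.match(rf"(?:{pattern})\s*:\s*(.+)", line, re.IGNORECASE)
            if match and name not in fields:
                fields[name] = match.group(1).strip()
    return fields
```

    This only catches pages that write the label and the value on the same visible line, so pages with a different layout would still need manual checking.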

    The reason: I want to find nightclubs, strip clubs, venues, and lapdance/tabledance clubs together with their details (Address, Country, Region/State/Province, City, Tel, Website, Opening Hours, etc.) and add them to my WordPress website, so that dancers can learn about conditions and easily contact the clubs about working there.
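
    The fields listed above map naturally onto CSV columns. Below is a rough sketch of writing venue records to CSV text with Python's standard `csv` module. The column names in `COLUMNS` and the function name `venues_to_csv` are hypothetical, and the extra `name` column is an assumption (the post does not list it).

```python
import csv
import io

# Column names follow the fields named in the post; "name" and the
# ordering are assumptions made for this example.
COLUMNS = ["name", "address", "country", "region", "city",
           "tel", "website", "opening_hours"]


def venues_to_csv(venues):
    """Serialize a list of dicts to CSV text.

    Missing fields become empty cells; keys not in COLUMNS are ignored.
    """
    buf = io.StringIO(newline="")
    writer = csv.DictWriter(buf, fieldnames=COLUMNS,
                            restval="", extrasaction="ignore")
    writer.writeheader()
    for venue in venues:
        writer.writerow(venue)
    return buf.getvalue()
```

    The resulting text can be saved to a file and loaded with whatever CSV import route the WordPress site uses. The `csv` module quotes values that contain commas, so addresses such as "12 Example St, Springfield" stay in one cell.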


    I have already bought licences for Scrapebox, Web Data Parser, Easy Web Extractor, Visual Web Spider, Web Data Extractor, and My Offline Browser. I have also tried Win Web Crawler and many others.
    At the moment I use every extractor I can to find domains, then add them to an extractor such as Easy Web Extractor from Web2Mine and tell it how and what to extract into CSV files. This works fine for domains already listed on some portal, but there are thousands of other domains on the World Wide Web that were never listed on any portal, and I am looking for the best way to find and extract them.

    At the moment I have about 80K website addresses (thanks to Web Data Extractor, Win Web Crawler, Visual Web Spider, etc.), but many of these sites have nothing to do with the nightclub/strip-club industry, so I have to filter them out manually, visit the domains I want, and grab the data (address, country, city, etc.) by hand, mostly by copying and pasting, which is very painful.
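
    Part of that manual filtering could be approximated with a simple keyword check on the domain name and page title. The sketch below is only an illustration: the keyword lists `STRONG`, `WEAK`, and `EXCLUDE` and the function `classify` are hypothetical, and the three-way "keep / review / drop" split is an assumption about how one might handle the ambiguous keyword "club".

```python
import re

# Keyword lists are assumptions based on the terms mentioned in the post.
STRONG = ("nightclub", "stripclub", "cabaret", "lapdance", "tabledance")
WEAK = ("club",)                         # ambiguous on its own
EXCLUDE = ("football", "hockey", "golf")  # sports clubs to filter out


def classify(domain, title=""):
    """Return 'keep', 'review', or 'drop' for one site.

    Separators are removed before matching, so 'strip-club' and
    'Strip Club' both match the keyword 'stripclub'.
    """
    text = re.sub(r"[^a-z0-9]", "", f"{domain}{title}".lower())
    if any(word in text for word in STRONG):
        return "keep"
    if any(word in text for word in EXCLUDE):
        return "drop"
    if any(word in text for word in WEAK):
        return "review"
    return "drop"
```

    For example, `classify("springfield-golf-club.org")` gives "drop", while a bare `classify("club-example.net")` gives "review" so that only the ambiguous sites are left for a manual look. Removing separators can occasionally create accidental matches across word boundaries, so the result is a first pass, not a final answer.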
    I am asking here because maybe somebody knows a better way to find and extract the data. If there is good software for this, I would gladly pay for it.
    Thanks for your replies.
     
    Last edited: Dec 22, 2010
  2. pirondi

    pirondi Power Member

    Joined:
    Jan 5, 2010
    Messages:
    562
    Likes Received:
    118
    Ubot.
     
  3. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    Will google it now, thanks for the tip :-)
     
  4. bonao

    bonao Newbie

    Joined:
    Nov 19, 2010
    Messages:
    36
    Likes Received:
    9
    Outwit Hub. It's a Firefox add-on, and you can download pretty good info from Google, Yahoo, and Bing.
    You can also scrape industry-specific associations, where all the websites listed are relevant (like American stripper associations, etc.).
     
  5. chris456

    chris456 Regular Member

    Joined:
    May 17, 2010
    Messages:
    281
    Likes Received:
    567
    Thank you very much, I will have a look at Outwit Hub. I've heard of it but have never tried it. Industry-specific associations are a very good idea, thanks a lot for that, I will definitely search there.
     
  6. jtrash01

    jtrash01 Regular Member

    Joined:
    Nov 5, 2013
    Messages:
    235
    Likes Received:
    95
    Location:
    BARCELONA, SPAIN
    UiPath seems powerful (Microsoft/.NET, seems friendly), but it is paid software.

    If programming is not a problem, you should see "stackoverflow.com/questions/2861/options-for-html-scraping"