  1. #1
    dre2027 is offline Newbies
    Join Date
    Jul 2008
    Posts
    13
    Reputation
    10
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Default whois scraper/harvester (?)

    I've been looking for a good whois scraper/harvester.

    For a start, I need something that can check 10k specific domains without getting blocked every 8-15 records, or that can at least change its IP address after every 5-10 records pulled.

    hidemyass.c0m/vpn/ can change my IP every xx minutes. That's as close as I've come to finding a way to do this. But still, xx minutes isn't good enough. I still get blocked. I need to change more frequently than every xx minutes: either by the number of records pulled or by cycling through multiple whois systems(?).

    I may be going about this all wrong.

    But if not, do you have any idea how to do whois lookups without getting banned/blocked every handful of search results? Or do you know of an all-in-one bh tool that can do the harvesting for me en masse?

    Oh. Eventually I will need to look up 250k+ specific domains. This initial 10k test group will determine whether it is worth the effort to continue with the idea.

    Thanks

  2. #2
    howdoyou is offline Regular Member
    Join Date
    Nov 2008
    Location
    Kentucky
    Age
    26
    Posts
    284
    Reputation
    8
    Thanks
    3
    Thanked 58 Times in 30 Posts

    Default Re: whois scraper/harvester (?)

    I can program you a small application that will do this. PM me if you're interested.

  3. #3
    dre2027 is offline Newbies
    Join Date
    Jul 2008
    Posts
    13
    Reputation
    10
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Default Re: whois scraper/harvester (?)

    Thanks. PM sent.

  4. #4
    doctorwar is offline Registered Member
    Join Date
    May 2009
    Posts
    61
    Reputation
    11
    Thanks
    14
    Thanked 28 Times in 13 Posts

    Default Re: whois scraper/harvester (?)


  5. The Following User Says Thank You to doctorwar For This Useful Post:

    gregstereo (11-16-2009)

  6. #5
    howdoyou is offline Regular Member
    Join Date
    Nov 2008
    Location
    Kentucky
    Age
    26
    Posts
    284
    Reputation
    8
    Thanks
    3
    Thanked 58 Times in 30 Posts

    Default Re: whois scraper/harvester (?)

    The software works now; I just need a few details from you, like how you want the list formatted.
    It switches to a new built-in proxy on every search, so there is no chance of being locked out.

    If the one posted above works, that's fine. Just let me know what you want me to do. If you still want the one I coded, let me know, along with any additional features you want added.
    Last edited by howdoyou; 11-16-2009 at 02:46 PM.

  7. The Following User Says Thank You to howdoyou For This Useful Post:

    dre2027 (11-16-2009)

  8. #6
    dre2027 is offline Newbies
    Join Date
    Jul 2008
    Posts
    13
    Reputation
    10
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Default Re: whois scraper/harvester (?)

    Hey doctorwar, I just realized I had grabbed your whois scraper from the other thread a while ago (blackhatworld.c0m/blackhat-seo/member-downloads/124531-get-onlinemarketresults-whois-email-scraper-v1.html#post1168895). It gets blocked too, at least for me.

    A new version would be killer if it could

    1) work with a list of proxy addresses instead of a single proxy address, and

    2) rotate the IP address every 4 or so requests, since 4 requests per minute per IP seems to be the max allowed by most registrars.

    Rotating the IP every third or fourth request would get around the 4-per-minute limitation. It could then grab whois records as fast as your connection speed and PC could handle. This would work provided the IP changes every 3-4 requests, or cycles in a way that the same IP is never used more than 3-4 times in any given minute.

    That would be a killer whois scraper!!!
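
    For reference, the 4-requests-per-minute figure mentioned above can also be honored client-side with a sliding-window throttle, so a single address never exceeds the limit in the first place. This is only a minimal sketch: the class name `SlidingWindowThrottle` and its parameters are made up for illustration, and the 4-per-minute value is just what was observed in this thread, not a documented limit of any registrar.

```python
import time
from collections import deque


class SlidingWindowThrottle:
    """Allow at most max_calls calls in any window of `period` seconds.

    Hypothetical helper; the defaults mirror the 4-per-minute figure
    reported in the thread, which is an assumption, not a documented limit.
    """

    def __init__(self, max_calls=4, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock    # injectable so the logic can be tested
        self.sleep = sleep
        self.stamps = deque()  # times of the most recent calls

    def wait(self):
        """Block until another call is allowed, then record it."""
        now = self.clock()
        # Forget calls that have already left the window.
        while self.stamps and now - self.stamps[0] >= self.period:
            self.stamps.popleft()
        if len(self.stamps) >= self.max_calls:
            # Sleep until the oldest recorded call leaves the window.
            self.sleep(self.period - (now - self.stamps[0]))
            now = self.clock()
            self.stamps.popleft()
        self.stamps.append(now)
        return now
```

    Calling `throttle.wait()` before each lookup lets the first four go through immediately and holds the fifth until a full minute has passed since the first.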

  10. #8
    dre2027 is offline Newbies
    Join Date
    Jul 2008
    Posts
    13
    Reputation
    10
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Default Re: whois scraper/harvester (?)

    Earlier I wrote:

    "hidemyass.c0m/vpn/ can change my ip every xx minutes. That's as close as i've come to finding a way to do this. But still, xx minutes isn't good enough. I still get blocked."

    I meant to type "I'd still get blocked" instead of "I still get blocked".

  11. #9
    doctorwar is offline Registered Member
    Join Date
    May 2009
    Posts
    61
    Reputation
    11
    Thanks
    14
    Thanked 28 Times in 13 Posts

    Default Re: whois scraper/harvester (?)

    When I get a few free minutes I can update this whois scraper to use proxy lists with delays/rotation

  12. #10
    badd is offline Regular Member
    Join Date
    Jan 2008
    Location
    Earth
    Posts
    316
    Reputation
    12
    Thanks
    16
    Thanked 33 Times in 32 Posts

    Default Re: whois scraper/harvester (?)

    I can point you in the right direction if you are still looking for this type of bot.
    Understand the value of TIME and keep working hard till you drop.

  13. #11
    doctorwar is offline Registered Member
    Join Date
    May 2009
    Posts
    61
    Reputation
    11
    Thanks
    14
    Thanked 28 Times in 13 Posts

    Default Re: whois scraper/harvester (?)

    In the meantime, what about using proxyfirewall to rotate proxies?

  14. #12
    dre2027 is offline Newbies
    Join Date
    Jul 2008
    Posts
    13
    Reputation
    10
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Default Re: whois scraper/harvester (?)

    It seems something more custom, more specific to whois searches, is needed [like your software and howdoyou's software].

    On my end I'm finding that registrars limit searches to 4 per minute before temporarily banning or delaying my IP. So maybe cycling connections by the number of whois requests, rather than by the minute, will get past the autobanning.
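
    To put that limit in perspective, here is a rough back-of-the-envelope calculation for the 10k test group and the 250k full list from the first post. It assumes a single address held to exactly 4 lookups per minute with no bans or retries; the function name `lookup_hours` is just for illustration.

```python
def lookup_hours(domains, per_minute=4):
    """Hours needed to look up `domains` at a fixed per-minute rate."""
    return domains / per_minute / 60


# 10k test group, in hours
print(round(lookup_hours(10_000), 1))        # 41.7
# 250k full list, in days
print(round(lookup_hours(250_000) / 24, 1))  # 43.4
```

    So at the observed rate, the test group alone is nearly two days of continuous lookups from one address, and the full list is over six weeks.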

    Also, I can't find a setting to make proxyfirewall or hidemy@$$ cycle per number of requests. Too bad, because I want access to that 'free' whois data!!! lol

    I thought of maybe requesting the data directly from registrars so I could set up a new search engine, but really with the goal of 'hacking' my own database remotely so I could claim someone else did it (hehe). But I bet I'd unknowingly leave a trail and still get caught. Oh well. Gotta do things legit...by scraping.

    Maybe I'm overlooking something(?).
    Last edited by dre2027; 11-18-2009 at 12:41 AM. Reason: edit for improved understanding of intent...

  15. #13
    yazzou is offline Registered Member
    Join Date
    Dec 2008
    Posts
    62
    Reputation
    12
    Thanks
    11
    Thanked 6 Times in 5 Posts

    Default Re: whois scraper/harvester (?)

    Interesting

  16. #14
    dre2027 is offline Newbies
    Join Date
    Jul 2008
    Posts
    13
    Reputation
    10
    Thanks
    2
    Thanked 0 Times in 0 Posts

    Default Re: whois scraper/harvester (?)

    d@mn! Not finding a scraper that can avoid getting banned every fourth request is costing me millions in possible lost opportunities. Too bad I'm too dang broke to do anything about it - lol

    Maybe I will set up my own search engine and create a 'reason' for InterNIC to share all that private data with me. Ok. Bad idea.

    If anyone has or knows of a tool that can extract whois contact data into CSV format and won't time out/get blocked after 3 or 4 requests, please hook a brotha up.

    Thx
    Last edited by dre2027; 12-12-2009 at 09:25 PM. Reason: forth isn't same as fourth - lol

  17. #15
    bradsteves is offline Registered Member
    Join Date
    Nov 2009
    Posts
    53
    Reputation
    10
    Thanks
    4
    Thanked 9 Times in 9 Posts

    Default Re: whois scraper/harvester (?)

    How about one of the xml/json providers:

    http://www.whoisxmlapi.com/?gclid=CM...FRhfagod4nRUqw

  18. #16
    Vinchenz is offline Newbies
    Join Date
    Jan 2010
    Posts
    2
    Reputation
    10
    Thanks
    1
    Thanked 0 Times in 0 Posts

    Default Re: whois scraper/harvester (?)

    I'm looking for something similar as well. Does anyone have any new ideas about getting unlimited whois info? I would appreciate it.

    Since this is an old thread, I thought I'd mention it here again.

    Thank you

  19. #17
    eb0la is offline Newbies
    Join Date
    Apr 2011
    Posts
    22
    Reputation
    10
    Thanks
    4
    Thanked 3 Times in 3 Posts

    Default Re: whois scraper/harvester (?)

    Have you tried Tor (The Onion Router)?
    I use it at work to scrape piratebay and it works great.

  20. #18
    gregstereo is offline Heating Engineer
    Join Date
    Oct 2009
    Location
    Moose Factory, ON
    Posts
    1,199
    Reputation
    120
    Thanks
    491
    Thanked 974 Times in 595 Posts

    Default Re: whois scraper/harvester (?)

    TOR is very risky and hideous software. I don't recommend using it - there are way better alternatives for anonymity.

    ScrapeBox has a whois scraper in it.

