Discussion in 'Black Hat SEO' started by olgart, Feb 2, 2013.
as in the title.
The best search engines that have a high amount of users will have CAPTCHA. To be honest, I don't think there are any that doesn't use CAPTCHA services if there are too many requests.
Just use proxies you will not face this problem, I harvest at 100 threads with 1500 proxies and never get hit for captcha, the only programme I have ever seen ask for captcha is spyglass, I think that was because I had the proxy settings wrong.
Came here to say exactly this.
Use proxies. They're cheap, and you'll stop getting the captcha's completely.
Gotta use proxies if you're serious about scraping.
Make your own search engine.
Think of it this way: any search engine that didn't throw up captchas for making 10 requests/second from one IP probably wouldn't be around long enough to abuse anyways. Proxies are definitely the way to go here.
need a truckload of proxies. private proxies are useless (as you can never buy that many). scrapebox has some proxy sources, plus you can add some sources of your own, buy proxy lists from HMA and so on.
Separate names with a comma.