1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Google Spyware Warning ..

Discussion in 'BlackHat Lounge' started by pac1984, Dec 10, 2008.

  1. pac1984

    pac1984 Registered Member

    Joined:
    Sep 16, 2008
    Messages:
    92
    Likes Received:
    14
    Location:
    Leeds UK
    Anyone got any methods to how to stop Google thinking your a bot and banning your ass! It really pisses me off when im searching through lots of pages
     
  2. thanhclix

    thanhclix Power Member

    Joined:
    Oct 25, 2008
    Messages:
    646
    Likes Received:
    176
    I don't get you.
    Ban from google but which section, google search or google adsense?
     
  3. antx16

    antx16 Power Member

    Joined:
    Nov 25, 2007
    Messages:
    672
    Likes Received:
    1,536
    just use tor for a while and change your identity every now and then
     
  4. The Scarlet Pimp

    The Scarlet Pimp Jr. VIP Jr. VIP Premium Member

    Joined:
    Apr 2, 2008
    Messages:
    787
    Likes Received:
    3,119
    Occupation:
    Chair moistener.
    Location:
    Cyberspace
    maybe if you turn off your cookies?
     
  5. stylesb

    stylesb Regular Member

    Joined:
    Nov 7, 2008
    Messages:
    272
    Likes Received:
    31
    Location:
    Straight Outta Compton
    SLOWWW DOWN...

    u really need to be hitting google a lot to get banned and get a captcha all the time...

    it happens at work randomly but thats with 5 in house SEOs checking google against our keyword sheets
     
  6. oldenstylehats

    oldenstylehats Elite Member Premium Member

    Joined:
    Apr 10, 2008
    Messages:
    1,893
    Likes Received:
    1,196
    They've started to introduce a lot of new query filters in the last six months. In some cases you may be getting blocked because of how your query is formed, regardless of how quickly, frequently, or simultaneously the queries are being made from the same IP address. While an extremely large number of simultaneous connections from the same IP address will cause the CAPTCHA to pop, there are also many legitimate queries simultaneously coming from the same IP address, they aren't as strict about this as one might expect. (Think about public schools and libraries with older networks and small budgets.) If they can identify a specific query type as having very little substantial value to real users, they either neuter it or make it complicated to scrape.

    If you're programming scraping utilities yourself, one thing you might want to take into consideration is staggering your queries such that you're not scraping page after page after page of the same query, but rather a page of a query, a page of another query, a page of a third query, then the second page of the second query, second page of the first query and so on. Spreading query types across proxies is another method of avoiding the CAPTCHA. We've actually experienced what appears to be the dynamic generation of new query filters in-the-wild over the course of only a few minutes. Google are good at identifying generalized patterns, but not until those patterns have been introduced to the system. By being impatient and rampantly attacking a specific query type, you're almost ensuring that it will be blocked. Worse than that is that once a generalized pattern of abusive querying is identified, the speed at which similar but different queries are identified also increases. The more information that is stored on a specific query type, the faster the filters can be generated.
     
    Last edited: Dec 10, 2008
  7. pac1984

    pac1984 Registered Member

    Joined:
    Sep 16, 2008
    Messages:
    92
    Likes Received:
    14
    Location:
    Leeds UK
    yeah its mainly checking loads of pages

    For instance checking Digg pages for PR . can literally do 1000s of pages very quick using SEOquake toolbar and scanning the serps
     
  8. trophaeum

    trophaeum Senior Member

    Joined:
    Dec 21, 2007
    Messages:
    1,189
    Likes Received:
    706
    3 words

    rotate
    data
    centers

    get as many google dc ip's as you can and randomly rotate them, you wont get banned across them all at the same time normally
     
    • Thanks Thanks x 2