Proxy Harvesting

Discussion in 'Proxies' started by cipher.stream, May 4, 2012.

  1. cipher.stream

    cipher.stream Newbie

    Joined:
    Apr 23, 2012
    Messages:
    2
    Likes Received:
    1
    Has anyone else played around with harvesting proxies from their blog spam?

    My python script sends me about 15-20/day... this is how I have it setup:

    [1] I created a bunch of blogs that are spammer targeted. Everything from electronic reviews, movie blogs, video game blogs, health, etc. I just copied a few articles to get keywords, spun them and let them sit there
    [2] I turn off Akismet on wordpress
    [3] My python script monitors the mysql database for unapproved comments. When it finds a new one it records the IP address and performs an nmap (port scan) against it. It looks for 80,8080,3128 (actually does a full 1-10000 port scan) etc and tests the IP to see if it is a proxy. The output is to a text file. I don't have a python script to test if its anonymous or google ready but that's about 15more lines of code tops.

    TODO:
    Implement back end database to store found proxies, test times, country of origin, uptime stats, etc.
    Implement SOAP web service for all of my other scripts to request a proxy depending on the task (i.e. Get a google proxie for tasks that need it but stick to Anonymous proxies for simpler tasks. Return a country specific proxy)

    Thoughts? Suggestions?
     
    • Thanks Thanks x 1
  2. the_demon

    the_demon Jr. Executive VIP

    Joined:
    Nov 23, 2008
    Messages:
    3,231
    Likes Received:
    1,596
    Occupation:
    Search Engine Marketing
    Location:
    The Internet
    Seems like a great concept especially if you fine tune it. You might want to consider making this a premium plugin for wordpress and selling it here on bhw. I'm sure you could find some customers.
     
  3. coding.solo

    coding.solo Newbie

    Joined:
    Dec 22, 2010
    Messages:
    37
    Likes Received:
    9
    Unfortunately this wouldnt be just another WP addon because of the tech stack it has. It's a great idea to passively gain some free proxies, but as OP said... These proxies suck for tasks. Ok to scrape data though.
     
  4. 7mQZT

    7mQZT Newbie

    Joined:
    Feb 25, 2012
    Messages:
    32
    Likes Received:
    4
    That's not a bad idea.And i'm sure this could be expanded to do interesting things.. hm.
     
  5. proxyblaze

    proxyblaze Jr. VIP Jr. VIP

    Joined:
    Oct 26, 2011
    Messages:
    866
    Likes Received:
    140
    Occupation:
    Technical Assistant (Wipro)
    Location:
    ProxyBlaze.com
    Home Page:
    Scanning 1-10000 ports might take very long. You can also implement in such a way that they script only scan the ips with the http ports such as 80,8080 and so on...
    Just my 2 cents.
     
  6. Light Concept

    Light Concept Newbie

    Joined:
    May 15, 2012
    Messages:
    45
    Likes Received:
    4
    Occupation:
    Engineer
    Location:
    Europe
    This is quite clever! Kudos for the creativity!