1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

[BETA] Bulk Proxy Site Scraper - Harvest Hundreds of THOUSANDS of proxies FAST.

Discussion in 'Black Hat SEO Tools' started by HealeyV3, Nov 21, 2011.

  1. HealeyV3

    HealeyV3 Power Member

    Joined:
    Mar 4, 2009
    Messages:
    521
    Likes Received:
    345
    Hi everyone!

    I've been learning PHP, so I decided to start making some simple tools.

    Bulk Proxy Site Scraper - Harvest Hundreds of THOUSANDS of Proxies, within minutes.

    Features:
    • Bulk Proxy Scraping
    • Full Statistic Display
    • Cross-Browser Compatable
    • Duplicate Prevention
    • Access to your VERY OWN scraped proxy list!

    [​IMG]

    To try it out, simply go to :
    Code:
    http://www.platformcontrol.com/proxy_extract.php
    Enter in a list of Proxy Server Websites.

    Don't have Proxy Server Websites?
    Here are 280+ :
    Code:
    http://pastebin.com/T68GKXJr
    (Have Scrapebox? Simple harvest
    Code:
    ":80" proxy[CODE] to get a list!)
    
    The script will be up for 24-48 hours for testing.
    
    I'd appreciate anyone that experiences errors posting here so that we can get them all fixed! 
    
    I'm also interested in any suggestions / comments anyone has.
     
    • Thanks Thanks x 5
  2. Spawn

    Spawn Jr. VIP Jr. VIP

    Joined:
    Jun 20, 2009
    Messages:
    1,180
    Likes Received:
    391
    Occupation:
    Quality Articles for $1
    Home Page:
    looks good, does it check the proxys also?
     
  3. HealeyV3

    HealeyV3 Power Member

    Joined:
    Mar 4, 2009
    Messages:
    521
    Likes Received:
    345
    Hi, thanks!

    No, unfortunately, it does not have proxy checking available at this time.
    That's actually a separate script I've already developed :)
     
  4. cooltoad

    cooltoad Senior Member

    Joined:
    Sep 10, 2010
    Messages:
    934
    Likes Received:
    551
    Occupation:
    None of your business
    Location:
    On Vacation
    @HealeyV3 : The tool does the job pretty well. Now only if it could test the proxies and it would have been a golden nugget!
     
    • Thanks Thanks x 1
  5. HealeyV3

    HealeyV3 Power Member

    Joined:
    Mar 4, 2009
    Messages:
    521
    Likes Received:
    345
    How usefull of a tool would that be to you? I assume there are already a ton of proxy checkers out there....

    What functionality would you want in one? Does it have to be able to tell you if it's SOCKS etc, or just HTTP Proxies? If it's just HTTP, I think I could whip something together that tests in batches.... hm. I'll see what I can do.
     
  6. cooltoad

    cooltoad Senior Member

    Joined:
    Sep 10, 2010
    Messages:
    934
    Likes Received:
    551
    Occupation:
    None of your business
    Location:
    On Vacation
    I guess very useful. For starters I won't have to use my computers resources t check on proxies :p
    And when I say checking proxies, I mean to check if they are alive or not. Agreed there are many services like samair.ru that lets you check proxy status, but then they are slow. If you can come up with something that is fast or even better e-mail it once the checking is completed (that would be unique eh:cool:)

    Anyways, maybe I am asking too much :p
    But there is no doubt that the proxy extraction tool is working real good at the moment.
     
    • Thanks Thanks x 1
  7. HealeyV3

    HealeyV3 Power Member

    Joined:
    Mar 4, 2009
    Messages:
    521
    Likes Received:
    345
    Working on a new Proxy Checker as we speak :)
    Hopefully i'll have it set up later today.
     
  8. GoogleAlchemist

    GoogleAlchemist Regular Member

    Joined:
    Nov 25, 2009
    Messages:
    249
    Likes Received:
    28
    Occupation:
    Bad Ass SEO Consultant
    Location:
    Wherever I want
    Home Page:
    I'll check it out, but...and I in no way mean this to be snarky, what is the benefit/difference in using this vs scrape?
     
  9. pandanelu

    pandanelu Newbie

    Joined:
    Nov 12, 2011
    Messages:
    39
    Likes Received:
    2
    nice one ! as cooltoad, it's an ease on my pc
     
    • Thanks Thanks x 1
  10. MoGreen

    MoGreen Regular Member

    Joined:
    Mar 28, 2009
    Messages:
    284
    Likes Received:
    27
    Location:
    USA
    would definitely like to have the proxy check function. Thanks.
     
    • Thanks Thanks x 1
  11. HealeyV3

    HealeyV3 Power Member

    Joined:
    Mar 4, 2009
    Messages:
    521
    Likes Received:
    345
    By Scrape I assume you're referring to Scrapebox, please correct me if I'm wrong.

    Scrapebox is/was a revolutionary program in the Internet Marketing community, and I am personally an avid user of it. That being said, I do feel that mine is faster, and more user-friendly. It's also an easily deployable php script that can be instant-deployed on most servers.

    For those of you that wanted a proxy function... well...

    Stay tuned :)

    I have one that is 95% ready for testing.
    It's "basic" right now, meaning that it only has a few simple functions:
    It checks a "Search" query on a "Website" query, based on your "Batch Processing" (Sort of like Multi-threading... but not :) ), to see if it matches the correct results. This functionality itself sets my proxy checker aside from others currently on the market.

    The ability to test SPECIFIC webpages with a proxy is HUGE.
    Example: I know someone in the "Ticket Brokering" community (IE Scalper). He uses private proxies to connect/buy tickets from ticketmaster.com . He told me public proxies won't work on TM etc etc.... So I took one of my massive proxy lists, and simply ran it against a custom check of ticketmaster.com. While most public domain proxies are garbage for MOST IM things (Google Scraping, etc), he ended up getting THOUSANDS of usable proxies for his service.

    My proxy checking script is in a pretty infant stage right now, but here's a screenshot anyways :

    [​IMG]

    I need to remove some debugging crap, and add a "Timeout" option.
    It runs WELL at around 200 proxy checks at a time, but no more than that really.

    Once I get the debugging crap taken out, I'll put it live to have you guys test it.

    Right now it only checks to see if the proxy works, and works with more HTTP Proxies. I'm going to also add response time, nationality, Co-DEEN check.

    Aynwho, let me know what you guys think :)
     
  12. cooltoad

    cooltoad Senior Member

    Joined:
    Sep 10, 2010
    Messages:
    934
    Likes Received:
    551
    Occupation:
    None of your business
    Location:
    On Vacation
    Request: Codeen proxy checking is a great addon. Now just the one feature that will rock the tool will be checking if the passed proxies are Google passed. Non-SB users are having a hard time finding Non-Google blocked proxies.
     
  13. chuck59

    chuck59 Registered Member

    Joined:
    Oct 17, 2009
    Messages:
    58
    Likes Received:
    9
    Occupation:
    Software Engineer
    Location:
    Karachi
    Home Page:
    Using
    www dot checker dot freeproxy dot ru slash checker
    could be a good idea?
     
  14. chuck59

    chuck59 Registered Member

    Joined:
    Oct 17, 2009
    Messages:
    58
    Likes Received:
    9
    Occupation:
    Software Engineer
    Location:
    Karachi
    Home Page:
    Thanks for sharing this script, it could help saving a lot of scrape time :)
     
  15. starvingKicker

    starvingKicker Newbie

    Joined:
    Sep 7, 2010
    Messages:
    15
    Likes Received:
    0
    nice. thanks for this. looks very useful. been looking for something like this.
     
  16. Porsche

    Porsche Junior Member

    Joined:
    Oct 8, 2009
    Messages:
    160
    Likes Received:
    83
    Location:
    Reputation: 999999
    Does this still works?