[FREE] PHP Proxy Extractor - Extract HUNDREDS OF THOUSANDS OF PROXIES IN MINUTES!

HealeyV3

Power Member
Joined
Mar 4, 2009
Messages
521
Reaction score
348
I've been learning PHP and making little PHP apps like Amazon Top Seller Scrapers, Best Buy Scraper, Proxy Checker, etc, and decided to release my Proxy Extraction tool for free! (For a little while at least :) )

Bulk Proxy Site Scraper - Harvest Hundreds of THOUSANDS of Proxies, within minutes.

Features:
Bulk Proxy Scraping
Full Statistic Display
Cross-Browser Compatable
Duplicate Prevention
Access to your VERY OWN scraped proxy list!
Best of all: IT USES MY RESOURCES, NOT YOURS!

2011-11-19_2338.png


Here it is: http://platformcontrol.com/proxy_extract.php

Here are example proxy links to use :
Code:
http://www.angelfire.com/pe2/total4/files/proxies17.txt
http://proxy-list.3dn.ru/Proxy/5958.txt
http://www.halcon.tv/junk/proxy.txt
http://proxy-list.3dn.ru/Proxy/5933.txt
http://c.wrzuta.pl/wo9554/be669f4b001b284d4beee05a/0/proxy.txt
http://rmccurdy.com/scripts/proxy/proxylist.txt
http://webvungtau.googlecode.com/files/proxy.txt
http://boukan-hack.persiangig.com/document/proxy%20grab.txt
http://vavricek.cs.vsb.cz/images/a/a9/Proxy-list.txt
blackhatworld.com/blackhat-seo/proxy-lists/350872-medium-high-proxy-2.html
http://g0labi.persiangig.com/baraname4/http8.txt

Test it out and tell me what you think!
I have a Proxy Checker also that I'm debating on testing with you guys....

Thoughts, questions, etc?
 
Last edited:
So 50+ People used it.... no comments?

Any errors anyone? Any problems, questions?
I'm trying to debug here ! :)
 
Looks useful, haven't got around to using it yet. Bookmarked, cheers!

Awesome! Thanks!

When you use it please let me know any type of features it's missing, or any thoughts/ comments you have about it.

Also,
Does anyone want to test out my Proxy Checker for me?
It's multi-threaded and checks proxies against a URL YOU specify and text YOU specify. If so please contact me via PM or on AIM etc, as I don't wish to fully release it yet :)
 
which url should I use? I try various url proxy list but always get 0 result
 
Last edited:
Nice. If you want to force the txt files to be downloaded instead of opened with a browser (to make it easier for the users), you could add this to your .htaccess (if you have mod_rewrite):
Code:
RewriteEngine on
RewriteRule ^download/ - [L,T=application/octet-stream]
 
Transparent / Anonymous / Elite checker? Possible ?
 
which url should I use? I try various url proxy list but always get 0 result

Can you post some of the lists you are using? Because every URL I try that has a proxy list on it works...

Here are some examples of proxy sites to use :

Code:
http://www.angelfire.com/pe2/total4/files/proxies17.txt
http://proxy-list.3dn.ru/Proxy/5958.txt
http://www.halcon.tv/junk/proxy.txt
http://proxy-list.3dn.ru/Proxy/5933.txt
http://c.wrzuta.pl/wo9554/be669f4b001b284d4beee05a/0/proxy.txt
http://rmccurdy.com/scripts/proxy/proxylist.txt
http://webvungtau.googlecode.com/files/proxy.txt
http://boukan-hack.persiangig.com/document/proxy%20grab.txt
http://vavricek.cs.vsb.cz/images/a/a9/Proxy-list.txt
blackhatworld.com/blackhat-seo/proxy-lists/350872-medium-high-proxy-2.html
http://g0labi.persiangig.com/baraname4/http8.txt

In terms of Transparent / Elite, no unfortunately. I haven't figured out that functionality yet, but I plan on having it added to my proxy checker that I'm currently de-bugging.

Paincake - Thanks for that tip. I'll try it out :)
 
Awesome! Thanks!

When you use it please let me know any type of features it's missing, or any thoughts/ comments you have about it.

Also,
Does anyone want to test out my Proxy Checker for me?
It's multi-threaded and checks proxies against a URL YOU specify and text YOU specify. If so please contact me via PM or on AIM etc, as I don't wish to fully release it yet :)

Whats the use of this tool? And, I'll try your proxy checker :)
 
Whats the use of this tool? And, I'll try your proxy checker :)

I don't understand the question? Sorry!
Are you asking what's the use of my PHP Proxy Extractor?
If so, here's why:

1. It uses CPU Resources on any VPS that it's deployed to, NOT your computers.
2. Since I have it up on MY website, it uses MY resources, not your computer's.
3. It's a Bulk Proxy Extractor, it can extract MILLIONS of proxies in only a few minutes. You simply enter in a list of URL's you have that contain proxy lists (You can EASILY use Scrapebox to do this.), and it downloads them all for you. Not only does it throw it into your own private Proxy file, it also removes all duplicates from your list for you automatically!

I hope that answers your questions. If not, if you can be a little more specific, let me know :)

-HealeyV3
 
Glad to see you a learning PHP, and that you are cool enough to let people use your stuff for free. I know that you are still learning. Offering the checker along with the extractor would be great, pretty much combining the 2. Another thing I would suggest is offering the list to pull the proxies from, instead of us needing to find our own. Also like pancake said, having the files saved to a txt file would be great.
 
Glad to see you a learning PHP, and that you are cool enough to let people use your stuff for free. I know that you are still learning. Offering the checker along with the extractor would be great, pretty much combining the 2. Another thing I would suggest is offering the list to pull the proxies from, instead of us needing to find our own. Also like pancake said, having the files saved to a txt file would be great.


Hey mate,

What exactly do you mean by having the proxies saved to a text file? They already do that. When the proxy extractor is complete, it gives your a link to a text file. You simply have to right click, and "Save Target As" . I'll try and figure out the mod re-write stuff today, or come up with another solution if this seems too confusing for the rest of the testers as well.

Alright, fuck it :)

I was going to hold off on this, because I don't know if it'll crash my VPS or not, but here it goes:

Batch Processing, Bulk Proxy Checker - USE MY SERVER, NOT YOUR PC!


2011-12-11_1405.png


1. Checks HTTP Proxies against ANY website address YOU specify.
2. Check HTTP Proxies against ANY TEXT on the website address YOU specify.
3. Set your own timeout.
4. Check up to 200 at a time
5. Use MY VPS resources to check NOT YOURS. FREE UP YOUR COMPUTER!

LINK==> http://platformcontrol.com/proxy_check.php
Code:
http://platformcontrol.com/proxy_check.php

There are STILL BUGS present in my bulk checker, as it's still in development.

It currently does NOT tell you Geographics of the IP, nor does it tell you if it's Elite , etc.

The way it works is you specify whatever website you want.... say "blackhatworld.com". You then specify whatever text you want it to verify is on the site, for instance.... "Black Hat SEO Forum". You then set your Batch Quantity (<200), and your timeout.

That's it. You let it check and it spits out the "Good" proxies in a selectable text block.

LINK==> http://platformcontrol.com/proxy_check.php

---------------------------------------------------------------------------

PLEASE : I am learning PHP and I need suggestions. If there are ANY features you think would be a good addition to the program/s , please let me know. If there are errors / bugs, PLEASE let me know.
 
Can you post some of the lists you are using? Because every URL I try that has a proxy list on it works...

Here are some examples of proxy sites to use :

Code:
http://www.angelfire.com/pe2/total4/files/proxies17.txt
http://proxy-list.3dn.ru/Proxy/5958.txt
http://www.halcon.tv/junk/proxy.txt
http://proxy-list.3dn.ru/Proxy/5933.txt
http://c.wrzuta.pl/wo9554/be669f4b001b284d4beee05a/0/proxy.txt
http://rmccurdy.com/scripts/proxy/proxylist.txt
http://webvungtau.googlecode.com/files/proxy.txt
http://boukan-hack.persiangig.com/document/proxy%20grab.txt
http://vavricek.cs.vsb.cz/images/a/a9/Proxy-list.txt
blackhatworld.com/blackhat-seo/proxy-lists/350872-medium-high-proxy-2.html
http://g0labi.persiangig.com/baraname4/http8.txt

In terms of Transparent / Elite, no unfortunately. I haven't figured out that functionality yet, but I plan on having it added to my proxy checker that I'm currently de-bugging.

Paincake - Thanks for that tip. I'll try it out :)
Sorry mate I was using my own list nut yours give results. Now I am testing them with SB and will share my conclusion after. Thank you
 
Sorry mate I was using my own list nut yours give results. Now I am testing them with SB and will share my conclusion after. Thank you

Hey... If you don't mind, could you send me 1 or 2 examples of the URL's that DIDNT work? If something is bugging out and not working correctly on URL's that SHOULD work, I'd like to know so I can fix it.

You can PM me the links if you don't want to list them here, that'd be great :)

Thanks!
 
Pretty awesome tool here. It's pretty fast as well, worked perfectly for my list. The only thing I could suggest is some multi-threading, although that's pretty complex to implement in PHP. But, it would make a great tool to sell in the future. ;)

I'd love to test out the checker as well, if you're still looking for some. Just let me know. :)
 
Here's how you detect a transparent proxy:
PHP:
if (
      $_SERVER['HTTP_X_FORWARDED_FOR']
   || $_SERVER['HTTP_X_FORWARDED']
   || $_SERVER['HTTP_FORWARDED_FOR']
   || $_SERVER['HTTP_VIA']
   || in_array($_SERVER['REMOTE_PORT'], array(8080,80,6588,8000,3128,553,554))
   || @fsockopen($_SERVER['REMOTE_ADDR'], 80, $errno, $errstr, 30))
{
   echo 'transparent'
}
else {
  echo 'elite'
}

this would have to be in one file and you'd have to use curl in another file to retrieve this first file via a proxy
I don't recommend using the last condition (@fsockopen) because it will double the time spent checking the proxy
 
Last edited:
Can u share proxy checker also please ?
 
Pretty awesome tool here. It's pretty fast as well, worked perfectly for my list. The only thing I could suggest is some multi-threading, although that's pretty complex to implement in PHP. But, it would make a great tool to sell in the future. ;)

I'd love to test out the checker as well, if you're still looking for some. Just let me know. :)

Can u share proxy checker also please ?

Just look UP


I shared the Mult-Threaded(Well, Batch really, not multi-threaded), Proxy Checker script above :)

I agree with you on the Multi-Threading, I'll get that done next :)
Just finished that up on my Proxy Checker and it's working GREAT!
 
Just look UP


I shared the Mult-Threaded(Well, Batch really, not multi-threaded), Proxy Checker script above :)

I agree with you on the Multi-Threading, I'll get that done next :)
Just finished that up on my Proxy Checker and it's working GREAT!

Damn, that's what I get for being lazy. :P Testing that tool out now, and I really look forward to the multi-threaded scraper. Great work on these!
 
Damn, that's what I get for being lazy. :P Testing that tool out now, and I really look forward to the multi-threaded scraper. Great work on these!

Thank's mate!

After these 2 projects are done, (should only be a day or two), I'll actually have run out of other things to make....

So my question to you/everyone :
What should I make next?
What do you currently NEED?
What could be made better?

I'm open to all ideas. Got a private one or something you don't want to share in the open? Please AIM/Skype/Gtalk me, I'm always up for a good brainstorming session!

Maybe something similar to the projects I've done in the past? :
Amazon Top Products Scraper (Scrapes the top tages of each BrowseNode of Amazon and gives you a list of the ASIN's to use with Amazon Associate Wordpress Sites etc).
Chinese Electronics Product Scraper (Scraped a website for ALL products, parsing title, price etc)
Best Buy Product Scraper
Amazon Product Scraper
Proxy Checker / Extractor
 
Back
Top