1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Can't figure out the story behind proxy servers

Discussion in 'Black Hat SEO' started by SyZygy, Jul 28, 2011.

  1. SyZygy

    SyZygy Senior Member

    Joined:
    Dec 29, 2009
    Messages:
    864
    Likes Received:
    1,080
    Location:
    Europe
    Home Page:
    I've spent a few hour's searching and reading on the forum and it's very confusing... I don't know if the problem is in my head or in the explanations :/

    I bought Scrapebox and I need Proxy servers, first time that i encounter them and I have some questions that I just can't find the right answers..

    How many proxy servers do I need?
    Some say "your are a beginner, 5 - 10 is enough", and some say that I need 50 in the start. Why is one proxy server not enough? Why do I need 50? Is Scrapebox working faster with 50 Proxys, or?

    Types of proxy servers, and what to do with them?


    I figured out that there are three types of proxy servers, one that are public, private and vpn. What's the difference in speed and work with them?

    Public ones are great for harvesting, right?
    Private ones for posting, right?
    VPN? For what are they good?

    LifeTime of a proxy server
    I found some confusing offers, it's like "every day you get new ones" or "every month you get 10 new ones". What is this? What's the lifetime of a proxy and how long can I work with them?


    Some are for Scrapebox and some are not
    I found services and posts on bhw that just say that the proxys that they offer don't work great with Scrapebox, why? Isn't a proxy - a proxy? With same specs and so on?

    Tnx for answering and I hope that I did not make a duplicate thread.
     
  2. Monrox

    Monrox Power Member

    Joined:
    Apr 9, 2010
    Messages:
    615
    Likes Received:
    579
    1 proxy = 1 IP

    When searching:
    Too many search requests from one IP get the IP temporarily blocked => you can't use it to scrape for the time being. So you need another IP.

    When posting comments:
    Some people prefer not to do it with their home IP so they use proxies.
    Some centralized antispam tools like akismet can block an IP reported by enough webmasters as spam source. So it is better to use multiple IPs to dilute the impression.

    "Public" and "private" are not actual types of proxies. One definition is that public proxies are known and used by many people, private ones are known to only a handful, ideally to just a single person and / or are protected by a password so that only the few who know that password can use them. This means that private proxies can be a lot faster because they service less people. But if one user is using a private proxy very intensively, the other users will still get very slow speeds.

    Another public vs. private proxy definition is that public proxies show your real IP, private do not. A non elite private proxy still says 'hi, i am a proxy' to the server (search engine, blog etc.) it connects to. An elite proxy's look and feel is just like a normal IP. Do not rely too heavily on private proxies for real privacy protection. The testing tools can easily be fooled and the proxy operator can change its settings in a matter of seconds; also logs can be kept, it can be a honey pot etc etc.

    It is up to you to figure out what is meant by 'public' and 'private' every time, just pay attention to the context the terms are used in.

    The life time of a proxy server is not bound to anything in general. Many public ones are open by mistake and can be fixed at anytime. Other public ones are put up on purpose: trying to steal passwords, adding a virus to opened webpages in the browser, redirecting to CPAs or other ads... As to the private proxies, it is up to the vendor to offer different lifetime packages. Sometimes it is up to what's cheaper from his supplier.

    In simple and somewhat incorrect terms a VPN is a complete OS at some distant place which you can access from e.g. your home using the Internet. You can do anything you like there, like playing solitaire but the real advantage is that VPNs have large bandwidths. Scraping and posting means a lot of transferred data so it is better to use their 20, 50 or more mbps than your limited home connection.

    Most of the time 'scrapebox proxies' means that they have been checked with scrapebox, nothing else. Some proxies prevent a single IP from using them too frequently, like no more than 1 connection per second. These will be pretty useless (as in slow) for any scraping or posting, regardless of whether using scrapebox or something else. Hence they 'can't' be used with scrapebox.
     
    • Thanks Thanks x 1
  3. SyZygy

    SyZygy Senior Member

    Joined:
    Dec 29, 2009
    Messages:
    864
    Likes Received:
    1,080
    Location:
    Europe
    Home Page:
    Thank you Monrox, that's all i need to know.
    Gave you a rep+ :)
     
  4. cooltoad

    cooltoad Senior Member

    Joined:
    Sep 10, 2010
    Messages:
    934
    Likes Received:
    549
    Occupation:
    None of your business
    Location:
    On Vacation
    Code:
    In simple and somewhat incorrect terms a VPN is a complete OS at some distant place which you can access from e.g. your home using the Internet. You can do anything you like there, like playing solitaire but the real advantage is that VPNs have large bandwidths. Scraping and posting means a lot of transferred data so it is better to use their 20, 50 or more mbps than your limited home connection.
    Good knwledge there on proxies but knowledge on VPN is misleading. You probably are giving information for VPS and not VIrtual Private Network. A VPN is also somewhat like proxy but the connection between your computer and the VPN provider is encrypted and secure and it is difficult to detect that the source data is not coming from the VPN.
    On the other hand the info given by MONROX for VPN is actually meant for VPS(Virtual Private Server)
     
  5. Monrox

    Monrox Power Member

    Joined:
    Apr 9, 2010
    Messages:
    615
    Likes Received:
    579
    Yeah the stuff is badly interchanged all the time. 'S' is a server, 'N' is a network but most will consider casually mentioned VPN to be a VPS.

    Virtual Private Server (VPS) means the OS is hosted on a computer along with other OSes used by other people and each OS can only be accessed by its owner. Cheap but potentially slow due to overselling

    Dedicated Private Server (DPS) means one client only per computer. Expensive but a lot more freedom (usually an overkill for everyday scraping needs)

    Both of the above are also what actual hosting companies are offering for normal websites. In theory even shared hosting can be used for scraping but I doubt that the host will like that or that the speed will be very impressive.

    Virtual Private Network (VPN): the connection uses the Internet infrastructure to access the Private Server. Nifty & Free

    Dedicated Private Network (DPN): the connection is through own wires. Expensive but probably useful for important stuff.

    Or something like that :D
     
  6. Frogserv

    Frogserv Regular Member

    Joined:
    Jun 21, 2011
    Messages:
    376
    Likes Received:
    180
    Occupation:
    Entrepreneur
    Location:
    Paris, FR
    5 or 50 will not change the speed of your scrapebox.
    What will change the number of comments you can make in 1 minute it's the number of connections.
    For each proxy, you can put ~10 connections in scrapebox.
    So if you have 5 proxies, you will be able to open 50 connections at once and if you have 50, you will be able to open 500 connections.

    Scrapebox does his job faster because it has more connections opened, not because you have more proxies.
    Scrapebox with 5 proxies and 50 connections will be the nearly same with 10 proxies and 50 connections.

    Take proxies near your location, they will be faster.