1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Get only 10 URL from SB Scraping

Discussion in 'Black Hat SEO' started by TrollDad, Dec 4, 2014.

  1. TrollDad

    TrollDad Regular Member

    Joined:
    Jul 5, 2012
    Messages:
    257
    Likes Received:
    62
    Hi There, I got a vps and semi dedicated proxies for scrapping, I tried to use "site:blackhatworld" and "seo" as footprint to scrape but get only 10 urls, same case with other footprint, any idea?
     
  2. FBGuru

    FBGuru Senior Member

    Joined:
    Sep 22, 2013
    Messages:
    928
    Likes Received:
    1,171
    Location:
    Personality Type : ESTP
    Enter 1000 in the "Results:" box which is located right above the proxy list.

    [​IMG]
     
    • Thanks Thanks x 2
  3. zhonglingyu8

    zhonglingyu8 Newbie

    Joined:
    Nov 29, 2014
    Messages:
    20
    Likes Received:
    11
    I think most of those proxies are blacklisted by Google so it's only few urls show after scrape one keyword. Check them again by SB if have any blacklisted just contact proxy provider to replace them, if they won't try another proxy providers.
     
    • Thanks Thanks x 1
  4. browserfox

    browserfox Newbie

    Joined:
    Dec 4, 2014
    Messages:
    5
    Likes Received:
    1
    Try as FBGuru suggested, and also, test your proxies before scraping.
     
    • Thanks Thanks x 1
  5. Aty

    Aty Jr. VIP Jr. VIP

    Joined:
    Jan 27, 2011
    Messages:
    5,416
    Likes Received:
    3,701
    Home Page:
    You need the .com after blackhatworld.
     
    • Thanks Thanks x 1
  6. xrfanatic

    xrfanatic Jr. VIP Jr. VIP Premium Member

    Joined:
    Aug 28, 2010
    Messages:
    371
    Likes Received:
    166
    Location:
    http://bit.ly/slb64
    Home Page:
    May I ask what you are trying to scrape ? :)
     
  7. the_demon

    the_demon Jr. Executive VIP

    Joined:
    Nov 23, 2008
    Messages:
    3,177
    Likes Received:
    1,563
    Occupation:
    Search Engine Marketing
    Location:
    The Internet
    Use GScraper & Their proxy service for scraping. You'll get huge lists and it's way more powerful than ScrapeBox when it comes to scraping lists.
     
    • Thanks Thanks x 1
  8. TrollDad

    TrollDad Regular Member

    Joined:
    Jul 5, 2012
    Messages:
    257
    Likes Received:
    62
    20 proxies are all google passed
     
  9. TrollDad

    TrollDad Regular Member

    Joined:
    Jul 5, 2012
    Messages:
    257
    Likes Received:
    62
    QQ截图20141205125616.png

    seems got 700+ urls but all removed
     
  10. FBGuru

    FBGuru Senior Member

    Joined:
    Sep 22, 2013
    Messages:
    928
    Likes Received:
    1,171
    Location:
    Personality Type : ESTP
    You removed duplicate domains. Select "Remove Duplicate URL's" instead and you'll be golden.

    Also, if you need to scrape more URL's, try adding some keywords to the keywords list.

    Checkout loopline's channel to learn about using Scrapebox effectively.

    https://www.youtube.com/user/looplinescrapebox/videos
     
    • Thanks Thanks x 1
  11. ID Internet Marketer

    ID Internet Marketer Senior Member

    Joined:
    Jan 22, 2013
    Messages:
    938
    Likes Received:
    1,442
    Occupation:
    Blackhatworld Member
    Location:
    My Private ***
    keep in mind, when you use ONLY 1 footprint and or 1 keyword, in the end the total max about 1000 urls. won't get millions.

    i saw footprint for site:xxx.com, if you wanted collecting a site's urls better use sitemap generator software etc.
     
    • Thanks Thanks x 1