1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Just bought Scrapebox and want to understand how does it scrape Google! Help needed!

Discussion in 'Black Hat SEO Tools' started by SpecialOne, May 16, 2012.

  1. SpecialOne

    SpecialOne Registered Member

    Joined:
    Jan 12, 2011
    Messages:
    65
    Likes Received:
    21
    OK as my title says few days back I finally got my SB. I watched a lot of video tutorials but I still have a problem understanding how SB search Google.

    STEP 1

    For my testings I took "dog training" (*without quotes*) as my keyword, selected WordPress and scraped top ten Google results in SB. These are URL results:

    URL1.gif

    STEP 2

    Then I went to Google and manually typed "POWERED BY WORDPRESS" "Dog training", "POWERED BY WORDPRESS" Dog training, "Dog training" "POWERED BY WORDPRESS", Dog training "POWERED BY WORDPRESS".

    I received totally different results from above URLS.

    URL2.gif
    STEP 3

    Then I went step further. I checked "Custom Footprint" and in footprint field I typed "POWERED BY WORDPRESS" and used my "Dog training" (*without quotes*) in keyword box again.

    URL3.gif

    So from the results I got I learned that WordPress radio button IS NOT EQUAL "POWERED BY WORDPRESS" in Google. This is how that guy on Youtube was teaching people in his tutorials. Maybe programmers changed something since it was older version of SB.

    The thing that bothers me is that I haven't received same URLs compering STEP 2 with STEP 3 which should be exactly the same.

    Could some SB expert enlighten me what I am doing wrong? Please bear in mind that I didn't used proxies for any scraping therefore data should came from same Google data center. I am just starting using SB so I want build proper knowledge for quality backlinking and not spamming as many do. Any SB pro here who could help me?
     
  2. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    701
    Likes Received:
    1,713
    Location:
    Guangdong, China
    Home Page:
    Stick with custom footprints for scraping. You'll have better results than using the in-built Wordpess, BlogEngine, MoveableType settings. While I don't remember the exact footprint used by the built in footprints (I could check my Squid logs if it's very important), it's certainly different than "Powered by Wordpress".
     
    • Thanks Thanks x 1
  3. ija61

    ija61 Senior Member

    Joined:
    Mar 2, 2011
    Messages:
    960
    Likes Received:
    634
    Gender:
    Male
    Occupation:
    The first SEO economist:)
    Location:
    Romania
    Home Page:
    Not sure if this is the response but one motive why you get different result may be cause by the use of proxy.

    Google show different result based on search history and ip location, so if your proxy (IP) have different search history and location from your home(internet browsing) IP this may be the reason you get different result.

    Another reason may be the different google page.

    EX:
    As a default base to my IP when I open google it redirect me to google.ro and scrapebox use google.com. so if I search for the same kw google.ro give me different result then google.com

    As I said this May be some of the reason.
     
    • Thanks Thanks x 1
  4. scrapebrokers

    scrapebrokers Regular Member

    Joined:
    Aug 2, 2011
    Messages:
    224
    Likes Received:
    64
    Occupation:
    PR @ ScrapeBrokers
    Location:
    London
    Home Page:
    I totally agree with ija61 however the OP said he not used proxies. Also if you are logged in Google and +ed some sites it will show higher in the rankings.

    Also I think scrapebox use a different footprint for scrapping wordpress sites. Example: "Powered by wordpress" blackhatworld will return all kind of sites where this 2 phrases are written including: http://www.blackhatworld.com/blackhat-seo/black-hat-seo-tools/441925-just-bought-scrapebox-want-understand-how-does-scrape-google-help-needed.html (your topic). But this is not a wordpress blog is a forum, so you can not comment on. But a footprint like blackhatworld "Posted in" "Tags:" return mostly wordpress sites. Exactly what you need to blast your comments on.

    Ofcourse this are not the exact footprints, I just gave you an example but you can check the forum for tons of wordpress footprints you can use to find a lot of targeted wordpress blogs
     
    • Thanks Thanks x 1
  5. turoc

    turoc Junior Member

    Joined:
    Oct 28, 2009
    Messages:
    117
    Likes Received:
    33
    @OP - if you look at step 2 and step 3, they are basically the same, except step 3 had a couple more deep links for certain urls, and as you are only getting 10 results, the last ones on the list on step 2 would have moved onto page 2 of the results in step 3.

    That said however, I agree with GoldenGlovez & Scrapebrokers - using custom footprints is the key to success with SB.
     
    • Thanks Thanks x 1
  6. cloakndagger

    cloakndagger Power Member

    Joined:
    Oct 31, 2010
    Messages:
    613
    Likes Received:
    173
    Scrapebox without proxies is not a good idea.
     
  7. Scritty

    Scritty Elite Member Premium Member

    Joined:
    May 1, 2010
    Messages:
    2,807
    Likes Received:
    4,496
    Occupation:
    Affiliate Marketer
    Location:
    UK
    Home Page:
    No - scrapebox without proxies = stopped after 3 minutes max.

    I've used the footprint from NHSEO and Scrapejet as well as the general ones from GSA - they all give different results. None of them seem to give the same results of Scrapebox standard footprint (which has changed more than once according to the info that comes out with the updates) and none of them are "Powered by Wordpress" or at least have more than just that.

    Scritty
     
    • Thanks Thanks x 1
  8. SpecialOne

    SpecialOne Registered Member

    Joined:
    Jan 12, 2011
    Messages:
    65
    Likes Received:
    21
    OK thx for your tip! I will try to get used of custom footprints. If it doesn't take you a lot of time and effort I would really appreciated your help on finding that WP custom footprint. Just to understand SB a little bit better. But if it is a long and tedious checking then don't even bother.

    Few additional question for you since I see from your signature that you are using SB.

    How come that I got different URL results in STEP 2 and STEP 3 with same queries? One was manually and second is automated. I think I should received same results, don't you think?
    And does it make difference if I put "dog training" with quotes or dog training without quotes for my keyword in SB?
     
  9. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    701
    Likes Received:
    1,713
    Location:
    Guangdong, China
    Home Page:
    I'll take a look through my squid logs if I get a chance to see what the query is. As for the different results, it could do with the location of your search. At different times, different datacenters, return different results (phew XD). Using a "keyword" in quotation returns results that match the exact spelling and order of the words. Where as typing the keyword without quotations can return a page that contains those words but in no particular order.
     
    • Thanks Thanks x 1
  10. SpecialOne

    SpecialOne Registered Member

    Joined:
    Jan 12, 2011
    Messages:
    65
    Likes Received:
    21
    Good points there but I already excluded both of them. As I mentioned in my first post I wasn't using proxies to avoid different results. I was making sure that when I was manually typing I am in English Google and not in my local country Google. So no redirect was used.
     
  11. SpecialOne

    SpecialOne Registered Member

    Joined:
    Jan 12, 2011
    Messages:
    65
    Likes Received:
    21

    Yes I noticed that too now. There was two deep links from two domains so those other results were excluded because I put 10 URL results to be scraped. I think I might now what is ScrapeBox doing now. It is probably programmed in a smart way that when you automatically scrape for particular keyword it gives you most relevant results automatically excluding other unimportant URLs. By doing that it increases your chance for finding a place to post comments. That's my theory but maybe I am wrong!

    And btw BHW folks I already ranked my first keyword on first page of Google - dog training "POWERED BY WORDPRESS". Unintentionally... :p
     
    Last edited: May 17, 2012