Scrapebox problem with .edu and .gov

Discussion in 'Black Hat SEO Tools' started by kalseo, Dec 30, 2010.

  1. kalseo

    kalseo Newbie

    Joined:
    Aug 5, 2010
    Messages:
    22
    Likes Received:
    4
    Hi black hatter friends,

    Today I found that I have problem scraping .edu and .gov domains with SB. Does any of you have the same problem?

    Cheers

    Kal
     
  2. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,225
    Likes Received:
    3,373
    Location:
    zeta reticuli
    What problem you are facing exactly?
     
  3. kalseo

    kalseo Newbie

    Joined:
    Aug 5, 2010
    Messages:
    22
    Likes Received:
    4
    I am not getting any .edu or .gov domains. I am using custom footprints:

    inurl:.edu ?Powered by wordpress?
    inurl:.gov ?Powered by wordpress?

    The only results I am getting are websites that have mention those footprints.
     
  4. Yzord

    Yzord Newbie

    Joined:
    Dec 30, 2010
    Messages:
    4
    Likes Received:
    0
    I checked it out with the same footprint. Got 200 results, but a few of them are .edu's. Guess we have to use another footprint
     
  5. pasdoy

    pasdoy Senior Member

    Joined:
    Jul 17, 2008
    Messages:
    871
    Likes Received:
    281
    dont forget the keyword field...
     
  6. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,225
    Likes Received:
    3,373
    Location:
    zeta reticuli
    The footprint which you are using is correct. Make sure you are using proxies for scraping. I think there is a soft block on your IP by search engines, that's why you are not getting any results. You can also try the below given footprints.

    Edit : Oops just now saw your second update..so you are getting the results. It's common to get inappropriate result. Just export the result and remove all the non-edu sites

    site:.edu inurl:blog "Add comment" "Notify me when new comments are added" -"comments closed" -"you must be loggedin"
    site:.edu inurl:blog "Write a comment" -"comments closed" -"you must be loggedin"
    site:.edu inurl:blog "Notify me of followup comments via e-mail" -"comments closed" -"you must be loggedin"
    site:.edu in url:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
    site:.gov in url:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
    site:.gov 'Leave a Reply' 'Name (required)' 'Mail (will not be published) (required)' 'Website' + 'Keyword'
    site:.edu 'Leave a Reply' 'Name (required)' 'Mail (will not be published) (required)' 'Website' + 'Keyword'
    "site:.edu" "Powered By Wordpress" + 'keyword'
     
    • Thanks Thanks x 1
    Last edited: Dec 31, 2010
  7. kalseo

    kalseo Newbie

    Joined:
    Aug 5, 2010
    Messages:
    22
    Likes Received:
    4
    Thanks mate, those extra footprints will come very useful
     
  8. jrtaylor

    jrtaylor Newbie

    Joined:
    Dec 26, 2010
    Messages:
    25
    Likes Received:
    2
    I'm new at scrapebox.

    if I have "inurl:blog xyz" is this equivalent to a search string for the URL rather than the contents of the URL?
     
  9. artetatu

    artetatu Registered Member

    Joined:
    Apr 4, 2010
    Messages:
    82
    Likes Received:
    51
    can you give me more details?