Scrapebox problem with .edu and .gov

Discussion in 'Black Hat SEO Tools' started by kalseo, Dec 30, 2010.

  1. kalseo

    kalseo Newbie

    Joined:
    Aug 5, 2010
    Messages:
    22
    Likes Received:
    4
    Hi black hatter friends,

    Today I found that I have a problem scraping .edu and .gov domains with SB. Do any of you have the same problem?

    Cheers

    Kal
     
  2. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    What problem are you facing exactly?
     
  3. kalseo

    kalseo Newbie

    Joined:
    Aug 5, 2010
    Messages:
    22
    Likes Received:
    4
    I am not getting any .edu or .gov domains. I am using custom footprints:

    inurl:.edu "Powered by wordpress"
    inurl:.gov "Powered by wordpress"

    The only results I am getting are websites that mention those footprints.
     
  4. Yzord

    Yzord Newbie

    Joined:
    Dec 30, 2010
    Messages:
    4
    Likes Received:
    0
    I checked it out with the same footprint. Got 200 results, but only a few of them are .edu's. I guess we have to use another footprint.
     
  5. pasdoy

    pasdoy Power Member

    Joined:
    Jul 17, 2008
    Messages:
    727
    Likes Received:
    231
    Don't forget the keyword field...
     
  6. cyberzilla

    cyberzilla Elite Member Premium Member

    Joined:
    Nov 15, 2009
    Messages:
    2,204
    Likes Received:
    3,363
    Location:
    zeta reticuli
    The footprint you are using is correct. Make sure you are using proxies for scraping. I think the search engines have put a soft block on your IP, and that's why you are not getting any results. You can also try the footprints below.

    Edit: Oops, I just saw your second update, so you are getting results. It's common to get irrelevant results. Just export the results and remove all the non-.edu sites.

    site:.edu inurl:blog "Add comment" "Notify me when new comments are added" -"comments closed" -"you must be loggedin"
    site:.edu inurl:blog "Write a comment" -"comments closed" -"you must be loggedin"
    site:.edu inurl:blog "Notify me of followup comments via e-mail" -"comments closed" -"you must be loggedin"
    site:.edu inurl:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
    site:.gov inurl:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
    site:.gov "Leave a Reply" "Name (required)" "Mail (will not be published) (required)" "Website" + "keyword"
    site:.edu "Leave a Reply" "Name (required)" "Mail (will not be published) (required)" "Website" + "keyword"
    site:.edu "Powered By Wordpress" + "keyword"
     
    Last edited: Dec 31, 2010
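    The clean-up step suggested above (export the results, then drop everything that is not a .edu or .gov site) could be sketched roughly as follows. This is only an illustration, not something Scrapebox itself provides: the function name `keep_tlds` and the sample URLs are made up for the example, and it assumes the export is a plain list of URLs, one per entry. It checks the hostname rather than the whole URL, so pages that merely mention ".edu" in the path or query string are filtered out.

```python
from urllib.parse import urlsplit


def keep_tlds(urls, suffixes=(".edu", ".gov")):
    """Return only the URLs whose hostname ends with one of the suffixes.

    `keep_tlds` is a hypothetical helper written for this example.
    """
    suffixes = tuple(suffixes)
    kept = []
    for raw in urls:
        url = raw.strip()
        if not url:
            continue  # skip blank lines in the export
        # urlsplit only fills in the hostname when a scheme is present,
        # so add one for parsing if the exported line lacks it.
        parseable = url if "://" in url else "http://" + url
        host = (urlsplit(parseable).hostname or "").lower()
        if host.endswith(suffixes):
            kept.append(url)  # keep the URL exactly as it was exported
    return kept


if __name__ == "__main__":
    # Made-up sample data standing in for an exported result list.
    sample = [
        "http://blog.example.edu/post-1",
        "http://example.com/blog?ref=.edu",
        "https://www.example.gov/news",
        "example.edu/page",
        "http://notedu.com/.gov/",
    ]
    for url in keep_tlds(sample):
        print(url)
```

    Matching on the hostname is a deliberate choice here: the original inurl:.edu footprint matches ".edu" anywhere in the URL, which is one likely reason so many unrelated sites showed up in the results.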
  7. kalseo

    kalseo Newbie

    Joined:
    Aug 5, 2010
    Messages:
    22
    Likes Received:
    4
    Thanks mate, those extra footprints will come in very useful.
     
  8. jrtaylor

    jrtaylor Newbie

    Joined:
    Dec 26, 2010
    Messages:
    25
    Likes Received:
    2
    I'm new to Scrapebox.

    If I have "inurl:blog xyz", does that search for the string in the URL itself rather than in the contents of the page?
     
  9. artetatu

    artetatu Registered Member

    Joined:
    Apr 4, 2010
    Messages:
    82
    Likes Received:
    51
    Can you give me more details?