Good footprints to scrape EDU blogs with scrapebox?

Discussion in 'Black Hat SEO Tools' started by maxken75, Feb 14, 2012.

  1. maxken75

    maxken75 Senior Member

    Joined:
    Nov 3, 2009
    Messages:
    872
    Likes Received:
    34
    Good footprints to scrape EDU blogs with scrapebox?
     
  2. ericbenson81

    ericbenson81 Junior Member

    Joined:
    Jun 8, 2011
    Messages:
    183
    Likes Received:
    171
    site:.edu "leave a reply"
     
  3. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    912
    Likes Received:
    2,052
    Occupation:
    Implementing Crazy Ideas
    Location:
    Taiwan
    Home Page:
    site:.edu "Name" "Website" "Leave a Comment"
    site:.edu "Name" "Website" "Leave a Response"
    site:.edu "Name" "Website" "Leave a Reply"
    site:.edu "Name" "Website" "Post Comment"
    site:.edu "Name" "Website" "Post Response"
    site:.edu "Name" "Website" "Post Reply"

    Those should get you started. Add %KW% to the front of each of those and use the 'Merge' feature with your keyword list to harvest them at the same time.
    Ex: %KW% site:.edu "Name" "Website" "Leave a Comment"
     
    • Thanks Thanks x 3
  4. maxken75

    maxken75 Senior Member

    Joined:
    Nov 3, 2009
    Messages:
    872
    Likes Received:
    34
    What's mean "name" and "website"
    can you do an example ready merged?
     
  5. GoldenGlovez

    GoldenGlovez Moderator Staff Member Moderator Jr. VIP

    Joined:
    Mar 23, 2011
    Messages:
    912
    Likes Received:
    2,052
    Occupation:
    Implementing Crazy Ideas
    Location:
    Taiwan
    Home Page:
    "Name" and "Website" are used to find pages that contain both of those words as well as "Leave a Comment". This helps cut back on returning a lot of forum pages that have just "Leave a Comment" and instead is more likely to find blogs which can be posted to with Scrapebox.

    Example:
    Keyword1 site:.edu "Name" "Website "Leave a Comment"
    Keyword2 site:.edu "Name" "Website "Leave a Comment"
     
  6. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,943
    Occupation:
    Design director
    Location:
    Paris (France)
    Footprint :
    Code:
    cialis viagra "leave a reply" "website" "wordpress" "2012 at" inurl:"?p=" site:.edu
    Result example :
    Code:
    http://pages.uoregon.edu/badmin/?p=21
    Lol, this shit can be improved :D
    But it's a start...

    Beny
     
    • Thanks Thanks x 1
  7. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,943
    Occupation:
    Design director
    Location:
    Paris (France)
    And of course when you find a good auto-approved URL, scrape all other indexed page form this source.

    If the website has a Sitemap or RSS feed for pages, it's easy.
    Otherwise, you must use a footprint like (for my example URL above) :
    Code:
    inurl:"?p=" inurl:"badmin" site:pages.uoregon.edu
    Hop ! 30+ auto-approved pages.

    Beny