Useful list of FOOTPRINTS for scraping URLs

Discussion in 'Black Hat SEO Tools' started by royalmice, Sep 3, 2010.

  1. royalmice

    royalmice BANNED BANNED

    Joined:
    Aug 23, 2007
    Messages:
    1,186
    Likes Received:
    983
    I was busy searching for a list of footprints to scrape URLs to add accounts to my BMD.

    I came across the below list and thought it might be useful to others:

    PLIGG
    Code:
    "http://www.pligg.com"
    "Powered by Pligg"
    "powered by pligg" Home Login "Register"
    "What Is Pligg?"
    allintitle:store share and tag your favorite links
    intitle:"Pligg Beta 9"
    intitle:"Pligg beta"
    inurl:"Pligg beta"
    inurl:"register.php"++"powered by pligg"
    inurl:/register intext:"Powered by Pligg" -inurl:.php
    inurl:/register intext:"upcoming" intext:"published" intext:"submit" intext:"Tag Cloud" -inurl:.php
    inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php
    intitle:"register"
    inurl:/register.php intext:"Powered by Pligg"
    inurl:live_comments.php
    inurl:register.php intext:"upcoming" intext:"published" intext:"submit"
    inurl:story.php inanchor:upcoming
    
    PHPDUG
    Code:
    "Powered by PHPDug"
    inurl:/upcoming/0/viewall/1.html
    - "Powered By PHPDug"
    - "Powered By PHPDug" inurl:signup
    - "Powered By PHPDug" inurl:login
    - "Powered By PHPDug" inurl:add_story
    - inurl:signup "Powered By PHPDug"
    - inurl:phpdug/signup
    - "PHPDug version 2.0.0"
    - "PHPDug version 1.4.2"
    - "PHPDug version 1.4.1"
    - "PHPDug version 1.4.0"
    - "PHPDug version 1.3.1"
    - "PHPDug version 1.3"
    - "PHPDug version 1.2"
    - "PHPDug version 1.1"
    - "PHPDug version 1.0"
    - "PHPDug Version 0.9.2"
    - "PHPDug Version 0.9.1"
    - "PHPDug Version 0.9.0"
    - "PHPDug Version 0.8.1"
    - "PHPDug Version 0.8.0"
    - "PHPDug Version 0.7.0"
    - link:http://www.kubelabs.com/phpdug/
    
    SCUTTLE
    Code:
    "Store all your favourite links in one place, accessible from anywhere"
    ?sort=alphabet_asc
    ?sort=popularity_asc
    Bookmarking the web 2.0
    intext:"bookmarks" "Store, share and tag your favourite links"
    intext:"date" "Store, share and tag your favourite links"
    intext:"first" "Store, share and tag your favourite links"
    intext:"next" "Store, share and tag your favourite links"
    intext:"Previous" "Store, share and tag your favourite links"
    intext:"register" "Store, share and tag your favourite links"
    intext:"Sort by:" "Store, share and tag your favourite links"
    intext:about "Store, share and tag your favourite links" about
    inurl:/populartags/
    inurl:?sort=url_asc
    inurl:?sort=url_asc AND "keyword"
    inurl:bookmarks.php scuttle
    inurl:by scuttlePLUS
    inurl:Populartags.php/ AND "keyword"
    inurl:scuttle/about.php
    inurl:scuttle/bookmarks.php
    inurl:scuttle/register
    inurl:scuttle/register.php
    Propulsed by SemanticScuttle
    Store, share and tag your favourite links
    "Speicher alle Deine Webseiten-Favoriten an einem Ort"
    
    EDU and GOV FORUMS
    Code:
    edu inurl:login (Create an account)
    edu forums sites,gov forums sites
    site:.mil
    site:edu inurl:login (Create an account)
    site:edu "powered by vbulletin"
    inurl:.edu/phpbb2
    inurl:.edu/ (Powered by Invision Power Board)
    site:edu "powered by SMF"
    "keyword" forum site:.edu
    "keyword" forum site:.gov
    "keyword" blog site:.gov
    inurl:.gov +inurl:forum + inurl:register
    inurl:.gov +inurl:forum
    inurl:.edu/phpbb inurl:register
    inurl:edu forum
    inurl:gov forum
    inurl:.edu+inurl:forum
    
    EDU and GOV BLOGS
    Code:
    inurl:.gov+inurl:blog
    site:.edu inurl:wp-login.php +blog
    site:.gov inurl:wp-login.php +blog
    site:.edu inurl:"wp-admin" +login
    site:.edu inurl:blog "post a comment"
    site:.edu inurl:blog "post a comment" -"comments closed" -"you must be logged in" "keyword"
    site:.edu "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
    site:.gov "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
    inurl:(edu|gov) "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
    site:.edu inurl:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
    "keyword" blog site:.edu
    keyword +inurl:blog site:.edu
    
    EDU WIKIS
    Code:
    site:.edu wiki
    site:.edu Inurl:MediaWiki_talk
    
    WORDPRESS
    Code:
    site:.edu "Powered By Wordpress" + keyword
    "powered by wordpress"
    keyword + "powered by wordpress"
    "proudly powered by WordPress MU and BuddyPress" inurl:/register intext:username
    'Leave a Reply' 'Name "(required)"' 'Mail (will not be published) "(required)"' 'Website' + 'KEYWORD'
    
    RECENT COMMENTS
    Code:
    allintext: recent+comments
    
    TOP COMMENTERS
    Code:
    allintext: "top commentators" and "powered by wordpress"
    
    BACKLINK SPAMMING
    Code:
    Webalizer
    -"Generated by Webalizer Version"
    - "usage statistics" "Summary Period: August 2008"
    -inurl:usage_200811 html
    
    Awstats
    -inurl:awstats.pl intitle:statistics
    -inurl:awstats.pl intext:"Created by awstats"
    -inurl:awstats.pl intext:"Advanced Web Statistics"
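    
    Several of the footprints above contain a keyword placeholder, and a few lines repeat. A minimal Python sketch of how the list could be cleaned up and the placeholder filled in, assuming the footprints are held as plain strings. The name expand_footprints is hypothetical and not part of any tool mentioned here; the sketch only builds query strings and does not run any searches.

```python
import re

# Matches the placeholder word in any case: keyword, "keyword", 'KEYWORD'
PLACEHOLDER = re.compile(r"\bkeyword\b", re.IGNORECASE)

def expand_footprints(footprints, keywords):
    """Fill the placeholder with each keyword; drop blanks and duplicates, keep order."""
    queries = {}  # dict used as an ordered set
    for line in footprints:
        fp = " ".join(line.split())  # collapse runs of whitespace
        if not fp:
            continue  # skip empty lines
        if PLACEHOLDER.search(fp):
            for kw in keywords:
                # a function replacement avoids backslash handling in kw
                queries.setdefault(PLACEHOLDER.sub(lambda _m: kw, fp), None)
        else:
            queries.setdefault(fp, None)
    return list(queries)
```

    Footprints without a placeholder pass through unchanged, so the same call works for the whole list.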
    
    Hope you can make use of it.
     
  2. bhserve

    bhserve BANNED BANNED

    Joined:
    Jan 15, 2009
    Messages:
    11
    Likes Received:
    6
    Hi, thanks for the share, this will come in really handy. Good job.
     
  3. Doctor

    Doctor BANNED BANNED

    Joined:
    Nov 14, 2009
    Messages:
    88
    Likes Received:
    24
    Thank you for the share. I was actually looking for some footprints to use with Scrapebox, so this will come in very handy, thanks.
     
  4. royalmice

    royalmice BANNED BANNED

    Joined:
    Aug 23, 2007
    Messages:
    1,186
    Likes Received:
    983
    Thanks, you are both welcome. Glad I can be of use.
     
  5. fjones5757

    fjones5757 Registered Member

    Joined:
    Apr 8, 2010
    Messages:
    69
    Likes Received:
    7
    good list!

    Do you know if any of these will search for publicly viewable profile pages on these platforms or have a list for those?
     
  6. sbsc

    sbsc Junior Member

    Joined:
    Oct 29, 2008
    Messages:
    168
    Likes Received:
    4
    Thanks for this list, but I have two questions:
    1) What software can be used for harvesting? Is there a free option?
    2) With this harvest we will get the exact URLs that matched, but we need to hit the base URL. How do we get the base URL from these sites? Any clue, or a tool to handle this?
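    
    On question 2, one possible approach is to cut each harvested URL down to its scheme and host. A minimal Python sketch using the standard library's urllib.parse, assuming "base URL" means scheme plus host with a trailing slash. The names base_url and unique_base_urls are made up for illustration and are not taken from any harvesting tool.

```python
from urllib.parse import urlsplit

def base_url(url):
    """Return scheme://host/ for an absolute URL, dropping path, query and fragment."""
    parts = urlsplit(url.strip())
    if not parts.scheme or not parts.netloc:
        raise ValueError(f"not an absolute URL: {url!r}")
    return f"{parts.scheme}://{parts.netloc}/"

def unique_base_urls(urls):
    """Collapse a harvested list to one base URL per site, in first-seen order."""
    seen = {}  # dict used as an ordered set
    for url in urls:
        try:
            seen.setdefault(base_url(url), None)
        except ValueError:
            continue  # skip lines that are not absolute URLs
    return list(seen)
```

    For example, base_url("http://example.edu/forum/register.php?x=1") gives "http://example.edu/". Sites installed in a subfolder would need the first path segment kept as well, which this sketch does not attempt.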
     
  7. bilbo

    bilbo Power Member

    Joined:
    Jan 26, 2009
    Messages:
    649
    Likes Received:
    1,140
    Occupation:
    an actor on wizard of oz - the 3rd munchkin
    Location:
    middle earth
  8. PetreTerror

    PetreTerror Newbie

    Joined:
    Dec 1, 2009
    Messages:
    17
    Likes Received:
    0
    Very useful, I need more.
     
  9. Delboy2424

    Delboy2424 Regular Member

    Joined:
    Oct 3, 2009
    Messages:
    452
    Likes Received:
    123
    Occupation:
    Entrepreneur
    Location:
    Peckham
    Thanks for the list, need some new ones... :)