Useful list of FOOTPRINTS for scraping URLs

royalmice

I was busy searching for a list of footprints to scrape URLs so I could add accounts to my BMD.

I came across the list below and thought it might be useful to others:

PLIGG
Code:
"http://www.pligg.com"
"Powered by Pligg"
"powered by pligg" Home Login "Register"
"What Is Pligg?"
allintitle:store share and tag your favorite links
intitle:"Pligg Beta 9"
intitle:"Pligg beta"
inurl:"Pligg beta"
inurl:"register.php"++"powered by pligg"
inurl:/register intext:"Powered by Pligg" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" intext:"Tag Cloud" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php
inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php
intitle:"register"
inurl:/register.php intext:"Powered by Pligg"
inurl:live_comments.php
inurl:register.php intext:"upcoming" intext:"published" intext:"submit"
inurl:story.php inanchor:upcoming

PHPDUG
Code:
"Powered by PHPDug"
inurl:/upcoming/0/viewall/1.html
"Powered By PHPDug"
"Powered By PHPDug" inurl:signup
"Powered By PHPDug" inurl:login
"Powered By PHPDug" inurl:add_story
inurl:signup "Powered By PHPDug"
inurl:phpdug/signup
"PHPDug version 2.0.0"
"PHPDug version 1.4.2"
"PHPDug version 1.4.1"
"PHPDug version 1.4.0"
"PHPDug version 1.3.1"
"PHPDug version 1.3"
"PHPDug version 1.2"
"PHPDug version 1.1"
"PHPDug version 1.0"
"PHPDug Version 0.9.2"
"PHPDug Version 0.9.1"
"PHPDug Version 0.9.0"
"PHPDug Version 0.8.1"
"PHPDug Version 0.8.0"
"PHPDug Version 0.7.0"
link:http://www.kubelabs.com/phpdug/

SCUTTLE
Code:
"Store all your favourite links in one place, accessible from anywhere"
?sort=alphabet_asc
?sort=popularity_asc
Bookmarking the web 2.0
intext:"bookmarks" "Store, share and tag your favourite links"
intext:"date" "Store, share and tag your favourite links"
intext:"first" "Store, share and tag your favourite links"
intext:"next" "Store, share and tag your favourite links"
intext:"Previous" "Store, share and tag your favourite links"
intext:"register" "Store, share and tag your favourite links"
intext:"Sort by:" "Store, share and tag your favourite links"
intext:about "Store, share and tag your favourite links" about
inurl:/populartags/
inurl:?sort=url_asc
inurl:?sort=url_asc AND "keyword"
inurl:bookmarks.php scuttle
inurl:by scuttlePLUS
inurl:Populartags.php/ AND "keyword"
inurl:scuttle/about.php
inurl:scuttle/bookmarks.php
inurl:scuttle/register
inurl:scuttle/register.php
Propulsed by SemanticScuttle
Store, share and tag your favourite links
"Speicher alle Deine Webseiten-Favoriten an einem Ort"

EDU and GOV FORUMS
Code:
edu inurl:login (Create an account)
edu forums sites,gov forums sites
site:.mil
site:edu inurl:login (Create an account)
site:edu "powered by vbulletin"
inurl:.edu/phpbb2
inurl:.edu/ (Powered by Invision Power Board)
site:edu "powered by SMF"
"keyword" forum site:.edu
"keyword" forum site:.gov
"keyword" blog site:.gov
inurl:.gov +inurl:forum + inurl:register
inurl:.gov +inurl:forum
inurl:.edu/phpbb inurl:register
inurl:edu forum
inurl:gov forum
inurl:.edu+inurl:forum

EDU and GOV BLOGS
Code:
inurl:.gov+inurl:blog
site:.edu inurl:wp-login.php +blog
site:.gov inurl:wp-login.php +blog
site:.edu inurl:"wp-admin" +login
site:.edu inurl:blog "post a comment"
site:.edu inurl:blog "post a comment" -"comments closed" -"you must be logged in" "keyword"
site:.edu "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
site:.gov "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
inurl:(edu|gov) "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
site:.edu inurl:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
"keyword" blog site:.edu
keyword +inurl:blog site:.edu

EDU WIKIS
Code:
site:.edu wiki
site:.edu Inurl:MediaWiki_talk

WORDPRESS
Code:
site:.edu "Powered By Wordpress" + keyword
"powered by wordpress"
keyword + "powered by wordpress"
"proudly powered by WordPress MU and BuddyPress" inurl:/register intext:username
'Leave a Reply' 'Name "(required)"' 'Mail (will not be published) "(required)"' 'Website' + 'KEYWORD'

RECENT COMMENTS
Code:
allintext: recent+comments

TOP COMMENTERS:
Code:
allintext: "top commentators" and "powered by wordpress"

BACKLINK SPAMMING:
Code:
Webalizer
"Generated by Webalizer Version"
"usage statistics" "Summary Period: August 2008"
inurl:usage_200811.html

Awstats
inurl:awstats.pl intitle:statistics
inurl:awstats.pl intext:"Created by awstats"
inurl:awstats.pl intext:"Advanced Web Statistics"

Hope you can make use of it.
 

Doctor

Thank you for the share. I was actually looking for some footprints to use with Scrapebox, so this will come in very handy. Thanks.
 

fjones5757

Good list!

Do you know if any of these will find publicly viewable profile pages on these platforms, or do you have a list for those?
 

sbsc

Thanks for this list, but I have two questions:
1) What software can be used for harvesting? Is there a free option?
2) With this harvest we will get the exact URL of each result, but we need to hit the base URL. How do we get the base URL from these sites? Any clue, or a tool to handle this?
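
On the second question, one possible approach is to trim each harvested URL down to its scheme and host. The sketch below is only an illustration using Python's standard library, not a tool mentioned in this thread; the function names `base_url` and `unique_base_urls` and the example.com addresses are made up for the example.

```python
from urllib.parse import urlsplit


def base_url(url):
    """Return 'scheme://host/' for a full URL, or None if it has no scheme or host."""
    parts = urlsplit(url.strip())
    if not parts.scheme or not parts.netloc:
        return None
    return f"{parts.scheme}://{parts.netloc}/"


def unique_base_urls(urls):
    """Reduce a list of harvested URLs to base URLs, keeping first-seen order."""
    seen = set()
    result = []
    for url in urls:
        base = base_url(url)
        if base and base not in seen:
            seen.add(base)
            result.append(base)
    return result


if __name__ == "__main__":
    # Hypothetical harvested URLs, for illustration only.
    harvested = [
        "http://example.com/register.php?ref=1",
        "http://example.com/story.php?id=42",
        "https://blog.example.org/2010/04/post/",
    ]
    for base in unique_base_urls(harvested):
        print(base)
```

Lines without a scheme (for example a bare `example.com/page`) are skipped here rather than guessed at, so a real list may need `http://` prepended first.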
 