
Useful list of FOOTPRINTS for scraping URLs

Discussion in 'Black Hat SEO Tools' started by royalmice, Sep 3, 2010.

  1. royalmice

    royalmice BANNED BANNED

    Joined:
    Aug 23, 2007
    Messages:
    1,186
    Likes Received:
    982
    I was busy searching for a list of footprints to scrape URLs so I could add accounts to my BMD.

    I came across the below list and thought it might be useful to others:

    PLIGG
    Code:
    "http://www.pligg.com"
    "Powered by Pligg"
    "powered by pligg" Home Login "Register"
    "What Is Pligg?"
    allintitle:store share and tag your favorite links
    intitle:"Pligg Beta 9"
    intitle:"Pligg beta"
    inurl:"Pligg beta"
    inurl:"register.php"++"powered by pligg"
    inurl:/register intext:"Powered by Pligg" -inurl:.php
    inurl:/register intext:"upcoming" intext:"published" intext:"submit" intext:"Tag Cloud" -inurl:.php
    inurl:/register intext:"upcoming" intext:"published" intext:"submit" -inurl:.php
    intitle:"register"
    inurl:/register.php intext:"Powered by Pligg"
    inurl:live_comments.php
    inurl:register.php intext:"upcoming" intext:"published" intext:"submit"
    inurl:story.php inanchor:upcoming
    
    PHPDUG
    Code:
    "Powered by PHPDug"
    inurl:/upcoming/0/viewall/1.html
    - "Powered By PHPDug"
    - "Powered By PHPDug" inurl:signup
    - "Powered By PHPDug" inurl:login
    - "Powered By PHPDug" inurl:add_story
    - inurl:signup "Powered By PHPDug"
    - inurl:phpdug/signup
    - "PHPDug version 2.0.0"
    - "PHPDug version 1.4.2"
    - "PHPDug version 1.4.1"
    - "PHPDug version 1.4.0"
    - "PHPDug version 1.3.1"
    - "PHPDug version 1.3"
    - "PHPDug version 1.2"
    - "PHPDug version 1.1"
    - "PHPDug version 1.0"
    - "PHPDug Version 0.9.2"
    - "PHPDug Version 0.9.1"
    - "PHPDug Version 0.9.0"
    - "PHPDug Version 0.8.1"
    - "PHPDug Version 0.8.0"
    - "PHPDug Version 0.7.0"
    - link:http://www.kubelabs.com/phpdug/
    
    SCUTTLE
    Code:
    "Store all your favourite links in one place, accessible from anywhere"
    ?sort=alphabet_asc
    ?sort=popularity_asc
    Bookmarking the web 2.0
    intext:"bookmarks" "Store, share and tag your favourite links"
    intext:"date" "Store, share and tag your favourite links"
    intext:"first" "Store, share and tag your favourite links"
    intext:"next" "Store, share and tag your favourite links"
    intext:"Previous" "Store, share and tag your favourite links"
    intext:"register" "Store, share and tag your favourite links"
    intext:"Sort by:" "Store, share and tag your favourite links"
    intext:about "Store, share and tag your favourite links" about
    inurl:/populartags/
    inurl:?sort=url_asc
    inurl:?sort=url_asc AND "keyword"
    inurl:bookmarks.php scuttle
    inurl:by scuttlePLUS
    inurl:Populartags.php/ AND "keyword"
    inurl:scuttle/about.php
    inurl:scuttle/bookmarks.php
    inurl:scuttle/register
    inurl:scuttle/register.php
    Propulsed by SemanticScuttle
    Store, share and tag your favourite links
    "Speicher alle Deine Webseiten-Favoriten an einem Ort"
    
    EDU and GOV FORUMS
    Code:
    edu inurl:login (Create an account)
    site:edu inurl:login (Create an account)
    site:edu "powered by vbulletin"
    inurl:.edu/phpbb2
    inurl:.edu/ (Powered by Invision Power Board)
    site:edu "powered by SMF"
    edu forums sites, gov forums sites
    site:.mil
    "keyword" forum site:.edu
    "keyword" forum site:.gov
    "keyword" blog site:.gov
    inurl:.gov +inurl:forum + inurl:register
    inurl:.gov +inurl:forum
    inurl:.edu/phpbb inurl:register
    inurl:edu forum
    inurl:gov forum
    inurl:.edu+inurl:forum
    
    EDU and GOV BLOGS
    Code:
    inurl:.gov+inurl:blog
    site:.edu inurl:wp-login.php +blog
    site:.gov inurl:wp-login.php +blog
    site:.edu inurl:"wp-admin" +login
    site:.edu inurl:blog "post a comment"
    site:.edu inurl:blog "post a comment" -"comments closed" -"you must be logged in" "keyword"
    site:.edu "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
    site:.gov "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
    inurl:(edu|gov) "no comments" +blogroll -"posting closed" -"you must be logged in" -"comments are closed"
    site:.edu inurl:blog "comment" -"you must be logged in" -"posting closed" -"comment closed" "keyword"
    "keyword" blog site:.edu
    keyword +inurl:blog site:.edu
    
    EDU WIKIS
    Code:
    site:.edu wiki
    site:.edu Inurl:MediaWiki_talk
    
    WORDPRESS
    Code:
    site:.edu "Powered By Wordpress" + keyword
    "powered by wordpress"
    keyword + "powered by wordpress"
    "proudly powered by WordPress MU and BuddyPress" inurl:/register intext:username
    'Leave a Reply' 'Name "(required)"' 'Mail (will not be published) "(required)"' 'Website' + 'KEYWORD'
    
    RECENT COMMENTS
    Code:
    allintext: recent+comments
    TOP COMMENTERS:
    Code:
    allintext: "top commentators" and "powered by wordpress"
    BACKLINK SPAMMING:
    Code:
    Webalizer
    -"Generated by Webalizer Version"
    - "usage statistics" "Summary Period: August 2008"
    -inurl:usage_200811 html
    
    Awstats
    -inurl:awstats.pl intitle:statistics
    -inurl:awstats.pl intext:"Created by awstats"
    -inurl:awstats.pl intext:"Advanced Web Statistics"
    
    Hope you can make use of it.
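
A list like the one above tends to pick up repeated lines and `keyword` placeholders when it is copied around. Below is a minimal sketch of tidying it before use. The function name `prepare_footprints` is made up for illustration and is not part of any tool mentioned in this thread. It assumes footprints that differ only in letter case or spacing count as the same query, and that `keyword` / `KEYWORD` is the placeholder to swap out.

```python
def prepare_footprints(lines, keyword=None):
    """Return footprints with blanks and duplicates removed.

    Assumption: two footprints that differ only in case or spacing
    are treated as the same query.
    """
    seen = set()
    result = []
    for line in lines:
        # Collapse runs of whitespace and trim the ends.
        query = " ".join(line.split())
        if not query:
            continue
        # Optionally substitute the placeholder used in the list.
        if keyword is not None:
            query = query.replace("keyword", keyword).replace("KEYWORD", keyword)
        key = query.lower()
        if key in seen:
            continue
        seen.add(key)
        result.append(query)
    return result


if __name__ == "__main__":
    sample = [
        '"Powered by PHPDug"',
        '"Powered By PHPDug"',
        '"keyword" forum site:.edu',
    ]
    for footprint in prepare_footprints(sample, keyword="gardening"):
        print(footprint)
```

The original order is kept, so the first spelling of each footprint is the one that survives.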
     
    • Thanks Thanks x 25
  2. bhserve

    bhserve BANNED BANNED

    Joined:
    Jan 15, 2009
    Messages:
    11
    Likes Received:
    6
    Hi, thanks for the share. This will come in really handy. Good job.
     
    • Thanks Thanks x 1
  3. Doctor

    Doctor BANNED BANNED

    Joined:
    Nov 14, 2009
    Messages:
    88
    Likes Received:
    24
    Thank you for the share. I was actually looking for some footprints to use with Scrapebox, so this will come in very handy. Thanks.
     
  4. royalmice

    royalmice BANNED BANNED

    Joined:
    Aug 23, 2007
    Messages:
    1,186
    Likes Received:
    982
    Thanks, you are both welcome. Glad I can be of use.
     
  5. fjones5757

    fjones5757 Registered Member

    Joined:
    Apr 8, 2010
    Messages:
    69
    Likes Received:
    7
    good list!

    Do you know if any of these will search for publicly viewable profile pages on these platforms or have a list for those?
     
  6. sbsc

    sbsc Junior Member

    Joined:
    Oct 29, 2008
    Messages:
    168
    Likes Received:
    4
    Thanks for this list, but I have two questions:
    1) What software can be used for harvesting? Is there a free option?
    2) With this harvest we will get the exact URLs, but we need to hit the base URL. How do we get the base URL from these sites? Any clue, or a tool to handle this?
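
On the second question, one possible approach is to cut each harvested URL down to its scheme and host. This is only a sketch using Python's standard `urllib.parse`. The helper name `base_urls` is hypothetical, and it assumes the "base URL" means the site root, so a platform installed in a subfolder (for example `/scuttle/`) would lose that folder.

```python
from urllib.parse import urlsplit


def base_urls(urls):
    """Reduce full URLs to unique scheme://host/ roots, keeping order."""
    seen = set()
    bases = []
    for url in urls:
        parts = urlsplit(url.strip())
        # Skip lines that are not absolute URLs.
        if not parts.scheme or not parts.netloc:
            continue
        base = f"{parts.scheme}://{parts.netloc.lower()}/"
        if base not in seen:
            seen.add(base)
            bases.append(base)
    return bases


if __name__ == "__main__":
    harvested = [
        "http://Example.com/register.php?x=1",
        "http://example.com/story.php",
        "https://forum.example.edu/phpbb2/index.php",
    ]
    for base in base_urls(harvested):
        print(base)
```

Host names are lower-cased before comparison, so the same site harvested under different capitalisation only appears once.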
     
  7. bilbo

    bilbo Power Member

    Joined:
    Jan 26, 2009
    Messages:
    644
    Likes Received:
    1,134
    Occupation:
    an actor on wizard of oz - the 3rd munchkin
    Location:
    middle earth
  8. PetreTerror

    PetreTerror Newbie

    Joined:
    Dec 1, 2009
    Messages:
    17
    Likes Received:
    0
    Very useful, I need more.
     
  9. Delboy2424

    Delboy2424 Regular Member

    Joined:
    Oct 3, 2009
    Messages:
    452
    Likes Received:
    123
    Occupation:
    Entrepreneur
    Location:
    Peckham
    Thanks for the list, need some new ones... :)