1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

URL Harvesting Footprints

Discussion in 'Black Hat SEO' started by surajprakash31, Apr 22, 2009.

  1. surajprakash31

    surajprakash31 Regular Member

    Joined:
    Oct 7, 2008
    Messages:
    261
    Likes Received:
    459
    Home Page:
    I'm launching a url harvester, for which I need some default footprints for harvesting particular urls, like for finding Wordpress blogs, we can use the following footprint:

    ‘Leave a Reply' ‘Name "(required)"‘ ‘Mail (will not be published) "(required)"‘ ‘Website' + ‘Your Keyword'

    I have already added wordpress, pligg, vBulletin, phpbb to my harvester. Can anyone please provide some more footprints which they know?
     
  2. kelldog17

    kelldog17 Registered Member

    Joined:
    Nov 12, 2008
    Messages:
    78
    Likes Received:
    28
    maybe something for .edu forums..blogs!?
     
  3. sonneti

    sonneti Regular Member

    Joined:
    Jan 27, 2009
    Messages:
    205
    Likes Received:
    127
    Just include this in your query

    site:.edu
     
  4. pennyb

    pennyb Junior Member

    Joined:
    Aug 14, 2008
    Messages:
    119
    Likes Received:
    267
    Location:
    Necropolis
    site:edu inurl:login (Create an account)
    site:edu "powered by SMF"
    site:edu "powered by Fireboard"
    site:edu "powered by phpbb
    inurl:"register.php"++"powered by pligg"
    inurl:title "powered by pligg "
    inurl:live_comments.php
    "powered by pligg" Home Login "Register"
    "site:.edu" "Powered By Wordpress" + keyword
    "Speicher alle Deine Webseiten-Favoriten an einem Ort"
    "Store all your favourite links in one place"
    "store, share and tag your favourite links"
    Store all your favourite links in one place, accessible from anywhere.
    Share your bookmarks with everyone, with friends on your watchlist or just keep them private.
    Tag your bookmarks with as many labels as you want, instead of wrestling with folders.
    inurl:scuttle/about.php
    inurl:scuttle/register.php
    inurl:scuttle/register
    inurl: pliggbeta9
    intitle: powered by pligg
    ?sort=popularity_asc
    ?sort=alphabet_asc
    inurl:?sort=url_asc
    inurl:by scuttlePLUS
    etc...

    i was also using something like this username+scuttle where username is the username of someone who is spamming on a lot of bookmarking sites then i just collect the links where he is posting
     
    • Thanks Thanks x 1
    Last edited: Apr 22, 2009
  5. neta1o

    neta1o Regular Member

    Joined:
    Sep 29, 2008
    Messages:
    388
    Likes Received:
    318
    Home Page:
    Been done over and over
    http://www.blackhatworld.com/blackh...y-free-url-scrape-tool-finished-just-now.html
    http://www.blackhatworld.com/blackhat-seo/downloads/40553-get-neta1o-scraper.html

    Allow custom footprint entry etc...