Software to get statistics words form source code HTML

Discussion in 'Black Hat SEO Tools' started by zxc999, Oct 9, 2014.

  1. zxc999

    zxc999 Newbie

    Joined:
    Sep 9, 2014
    Messages:
    2
    Likes Received:
    0
    Im searching software to get statistics for all words used in HTML code I have to pull the common parts of HTML code in order to identify the CMS platforms - as this is solved in the Appendix Page Scanner Scrapebox FootprintFacory only can: From News FFactroy can pull statistics for News only, and I need for each element: "menu-367" "text-align: justify;" "/ node / 13" "News" In this way, the same easier I identify common parts for the CMS and the same will create a footprint for identification of the platform scrapebox_com/page-scanner (and not typical footprint) PS. Sory form my language ;)
     
  2. dannyhw

    dannyhw Senior Member

    Joined:
    Jul 16, 2008
    Messages:
    979
    Likes Received:
    468
    Occupation:
    Software Engineer
    Location:
    New York City Burbs
    If you can do some basic programming it's pretty easy with node.js and cheerio. You can use jquery selectors, but it's very very lightweight.