1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Software to get statistics words form source code HTML

Discussion in 'Black Hat SEO Tools' started by zxc999, Oct 9, 2014.

  1. zxc999

    zxc999 Newbie

    Joined:
    Sep 9, 2014
    Messages:
    2
    Likes Received:
    0
    Im searching software to get statistics for all words used in HTML code I have to pull the common parts of HTML code in order to identify the CMS platforms - as this is solved in the Appendix Page Scanner Scrapebox FootprintFacory only can: From News FFactroy can pull statistics for News only, and I need for each element: "menu-367" "text-align: justify;" "/ node / 13" "News" In this way, the same easier I identify common parts for the CMS and the same will create a footprint for identification of the platform scrapebox_com/page-scanner (and not typical footprint) PS. Sory form my language ;)
     
  2. dannyhw

    dannyhw Senior Member

    Joined:
    Jul 16, 2008
    Messages:
    980
    Likes Received:
    462
    Occupation:
    Software Engineer
    Location:
    New York City Burbs
    If you can do some basic programming it's pretty easy with node.js and cheerio. You can use jquery selectors, but it's very very lightweight.