Templates are wildly used in Web sites development. Finding the template for a given set of Web pages could be very important and useful for many applications like Web page classification and monitoring content and structure changes of Web pages. In this thesis, two novel sequence-based Web page template detection algorithms are presented. Different from tree mapping algorithms which are based on tree edit distance, sequence-based template detection algorithms operate on the Prüfer/Consolidated Prüfer sequences of trees.
Download count: 0
- Partial requirement for: M.S., Arizona State University, 2011Note typethesis
- Includes bibliographical references (p. 60-62)Note typebibliography
- Field of study: Computer science