HTML::Tree::Scanning

HTML::Tree::Scanning contains an article: "Scanning HTML".
Download

HTML::Tree::Scanning Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Perl Artistic License
  • Price:
  • FREE
  • Publisher Name:
  • Sean M. Burke
  • Publisher web site:
  • http://search.cpan.org/~sburke/

HTML::Tree::Scanning Tags


HTML::Tree::Scanning Description

HTML::Tree::Scanning contains an article: "Scanning HTML". HTML::Tree::Scanning contains an article: "Scanning HTML".SYNOPSIS # This an article, not a module.The following article by Sean M. Burke first appeared in The Perl Journal #19 and is copyright 2000 The Perl Journal. It appears courtesy of Jon Orwant and The Perl Journal. This document may be distributed under the same terms as Perl itself.Scanning HTML-- Sean M. BurkeIn The Perl Journal issue 17, Ken MacFarlane's article "Parsing HTML with HTML::Parser" describes how the HTML::Parser module scans HTML source as a stream of start-tags, end-tags, text, comments, etc. In TPJ #18, my "Trees" article kicked around the idea of tree-shaped data structures. Now I'll try to tie it together, in a discussion of HTML trees.The CPAN module HTML::TreeBuilder takes the tags that HTML::Parser picks out, and builds a parse tree -- a tree-shaped network of objects...Footnote: And if you need a quick explanation of objects, see my TPJ17 article "A User's View of Object-Oriented Modules"; or go whole hog and get Damian Conway's excellent book Object-Oriented Perl, from Manning Publications....representing the structured content of the HTML document. And once the document is parsed as a tree, you'll find the common tasks of extracting data from that HTML document/tree to be quite straightforward. Requirements: · Perl


HTML::Tree::Scanning Related Software