html2data

A simple way to transform a HTML file or URL to structured data
Download

html2data Ranking & Summary

Advertisement

  • Rating:
  • License:
  • BSD License
  • Publisher Name:
  • Daniel Perez Rada
  • Publisher web site:
  • http://zappedy.com

html2data Tags


html2data Description

A simple way to transform a HTML file or URL to structured data html2data offers a simple way to transform a HTML file or URL to structured data. For example:>>> ## start the console>>> from html2data import html2data>>> html = """< !DOCTYPE html >< html lang="en" >< head >< /head > < body > < h1 >< b >Title< /b >< /h1 > < div class="description" >This is not a valid HTML < /body > < /html >""">>> config = { 'map': , /text()'], ] }>>> handler = html2data()>>> received_obj = handler.load(html = html, config=config)>>> print received_obj{ 'body_title': 'Title', 'description': 'This is not a valid HTML'} Requirements: · Python · lxml · httplib2


html2data Related Software