Skip to content

Latest commit

 

History

History
10 lines (7 loc) · 337 Bytes

README.md

File metadata and controls

10 lines (7 loc) · 337 Bytes

HTMLparser

Simple HTML parsing library for python.

TO DO

  1. Fix '\' elements in content based tag, extra state needed to process the data.
  2. Fix for extra '<' character before actual closing tag in content based.
  3. Change the cur_tag_path to list instead os string.
  4. Add parent and child traversal ability to HTMLElement.