Path to this page:
./
textproc/py-html5lib,
HTML5 parser and tokenizer
Branch: CURRENT,
Version: 1.1nb3,
Package name: py313-html5lib-1.1nb3,
Maintainer: pkgsrc-usershtml5lib is a pure-python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.
Output is to a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.
Required to run:[
www/py-genshi] [
textproc/py-lxml] [
lang/py-six] [
textproc/py-webencodings] [
lang/python310]
Master sites:
Filesize: 265.835 KB
Version history: (Expand)
- (2025-10-09) Updated to version: py313-html5lib-1.1nb3
- (2025-04-13) Updated to version: py312-html5lib-1.1nb3
- (2024-11-11) Updated to version: py312-html5lib-1.1nb2
- (2024-04-30) Updated to version: py311-html5lib-1.1nb2
- (2024-01-14) Updated to version: py311-html5lib-1.1nb1
- (2022-11-09) Updated to version: py310-html5lib-1.1nb1
CVS history: (Expand)