./textproc/py-html5lib, HTML5 parser and tokenizer

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 1.1nb3, Package name: py313-html5lib-1.1nb3, Maintainer: pkgsrc-users

html5lib is a pure-python library for parsing HTML. The parser is
designed to handle all flavours of HTML and parses invalid documents
using well-defined error handling rules compatible with the behaviour of
major desktop web browsers.

Output is to a tree structure; the current release supports output to
DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a
simple custom format.


Required to run:
[www/py-genshi] [textproc/py-lxml] [lang/py-six] [textproc/py-webencodings] [lang/python310]

Master sites:

Filesize: 265.835 KB

Version history: (Expand)


CVS history: (Expand)


   2025-10-09 13:59:29 by Thomas Klausner | Files touched by this commit (2)
Log message:
py-html5lib: fix build with Python 3.14

Using a comment from an upstream bug report
   2025-10-09 09:58:14 by Thomas Klausner | Files touched by this commit (442)
Log message:
*: remove reference to (removed) Python 3.9
   2025-07-05 13:44:20 by Thomas Klausner | Files touched by this commit (116)
Log message:
*: some more recursive Python restrictions on Python 3.11+

Reported in SmartOS bulk build
   2025-04-13 13:57:06 by Thomas Klausner | Files touched by this commit (2)
Log message:
py-html5lib: fix for setuptools>=78
   2024-11-11 08:29:31 by Thomas Klausner | Files touched by this commit (862)
Log message:
py-*: remove unused tool dependency

py-setuptools includes the py-wheel functionality nowadays
   2024-04-30 11:01:14 by Thomas Klausner | Files touched by this commit (8)
Log message:
py-html5lib: convert to wheel.mk.

Add upstream patches to get test suite to progress a bit further.

Bump PKGREVISION.
   2024-01-14 10:45:00 by Thomas Klausner | Files touched by this commit (9) | Package updated
Log message:
*: update for py27-lxml
   2022-11-09 14:14:32 by Joerg Sonnenberger | Files touched by this commit (223)
Log message:
Reset MAINTAINER