./textproc/py-html2text, Convert HTML into easy-to-read plain ASCII text

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 2025.4.15, Package name: py312-html2text-2025.4.15, Maintainer: schmonz

html2text is a Python script that convers a page of HTML into clean,
easy-to-read plain ASCII text. Better yet, that ASCII also happens
to be valid Markdown (a text-to-HTML format).


Required to run:
[devel/py-setuptools] [lang/python310]

Master sites:

Filesize: 62.809 KB

Version history: (Expand)


CVS history: (Expand)


   2025-04-19 22:03:44 by Thomas Klausner | Files touched by this commit (1)
Log message:
py-html2text: add missing tool dependency
   2025-04-15 18:47:20 by Amitai Schleier | Files touched by this commit (3) | Package updated
Log message:
py-html2text: update to 2025.4.15. Changes:

* Support tri-backquote style code block and fix ordered list indent.
* Fix invalid character reference parsing.
   2025-04-14 14:06:25 by Adam Ciarcinski | Files touched by this commit (16) | Package updated
Log message:
Fix PLIST after py-setuptools update; bump depends and revision
   2024-11-11 08:29:31 by Thomas Klausner | Files touched by this commit (862)
Log message:
py-*: remove unused tool dependency

py-setuptools includes the py-wheel functionality nowadays
   2024-02-28 13:34:10 by Thomas Klausner | Files touched by this commit (1)
Log message:
py-html2text: also needs py-wheel to build
   2024-02-28 12:56:39 by Amitai Schleier | Files touched by this commit (1)
Log message:
py-html2text: needs py-setuptools to build.
   2024-02-27 20:34:05 by Amitai Schleier | Files touched by this commit (2) | Package updated
Log message:
py-html2text: update to 2024.2.26. Changes:

- Fix IndexError on empty strong mark on version.
   2024-02-25 23:31:25 by Amitai Schleier | Files touched by this commit (3) | Package updated
Log message:
py-html2text: update to 2024.2.25. Changes:

* Fix #332: Insert at most one space for multiple emphasis
* Feature #318: Make padded tables more similar to pandoc's pipe_tables.
* Add support for Python 3.9.
* Fix extra line breaks inside html link text (between '[' and ']')
* Fix #344: indent ``<ul>`` inside ``<ol>`` three spaces instead of two
  to comply with CommonMark, GFM, etc.
* Fix #324: unnecessary spaces around ``<b>``, ``<em>``, and \ 
``strike`` tags.
* Don't wrap tables by default and add a ``--wrap-tables`` config option.
* Feature #198: Ignore ``<p>`` tags inside table rows.
* Don't wrap tables by default and add a ``--wrap-tables`` config option
* Remove support for Python ≤ 3.5. Now requires Python 3.6+.
* Support for Python 3.10+.
* Fix #320 padding empty tables and tables with no ``</tr>`` tags.
* Add ``ignore_mailto_links`` config option to ignore ``mailto:``
  style links.
* Feature #407: Support the superscript and subscript tags.
* Fix #373: ``\n`` inside text of a Markdown link.
* Feature #406: Improve support for null atttibute values.