Python HTML Manipulation

Back

1. BeautifulSoup

Providing Pythonic idioms for iterating, searching, and modifying HTML or XML.

2. bleach

A whitelist-based HTML sanitization and text linkification library.

3. cssutils

A CSS library for Python.

4. html5lib

A standards-compliant library for parsing and serializing HTML documents and fragments.

5. lxml

A very fast, easy-to-use and versatile library for handling HTML and XML.

6. MarkupSafe

Implements a XML/HTML/XHTML Markup safe string for Python.

7. pyquery

A jQuery-like library for parsing HTML.

8. untangle

Converts XML documents to Python objects for easy access.

9. WeasyPrint

A visual rendering engine for HTML and CSS that can export to PDF.

10. xmldataset

Simple XML Parsing.

11. xmltodict

Working with XML feel like you are working with JSON.