Python Text Processing

Back

1. chardet

Python 2/3 compatible character encoding detector.

2. difflib

(Python standard library) Helpers for computing deltas.

3. ftfy

Makes Unicode text less broken and more consistent automagically.

4. fuzzywuzzy

Fuzzy String Matching.

5. Levenshtein

Fast computation of Levenshtein distance and string similarity.

6. pangu.py

Paranoid text spacing.

7. pyfiglet

An implementation of figlet written in Python.

8. pypinyin

Convert Chinese hanzi (漢字) to pinyin (拼音).

9. textdistance

Compute distance between sequences with 30+ algorithms.

10. unidecode

ASCII transliterations of Unicode text.

11. awesome-slugify

A Python slugify library that can preserve unicode.

12. python-slugify

A Python slugify library that translates unicode to ASCII.

13. unicode-slugify

A slugifier that generates unicode slugs with Django as a dependency.

14. hashids

Implementation of [hashids](http://hashids.org) in Python.

15. shortuuid

A generator library for concise, unambiguous and URL-safe UUIDs.

16. ply

Implementation of lex and yacc parsing tools for Python.

17. pygments

A generic syntax highlighter.

18. pyparsing

A general purpose framework for generating parsers.

19. python-nameparser

Parsing human names into their individual components.

20. python-phonenumbers

Parsing, formatting, storing and validating international phone numbers.

21. python-user-agents

Browser user agent parser.

22. sqlparse

A non-validating SQL parser.