Python Natural Language Processing

Back

1. gensim

Topic Modeling for Humans.

2. langid.py

Stand-alone language identification system.

3. nltk

A leading platform for building Python programs to work with human language data.

4. pattern

A web mining module.

5. polyglot

Natural language pipeline supporting hundreds of languages.

6. pytext

A natural language modeling framework based on PyTorch.

7. PyTorch-NLP

A toolkit enabling rapid deep learning NLP prototyping for research.

8. spacy

A library for industrial-strength natural language processing in Python and Cython.

9. Stanza

The Stanford NLP Group's official Python library, supporting 60+ languages.

10. funNLP

A collection of tools and datasets for Chinese NLP.

11. jieba

The most popular Chinese text segmentation library.

12. pkuseg-python

A toolkit for Chinese word segmentation in various domains.

13. snownlp

A library for processing Chinese text.