Machine-Learning C++ Natural Language Processing

1. BLLIP Parser

BLLIP Natural Language Parser (also known as the Charniak-Johnson parser).

2. colibri-core

C++ library, command-line tools, and Python binding for extracting and working with basic linguistic constructions such as n-grams and skipgrams (patterns with one or more gaps) in a quick and memory-efficient way.

3. frog

Memory-based NLP suite developed for Dutch. It includes a PoS tagger, lemmatizer, dependency parser, named entity recognizer (NER), shallow parser, and morphological analyzer.

4. libfolia

C++ library for the [FoLiA format](https://proycon.github.io/folia/) (Format for Linguistic Annotation).

5. MeTA

[MeTA: ModErn Text Analysis](https://meta-toolkit.org/) is a C++ data science toolkit that facilitates mining large text datasets.

6. MIT Information Extraction Toolkit

C, C++, and Python tools for named entity recognition and relation extraction.

7. ucto

Unicode-aware, regular-expression-based tokenizer for various languages, available as a command-line tool and a C++ library. Supports the FoLiA format.