Go-lang Natural Language Processing

1. address

Handles address representation, validation and formatting.

2. detectlanguage

Language Detection API Go Client. Supports batch requests, short phrase or single word language detection.

3. getlang

Fast natural language detection package.

4. go-i18n

Package and an accompanying tool to work with localized text.

5. go-localize

Used for translating locale strings.

6. go-mystem

Russian morphology analyzer.

7. go-nlp

Utilities for working with discrete probability distributions and other tools useful for doing NLP work.

8. go-pinyin

Chinese Hanzi to Hanyu Pinyin converter.

9. go-stem

Implementation of the Porter stemming algorithm.

10. go-unidecode

ASCII transliterations of Unicode text.

11. go2vec

Reader and utility functions for word2vec embeddings.

12. gojieba

Go implementation of [jieba](https://github.com/fxsjy/jieba), a Chinese word-splitting algorithm.

13. golibstemmer

Go bindings for the Snowball libstemmer library, including Porter 2.

14. gosentiwordnet

Sentiment analyzer in Go using the SentiWordNet lexicon.

15. gotokenizer

A tokenizer for Go based on a dictionary and bigram language models. (Currently supports only Chinese segmentation.)

16. gounidecode

Unicode transliterator (also known as unidecode) for Go.

17. govader

Go implementation of [VADER Sentiment Analysis](https://github.com/cjhutto/vaderSentiment).

18. gse

Efficient text segmentation in Go; supports English, Chinese, Japanese, and other languages.

19. icu

Cgo binding for icu4c C library detection and conversion functions. Guaranteed compatibility with version 50.1.

20. iuliia-go

Transliterate Cyrillic → Latin in every possible way.

21. kagome

Japanese morphological analyzer written in pure Go.

22. libtextcat

Cgo binding for libtextcat C library. Guaranteed compatibility with version 2.2.

23. MMSEGO

Go implementation of [MMSEG](http://technology.chtsai.org/mmseg/), a Chinese word-splitting algorithm.

24. nlp

Go Natural Language Processing library supporting LSA (Latent Semantic Analysis).

25. paicehusk

Golang implementation of the Paice/Husk Stemming Algorithm.

26. petrovich

Library that inflects Russian names to a given grammatical case.

27. porter

This is a fairly straightforward port of Martin Porter's C implementation of the Porter stemming algorithm.

28. porter2

Really fast Porter 2 stemmer.

29. prose

Library for text processing that supports tokenization, part-of-speech tagging, named-entity extraction, and more. English only.

30. RAKE.go

Go port of the Rapid Automatic Keyword Extraction Algorithm (RAKE).

31. segment

Go library for performing Unicode Text Segmentation as described in [Unicode Standard Annex #29](https://www.unicode.org/reports/tr29/).

32. sentences

Sentence tokenizer: converts text into a list of sentences.

33. shamoji

shamoji is a word-filtering package written in Go.

34. snowball

Snowball stemmer port (cgo wrapper) for Go. Provides word stem extraction functionality based on [Snowball native](http://snowball.tartarus.org/).

35. spaGO

Self-contained Machine Learning and Natural Language Processing library in Go.

36. stemmer

Stemmer packages for Go programming language. Includes English and German stemmers.

37. t

Another i18n package for Go that follows the GNU gettext style and supports .po/.mo files: `t.T (gettext)`, `t.N (ngettext)`, etc. It also includes a command-line tool, [xtemplate](https://github.com/youthlin/t/blob/main/cmd/xtemplate), which extracts messages from text/html templates into a .pot file.

38. textcat

Go package for n-gram based text categorization, with support for utf-8 and raw text.

39. transliterator

Provides one-way string transliteration with support for language-specific transliteration rules.

40. whatlanggo

Natural language detection package for Go. Supports 84 languages and 24 scripts (writing systems, e.g. Latin, Cyrillic).

41. when

Natural-language date/time parser for English and Russian with pluggable rules.