Go-lang Text Processing

Back

1. align

A general purpose application that aligns text.

2. allot

Placeholder and wildcard text parsing for CLI tools and bots.

3. bafi

Universal JSON, BSON, YAML, XML translator to ANY format using templates.

4. bbConvert

Converts bbCode to HTML that allows you to add support for custom bbCode tags.

5. blackfriday

Markdown processor in Go.

6. bluemonday

HTML Sanitizer.

7. codetree

Parses indented code (python, pixy, scarlet, etc.) and returns a tree structure.

8. colly

Fast and Elegant Scraping Framework for Gophers.

9. commonregex

A collection of common regular expressions for Go.

10. dataflowkit

Web scraping Framework to turn websites into structured data.

11. did

DID (Decentralized Identifiers) Parser and Stringer in Go.

12. doi

Document object identifier (doi) parser in Go.

13. editorconfig-core-go

Editorconfig file parser and manipulator for Go.

14. enca

Minimal cgo bindings for [libenca](https://cihar.com/software/enca/).

15. encdec

Package provides a generic interface to encoders and decodersa.

16. genex

Count and expand Regular Expressions into all matching Strings.

17. github_flavored_markdown

GitHub Flavored Markdown renderer (using blackfriday) with fenced code block highlighting, clickable header anchor links.

18. go-fixedwidth

Fixed-width text formatting (encoder/decoder with reflection).

19. go-humanize

Formatters for time, numbers, and memory size to human readable format.

20. go-nmea

NMEA parser library for the Go language.

21. go-output-format

Output go structures into multiple formats (YAML/JSON/etc) in your command line app.

22. go-runewidth

Functions to get fixed width of the character or string.

23. go-slugify

Make pretty slug with multiple languages support.

24. go-toml

Go library for the TOML format with query support and handy cli tools.

25. go-vcard

Parse and format vCard.

26. go-wildcard

Simple and lightweight wildcard pattern matching.

27. go-zero-width

Zero-width character detection and removal for Go.

28. gofeed

Parse RSS and Atom feeds in Go.

29. gographviz

Parses the Graphviz DOT language.

30. gommon/bytes

Format bytes to string.

31. gonameparts

Parses human names into individual name parts.

32. goq

Declarative unmarshaling of HTML using struct tags with jQuery syntax (uses GoQuery).

33. GoQuery

GoQuery brings a syntax and a set of features similar to jQuery to the Go language.

34. goregen

Library for generating random strings from regular expressions.

35. gospider

A simple golang spider/scraping framework,build a spider in 3 lines. migrated from [goribot](https://github.com/zhshch2002/goribot)

36. gotext

GNU gettext utilities for Go.

37. guesslanguage

Functions to determine the natural language of a unicode text.

38. html-to-markdown

Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

39. htmlquery

An XPath query package for HTML, lets you extract data or evaluate from HTML documents by an XPath expression.

40. inject

Package inject provides a reflect based injector.

41. ltsv

High performance [LTSV (Labeled Tab Separated Value)](http://ltsv.org/) reader for Go.

42. mxj

Encode / decode XML as JSON or map[string]interface{}; extract values with dot-notation paths and wildcards. Replaces x2j and j2x packages.

43. normalize

Sanitize, normalize and compare fuzzy text.

44. omniparser

A versatile ETL library that parses text input (CSV/txt/JSON/XML/EDI/X12/EDIFACT/etc) in streaming fashion and transforms data into JSON output using data-driven schema.

45. pagser

Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawler.

46. podcast

iTunes Compliant and RSS 2.0 Podcast Generator in Golang

47. sdp

SDP: Session Description Protocol [[RFC 4566](https://tools.ietf.org/html/rfc4566)].

48. sh

Shell parser and formatter.

49. slug

URL-friendly slugify with multiple languages support.

50. Slugify

Go slugify application that handles string.

51. syndfeed

A syndication feed for Atom 1.0 and RSS 2.0.

52. toml

TOML configuration format (encoder/decoder with reflection).

53. gofuckyourself

A sanitization-based swear filter for Go.

54. gotabulate

Easily pretty-print your tabular data with Go.

55. kace

Common case conversions covering common initialisms.

56. parseargs-go

string argument parser that understands quotes and backslashes.

57. parth

URL path segmentation parsing.

58. radix

fast string sorting algorithm.

59. regroup

Match regex expression named groups into go struct using struct tags and automatic parsing.

60. Tagify

Produces a set of tags from given source.

61. textwrap

Implementation of `textwrap` module from Python.

62. TySug

Alternative suggestions with respect to keyboard layouts.

63. xj2go

Convert xml or json to go struct.

64. xurls

Extract urls from text.