Explore Projects

Discover 5 open source projects

Active filters (1):
Search: word-segmentationร—
Clear all

Showing 1-5 of 5 projects

google/sentencepiece

Unsupervised text tokenizer for neural network-based text generation and natural language processing.

11.7K
Active
C++
LLM Frameworks
#natural-language-processing#neural-machine-translation#word-segmentation

baidu/lac

A Chinese NLP library for tokenization, part-of-speech tagging, named entity recognition, and lexical analysis.

4.0K
Archived
C++
NLP Frameworks
API Frameworks
Java
#chinese-nlp#tokenization#part-of-speech-tagging

wolfgarbe/SymSpell

SymSpell is a lightning-fast library for spelling correction and fuzzy text search using the Symmetric Delete algorithm.

3.4K
Active
C#
API Clients & Testing
API Frameworks
#spelling-correction#fuzzy-search#text-segmentation

undertheseanlp/underthesea

Underthesea is a powerful Vietnamese NLP toolkit for developers working with natural language processing tasks.

1.7K
Active
Python
LLM Frameworks
API Frameworks
#vietnamese#nlp#natural-language-processing

PyThaiNLP/pythainlp

A Thai natural language processing library for Python that provides tools for text processing, word segmentation, and more.

1.1K
Active
Python
API Frameworks
ORMs & Query Builders
Python
#natural-language-processing#thai-language#text-processing

Stay in the loop

Get weekly updates on trending AI coding tools and projects.