Explore Projects

Discover 14 open source projects

Active filters (1):
Search: text-processingร—
Clear all

Showing 1-14 of 14 projects

learnbyexample/Command-line-text-processing

A command-line tool for text processing tasks like searching, replacing, sorting, and beautifying text.

10.2K
Archived
Shell
CLI Tools
#text-processing#awk#grep

pymupdf/PyMuPDF

A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.

9.2K
Active
Python
Document Processing
#pdf#data-extraction#text-processing

google/diff-match-patch

A high-performance library for manipulating plain text differences, matches, and patches across multiple languages.

8.1K
Archived
Python
API Clients & Testing
Text Processing
Python
#diff#match#patch

chmln/sd

A Rust-based intuitive find & replace CLI tool, an alternative to sed for text processing.

7.0K
Stable
Rust
CLI Tools
API Frameworks
#cli#command-line#regex

fastnlp/fastNLP

An extensible NLP framework for building powerful text processing and analysis applications in Python.

3.1K
Archived
Python
NLP Frameworks
API Frameworks
Python
#nlp#text-processing#deep-learning

pyparsing/pyparsing

A Python library for creating Parsing Expression Grammar (PEG) parsers for text processing.

2.5K
Active
Python
CLI Tools
API Frameworks
Python
#parsing#parsing-library#peg-parsers

kk7nc/Text_Classification

A comprehensive survey of text classification algorithms and techniques in Python.

1.8K
Experimental
Python
Text Processing
Learning & Education
Python
#text-classification#nlp#machine-learning

tjmlabs/ColiVara

ColiVara is a high-performance document retrieval system that uses vision models instead of text processing.

1.5K
Experimental
Python
Computer Vision
Search-as-a-Service
Python
#document-retrieval#vision-models#text-extraction

roshan-research/hazm

A Persian natural language processing toolkit for tasks like tokenization, lemmatization, and part-of-speech tagging.

1.4K
Stable
Python
NLP
Backend Frameworks
Python
#natural-language-processing#persian#tokenizer

helix-editor/nucleo

A fast and convenient fuzzy matcher library for Rust that can be used in various text processing applications.

1.3K
Active
Rust
API Frameworks
CLI Tools
#fuzzy-matching#fuzzy-search#performance

pemistahl/lingua-go

A highly accurate natural language detection library for Go, suitable for short text and mixed-language text.

1.3K
Experimental
Go
Backend Frameworks
CLI Tools
#language-detection#natural-language-processing#text-analysis

BurntSushi/aho-corasick

A high-performance Aho-Corasick implementation in Rust for fast substring matching and text processing.

1.2K
Stable
Rust
CLI Tools
API Frameworks
#aho-corasick#substring-matching#text-processing

birchb1024/frangipanni

A Go library to convert lines of text into a tree structure for various text processing tasks.

1.2K
Stable
Go
API Frameworks
CLI Tools
#text-processing#tree-structure#go

PyThaiNLP/pythainlp

A Thai natural language processing library for Python that provides tools for text processing, word segmentation, and more.

1.1K
Active
Python
API Frameworks
ORMs & Query Builders
Python
#natural-language-processing#thai-language#text-processing

Stay in the loop

Get weekly updates on trending AI coding tools and projects.