Explore Projects

Discover 13 open source projects

Active filters (1):
Search: text-miningร—
Clear all

Showing 1-13 of 13 projects

keon/awesome-nlp

A curated list of resources dedicated to Natural Language Processing (NLP)

18.2K
Active
LLM Frameworks
#natural-language-processing#text-mining#deep-learning

adbar/trafilatura

Gathers text and metadata from the web using crawling, scraping, and extraction techniques.

5.4K
Stable
Python
React
#web-scraping#text-extraction#metadata-gathering

deanmalmgren/textract

A Python library that provides a simple and unified interface for extracting text from any document format.

4.5K
Archived
HTML
ETL & Pipelines
CLI Tools
Python
#text-extraction#pdf#docx

jbesomi/texthero

A Python library for text preprocessing, representation, and visualization in machine learning and NLP projects.

2.9K
Archived
Python
Text Preprocessing
Text Representation
Python
#nlp#text-mining#word-embeddings

JasonKessler/scattertext

A Python library for creating beautiful visualizations of language differences across document types.

2.3K
Experimental
Python
Data Visualization
Natural Language Processing
#text-visualization#natural-language-processing#exploratory-data-analysis

konlpy/konlpy

A Python package for Korean natural language processing, useful for vibe coders building AI-powered apps.

1.5K
Archived
Python
Natural Language Processing
CLI Tools
#korean#korean-nlp#morphology

juliasilge/tidy-text-mining

A manuscript for a book on tidy text mining with R, a popular data analysis language.

1.4K
Experimental
TeX
Data & Databases
Books & Guides
#text-mining#r#tidyverse

shangjingbo1226/AutoPhrase

An open-source library for automatic high-quality phrase mining from large text corpora.

1.2K
Archived
C++
Text Mining
#text-mining#phrase-extraction#lexicon-generation

juliasilge/tidytext

A library for text mining and natural language processing using tidy data principles in R.

1.2K
Experimental
R
Data Processing
CLI Tools
R
#text-mining#natural-language-processing#tidy-data

kavgan/nlp-in-practice

Starter code for solving real-world text data problems using NLP techniques like Gensim Word2Vec and text classification.

1.2K
Archived
Jupyter Notebook
LLM Frameworks
API Frameworks
Jupyter Notebook
#gensim#machine-learning#natural-language-processing

opensemanticsearch/open-semantic-search

Open-source search and text analytics platform for exploring large document collections with semantic search and NLP

1.1K
Experimental
Shell
Search-as-a-Service
Search
#search#semantic-search#text-analytics

csurfer/rake-nltk

A Python library that provides a fast and efficient keyword extraction algorithm using NLTK.

1.1K
Archived
Python
Text Mining
CLI Tools
Python
#keyword-extraction#text-processing#nlp

nlptown/nlp-notebooks

A collection of Jupyter Notebooks for learning and experimenting with Natural Language Processing (NLP) techniques.

1.0K
Archived
Jupyter Notebook
LLM Frameworks
Tutorials & Courses
#natural-language-processing#deep-learning#text-mining

Stay in the loop

Get weekly updates on trending AI coding tools and projects.