Explore Projects

Discover 173 open source projects

Active filters (1):
Search: word×
Clear all

Showing 41-60 of 173 projects

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K
Active
Python
AI Voice & Speech
CLI Tools
Python
#speech-recognition#voice-activation#wake-word-detection

ONLYOFFICE/DesktopEditors

An open-source office suite with tools to work with documents, spreadsheets, presentations, PDFs, and PDF forms on multiple platforms.

4.5K
Active
Component Libraries (React)
API Frameworks
React
#collaboration#office#pdf

first20hours/google-10000-english

This repo contains a list of the 10,000 most common English words, useful for NLP and language modeling tasks.

4.3K
Archived
Databases
NLP
#nlp#language-modeling#dataset

RICHQAQ/PasteMD

A tool that simplifies the process of pasting Markdown and AI responses (ChatGPT/DeepSeek) into common productivity apps like Word, WPS, and Excel.

4.3K
Active
Python
AI App Builders
File Storage
Python
#clipboard#markdown#chatgpt

baidu/lac

A Chinese NLP library for tokenization, part-of-speech tagging, named entity recognition, and lexical analysis.

4.0K
Archived
C++
NLP Frameworks
API Frameworks
Java
#chinese-nlp#tokenization#part-of-speech-tagging

kevin2li/PDF-Guru

A PDF toolbox for Anki that helps developers efficiently convert knowledge from various sources into flashcards.

4.0K
Experimental
Vue
LLM Wrappers & SDKs
Tutorials & Courses
Vue
#ai-flashcards#anki-flashcards#pdf-toolbox

jasondavies/d3-cloud

A JavaScript library for creating interactive word clouds using the D3.js visualization framework.

3.9K
Stable
JavaScript
Charts & Visualization
Utilities & Libraries
D3.js
#d3#visualization#wordcloud

open-xml-templating/docxtemplater

A JavaScript library for generating Microsoft Office documents (Word, PowerPoint, Excel) from templates.

3.5K
Stable
JavaScript
Component Libraries (React)
API Frameworks
React
#docx#office#templating

tangshimin/MuJing

A Kotlin multiplatform app that helps users learn English words in the context of movies, TV shows, and documents.

3.5K
Stable
Kotlin
Tutorials & Courses
Cross-Platform
Kotlin
#english-learning#kotlin-multiplatform#compose-desktop

ownthink/Jiagu

Jiagu is a deep learning-based NLP toolkit that provides features like Chinese word segmentation, NER, sentiment analysis, and more.

3.4K
Archived
Python
NLP Frameworks
API Frameworks
Python
#chinese-nlp#natural-language-processing#word-segmentation

wolfgarbe/SymSpell

SymSpell is a lightning-fast library for spelling correction and fuzzy text search using the Symmetric Delete algorithm.

3.4K
Active
C#
API Clients & Testing
API Frameworks
#spelling-correction#fuzzy-search#text-segmentation

Kitt-AI/snowboy

A C++ library for detecting custom wake words using a deep neural network, useful for AI voice assistants.

3.4K
Archived
C++
AI Voice & Speech
API Frameworks
#voice-recognition#wake-word-detection#neural-network

facebookresearch/MUSE

A library for training multilingual word embeddings, useful for NLP tasks across languages.

3.2K
Archived
Python
LLM Frameworks
Vector Databases
Python
#nlp#embeddings#multilingual

yanyiwu/nodejieba

A Node.js library for Chinese word segmentation, providing a simple and efficient way to tokenize Chinese text.

3.2K
Stable
JavaScript
API Frameworks
#chinese#nlp#text-processing

hankcs/pyhanlp

An open-source Chinese NLP library providing state-of-the-art tools for word segmentation, dependency parsing, named entity recognition, and more.

3.2K
Archived
Python
NLP
Python
#chinese-nlp#word-segmentation#dependency-parsing

yoksel/common-words

A collection of commonly used CSS class names to help developers build user interfaces more efficiently.

3.2K
Archived
Component Libraries (React)
CLI Tools
React
#css#class-names#ui-components

konsheng/Sensitive-lexicon

A continuously updated Chinese sensitive word library to help developers and content reviewers quickly identify and filter inappropriate text.

3.2K
Stable
API Frameworks
CLI Tools
#sensitive-text-filtering#content-moderation#chinese-language

cemoody/lda2vec

LDA2Vec is a Python library for topic modeling and word embeddings, useful for vibe coders building AI-powered applications.

3.2K
Archived
Python
LLM Frameworks
NLP
Python
#topic-modeling#word-embeddings#nlp

InkTimeRecord/TTime

A multi-purpose screenshot, OCR, and translation software tool for developers working with AI tools.

3.2K
Archived
TypeScript
AI App Builders
Component Libraries (React)
React
#screenshots#ocr#translation

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors for semantic search and topic modeling.

3.1K
Archived
Python
LLM Wrappers & SDKs
API Frameworks
Python
#topic-modeling#semantic-search#sentence-embedding
124...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.