Explore Projects

Discover 1,335 open source projects

Active filters (1):

Search: text×

Clear all

Showing 501-520 of 1,335 projects

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K

Archived

AI Voice & Speech

#speech-recognition#speech-synthesis#language-modeling

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors for semantic search and topic modeling.

3.1K

Archived

Python

LLM Wrappers & SDKs

API Frameworks

Python

#topic-modeling#semantic-search#sentence-embedding

kepano/flexoki

Custom inky color scheme for various terminals and editors

3.1K

Active

CSS

Component Libraries (React)

Tailwind

#color-scheme#terminal-colors#inky

SamurAIGPT/AI-Youtube-Shorts-Generator

A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

3.1K

Stable

Python

Video Diffusion

Text-to-Video Generation

#ai-video-generator#video-generation#video-editor

VOICEVOX/voicevox

An open-source text-to-speech software that enables high-quality, free-to-use voice generation.

3.0K

Active

TypeScript

AI Voice & Speech

TypeScript

#text-to-speech#voice-generation#open-source

IAHispano/Applio

A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.

3.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-to-speech#text-to-speech#voice-conversion

rahulnyk/knowledge_graph

Convert text to knowledge graph for Graph Augmented Generation

3.0K

Experimental

Jupyter Notebook

AI Editors/Agents/Copilot

#Knowledge Graph#Graph Augmented Generation#QnA

breezedeus/Pix2Text

An open-source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them to Markdown.

3.0K

Experimental

Jupyter Notebook

Computer Vision

File Storage

PyTorch

#ocr#latex#math-formula-recognition

Alexey-T/CudaText

Cross-platform text editor written in Free Pascal, suitable for general-purpose text editing tasks.

3.0K

Active

Python

IDE Extensions

Frontend Frameworks

#cross-platform#text-editor#pascal

speaches-ai/speaches

An open-source library for converting speech to text using OpenAI's Whisper AI model, with Docker support.

3.0K

Active

Python

AI Voice & Speech

API Frameworks

Docker

#speech-to-text#whisper-ai#openai-api

remirror/remirror

A toolkit for building rich text editors in React, with a focus on extensibility and flexibility.

3.0K

Active

TypeScript

Component Libraries (React)

API Frameworks

React

#rich-text-editor#prosemirror#react-component

rsxdalv/TTS-WebUI

A versatile WebUI for various AI-powered text-to-speech engines, enabling vibe coders to explore and utilize cutting-edge audio generation tools.

3.0K

Active

TypeScript

AI Voice & Speech

Component Libraries (React)

React

#text-to-speech#audio-generation#ai-tools

evgenyneu/keychain-swift

A secure, cross-platform library for storing text data in the Keychain on iOS, macOS, tvOS, and watchOS.

3.0K

Archived

Swift

General Utilities

iOS

Swift

#keychain#security#cross-platform

enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.

3.0K

Archived

Python

LLM Frameworks

AI Voice & Speech

PyTorch

#text-to-speech#tts#audio-lm

urwid/urwid

A console user interface library for Python, providing a set of tools for building text-based user interfaces.

3.0K

Active

Python

Component Libraries (React)

CLI Tools

React

#ui#console#text-based

halilb/react-native-textinput-effects

A React Native library with custom text input animations and UI effects for iOS and Android.

3.0K

Archived

JavaScript

Animation & Motion

Component Libraries (React)

React

#react-native#text-input#animations

CatchTheTornado/text-extract-api

An API for extracting, anonymizing, and parsing text from various document formats using state-of-the-art OCR and LLM models.

3.0K

Stable

Python

LLM Wrappers & SDKs

API Clients & Testing

Python

#anonymization#ocr#pdf

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K

Experimental

AI Voice & Speech

AI App Builders

ESP-IDF

#alexa#google-home#speech-recognition

x4nth055/pythoncode-tutorials

A collection of Python tutorials covering a wide range of topics from computer vision to network security.

3.0K

Stable

Jupyter Notebook

Tutorials & Courses

ETL & Pipelines

#python#tutorials#machine-learning

KevinWang676/Bark-Voice-Cloning

An open-source project for voice cloning and speech-to-text in Chinese, built using Jupyter Notebooks.

3.0K

Stable

Jupyter Notebook

AI Voice & Speech

#speech-to-text#voice-cloning#chinese-speech

1...2527...67

Stay in the loop

Get weekly updates on trending AI coding tools and projects.