A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
Top2Vec learns jointly embedded topic, document and word vectors for semantic search and topic modeling.
Custom inky color scheme for various terminals and editors
A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
An open-source text-to-speech software that enables high-quality, free-to-use voice generation.
A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.
Converts text to a knowledge graph for Graph Augmented Generation.
An open-source Python 3 tool for recognizing layouts, tables, math formulas, and text in images, converting them to Markdown.
Cross-platform text editor written in Free Pascal, suitable for general-purpose text editing tasks.
An open-source library for converting speech to text using OpenAI's Whisper AI model, with Docker support.
A toolkit for building rich text editors in React, with a focus on extensibility and flexibility.
A versatile WebUI for various AI-powered text-to-speech engines, enabling vibe coders to explore and utilize cutting-edge audio generation tools.
A secure, cross-platform library for storing text data in the Keychain on iOS, macOS, tvOS, and watchOS.
An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.
A console user interface library for Python, providing a set of tools for building text-based user interfaces.
A React Native library with custom text input animations and UI effects for iOS and Android.
An API for extracting, anonymizing, and parsing text from various document formats using state-of-the-art OCR and LLM models.
An open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools.
A collection of Python tutorials covering a wide range of topics from computer vision to network security.
An open-source project for voice cloning and speech-to-text in Chinese, built using Jupyter Notebooks.