An open-source machine learning course with Python, covering algorithms, AI, and more.
Fast and accurate automatic speech recognition (ASR) for edge devices.
Officially maintained deep learning models by PaddlePaddle, covering computer vision, NLP, speech, and more.
A comprehensive reading list for research topics in multimodal machine learning.
Speech recognition library for your web application, enabling voice interactions.
A high-performance Chinese text segmentation library with support for named entity recognition and part-of-speech tagging.
Facebook AI Research's end-to-end speech recognition toolkit written in C++.
A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.
An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.
Orpheus-TTS is a high-quality, real-time text-to-speech library for creating human-sounding AI voices.
Pre-trained text-to-speech models for various languages, made simple to use.
An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.
Inference and training library for high-quality text-to-speech (TTS) models.
This repository provides a curated collection of resources for Prompt Engineering with a focus on large language models like ChatGPT and GPT-3.
A TensorFlow implementation of DeepMind's WaveNet paper for generating high-quality speech audio.
Automatic speech recognition with speaker diarization using OpenAI Whisper.
An open-source video speech recognition and clipping tool with LLM-based AI clipping capabilities.
A PyTorch implementation of Tacotron 2, a state-of-the-art text-to-speech model, with faster-than-realtime inference.
Lucida is a speech and vision based intelligent personal assistant built with Java.