Few-shot voice cloning and TTS with 1 minute of training data
Instant voice cloning model with tone color cloning and multi-lingual support
CLIP is a neural network for zero-shot image-text matching and understanding
An efficient zero-shot text-to-speech system with fine-grained control over the generated voice.
Terminalizer is a tool for recording your terminal and generating animated GIFs or web players to share your terminal sessions.
GPT-3 is a large language model developed by OpenAI, showcasing few-shot learning capabilities.
A comprehensive repository covering papers, code, datasets, tutorials, and applications for transfer learning, domain adaptation, and more.
Open source implementation of CLIP, a contrastive learning model for multi-modal tasks like zero-shot classification.
Zero-shot identity-preserving text generation in seconds for vibe coders.
A framework for few-shot evaluation of language models, useful for vibe coders working with AI tools.
A collection of tutorials and notebooks on state-of-the-art computer vision models and techniques for developers.
A Jupyter Notebook project for zero-shot speech editing and text-to-speech using AI models.
An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.
A Python library for adapting the Segment Anything Model for zero-shot visual tracking with motion-aware memory.
This repository provides a curated collection of resources for Prompt Engineering with a focus on large language models like ChatGPT and GPT-3.
A curated list of resources related to domain adaptation, a technique used to improve AI model performance on new datasets.
Code for robust monocular depth estimation, a key computer vision task for AI-powered apps.
A PyTorch-based implementation of the Single Shot MultiBox Detector for object detection in computer vision tasks.
This project is a penetration testing tool for hacking cameras and GPS locations of target devices.
Tune-A-Video is a one-shot text-to-video generation tool that fine-tunes image diffusion models.