Explore Projects

Discover 25 open source projects

Active filters (1):
Search: zero-shotร—
Clear all

Showing 1-20 of 25 projects

openai/CLIP

CLIP is a neural network for zero-shot image-text matching and understanding

32.7K
Archived
Jupyter Notebook
Computer Vision
PyTorch
#image-text-matching#zero-shot-learning#computer-vision

index-tts/index-tts

An efficient zero-shot text-to-speech system with fine-grained control over the generated voice.

19.1K
Stable
Python
AI Voice & Speech
Python
#text-to-speech#zero-shot#voice-cloning

mlfoundations/open_clip

Open source implementation of CLIP, a contrastive learning model for multi-modal tasks like zero-shot classification.

13.5K
Stable
Python
Computer Vision
PyTorch
#computer-vision#contrastive-learning#pretrained-model

instantX-research/InstantID

Zero-shot identity-preserving text generation in seconds for vibe coders.

11.9K
Archived
Python
LLM Frameworks
Python
#text-generation#identity-preservation#zero-shot

jasonppy/VoiceCraft

A Jupyter Notebook project for zero-shot speech editing and text-to-speech using AI models.

8.5K
Experimental
Jupyter Notebook
AI Voice & Speech
Notebooks
#zero-shot#speech-editing#text-to-speech

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K
Archived
Python
LLM Wrappers & SDKs
AI Voice & Speech
Python
#emotional-speech#text-to-speech#transformer-architecture

yangchris11/samurai

A Python library for adapting the Segment Anything Model for zero-shot visual tracking with motion-aware memory.

7.0K
Experimental
Python
Computer Vision
Backend Frameworks
Python
#computer-vision#tracking#segmentation

isl-org/MiDaS

Code for robust monocular depth estimation, a key computer vision task for AI-powered apps.

5.3K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#depth-estimation#computer-vision#pytorch

Picsart-AI-Research/Text2Video-Zero

A powerful text-to-video generation model that can turn prompts into high-quality videos, built for AI-driven developers.

4.2K
Archived
Python
Computer Vision
AI Image & Video
Python
#text-to-video#video-generation#diffusion-models

Plachtaa/seed-vc

Zero-shot voice conversion and singing voice conversion library with real-time support for vibe coders.

3.6K
Experimental
Python
AI Voice & Speech
Python
#voice-conversion#singing-voice-conversion#zero-shot

NVlabs/FoundationStereo

FoundationStereo is a CVPR 2025 Best Paper Nomination project for zero-shot stereo matching using AI.

2.5K
Stable
Python
Computer Vision
#computer-vision#stereo-matching#zero-shot-learning

protectai/vulnhuntr

Zero-shot vulnerability discovery using large language models (LLMs) for security researchers.

2.5K
Experimental
Python
LLM Frameworks
Security Research
Python
#security#vulnerability-detection#static-analysis

lifeiteng/vall-e

A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.

2.2K
Stable
Python
LLM Frameworks
AI Voice & Speech
PyTorch
#chatgpt#in-context-learning#large-language-models

roboterax/humanoid-gym

Reinforcement learning framework for training humanoid robots with zero-shot sim-to-real transfer.

1.9K
Archived
Python
Agents & Orchestration
Computer Vision
Python
#reinforcement-learning#humanoid-robot#control-systems

roboflow/awesome-openai-vision-api-experiments

A must-have resource for experimenting with and building on the OpenAI vision API.

1.7K
Archived
Python
Computer Vision
API Clients & Testing
Python
#chatgpt#clip#computer-vision

om-ai-lab/OmDet

Real-time and accurate open-vocabulary end-to-end object detection library for computer vision applications.

1.4K
Archived
Python
Computer Vision
API Frameworks
Python
#computer-vision#object-detection#real-time

wyhuai/DDNM

Zero-shot image restoration using a denoising diffusion model that can remove various types of noise and artifacts.

1.3K
Archived
Python
Computer Vision
Diffusion Models
PyTorch
#image-restoration#zero-shot#diffusion-models

ali-vilab/MimicBrush

A zero-shot image editing tool that allows users to transfer textures and styles from reference images.

1.3K
Archived
Python
Computer Vision
Image & Video
Python
#image-editing#texture-transfer#zero-shot

OpenMOSS/MOSS-TTSD

An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.

1.2K
Stable
Python
AI Voice & Speech
API Frameworks
#speech-dialogue-generation#multi-speaker-voice-cloning#long-form-speech-generation

ChenyangQiQi/FateZero

FateZero is a video editing tool that uses AI-powered text-driven video editing and style transfer.

1.2K
Archived
Jupyter Notebook
Computer Vision
AI Image & Video
Jupyter Notebook
#video-editing#text-driven-editing#style-transfer
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.