Explore Projects

Discover 36 open source projects

Active filters (1):
Search: vits

Showing 21-36 of 36 projects

hila-chefer/Transformer-Explainability

Official PyTorch implementation of a novel method for visualizing classifications made by Transformer-based networks.

2.0K
Archived
Jupyter Notebook
Computer Vision
Documentation
PyTorch
#attention-visualization #explainability #computer-vision

microsoft/Cream

A collection of Microsoft's work on NAS and Vision Transformer for efficient AI models.

1.8K
Archived
Python
Computer Vision
ML Ops
Python
#automl #efficiency #knowledge-distillation

czczup/ViT-Adapter

A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.

1.5K
Experimental
Python
Computer Vision
Backend Frameworks
PyTorch
#vision-transformer #object-detection #semantic-segmentation

Yangzhangcst/Transformer-in-Computer-Vision

A curated list of recent Transformer-based computer vision papers and implementations.

1.4K
Stable
Computer Vision
#computer-vision #deep-learning #transformer

High-Logic/Genie-TTS

A GPT-SoVITS ONNX inference engine and model converter that lets developers add voice cloning and text-to-speech.

1.4K
Active
Python
AI Voice & Speech
CLI Tools
Python
#gpt-sovits #text-to-speech #tts

innnky/emotional-vits

An emotion-controllable text-to-speech model for vibe coders, built on the VITS framework.

1.4K
Archived
Jupyter Notebook
AI Voice & Speech
Jupyter Notebook
#text-to-speech #emotion-control #ai-voice

Voine/ChatWaifu_Mobile

A mobile app that enables developers to create 2D anime-style AI companions using ChatGPT and Live2D.

1.4K
Archived
C++
Animation & Motion
LLM Frameworks
Compose
#chatgpt #live2d #lipsync

lightly-ai/lightly-train

All-in-one training for vision models with pretraining, fine-tuning, and distillation capabilities.

1.3K
Active
Python
Computer Vision
Fine-tuning
PyTorch
#computer-vision #deep-learning #pretrained-models

BR-IDL/PaddleViT

PaddleViT is a state-of-the-art Visual Transformer and MLP model library for the PaddlePaddle 2.0+ deep learning framework.

1.2K
Archived
Python
Computer Vision
ML Ops
PaddlePaddle
#computer-vision #deep-learning #transformer

PlayVoice/vits_chinese

Best-practice TTS based on BERT and VITS, with some features from Microsoft's NaturalSpeech; supports ONNX streaming output.

1.2K
Archived
Python
AI Voice & Speech
API Frameworks
#tts #bert #vits

yitu-opensource/T2T-ViT

A Tokens-to-Token Vision Transformer (T2T-ViT) model for training Vision Transformers from scratch on ImageNet.

1.2K
Archived
Jupyter Notebook
Computer Vision
Frontend Frameworks
Jupyter Notebook
#t2t-transformer #vision-transformer #vit

PriesiaMioShirakana/DragonianVoice

A C++ inference library for various SVC/TTS models, including DiffSinger, DiffSVC, HiFiGAN, and VITS.

1.1K
Experimental
C
AI Voice & Speech
API Frameworks
#speech-synthesis #text-to-speech #voice-conversion

baofff/U-ViT

A PyTorch implementation of the paper 'All are Worth Words: A ViT Backbone for Diffusion Models'.

1.1K
Archived
Jupyter Notebook
LLM Frameworks
Computer Vision
PyTorch
#diffusion-models #vision-transformers #computer-vision

THU-MIG/RepViT

A PyTorch library for training and deploying efficient computer vision models using ViT-based architectures.

1.1K
Archived
Jupyter Notebook
Computer Vision
API Frameworks
PyTorch
#computer-vision #efficient-models #ViT

jacobgil/vit-explain

A library for explaining the decisions made by Vision Transformers, a type of AI model used for computer vision tasks.

1.1K
Archived
Python
Explainable AI
ML Ops
PyTorch
#explainable-ai #vision-transformer #computer-vision

Artrajz/vits-simple-api

A simple API for the VITS text-to-speech model, with additional features for vibe coders.

1.0K
Stable
Python
AI Voice & Speech
API Clients & Testing
#tts #vits #text-to-speech