Explore Projects

Discover 36 open-source projects

Active filters (1):

Search: vits

Showing 21-36 of 36 projects

hila-chefer/Transformer-Explainability

Official PyTorch implementation of a novel method for visualizing classifications made by Transformer-based networks.

2.0K

Archived

Jupyter Notebook

Computer Vision

Documentation

PyTorch

#attention-visualization #explainability #computer-vision

microsoft/Cream

A collection of Microsoft's work on NAS and Vision Transformer for efficient AI models.

1.8K

Archived

Python

Computer Vision

ML Ops

Python

#automl #efficiency #knowledge-distillation

czczup/ViT-Adapter

A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.

1.5K

Experimental

Python

Computer Vision

Backend Frameworks

PyTorch

#vision-transformer #object-detection #semantic-segmentation

Yangzhangcst/Transformer-in-Computer-Vision

A curated list of recent Transformer-based computer vision papers and implementations.

1.4K

Stable

Computer Vision

#computer-vision #deep-learning #transformer

High-Logic/Genie-TTS

A GPT-SoVITS ONNX inference engine and model converter that gives developers voice cloning and text-to-speech.

1.4K

Active

Python

AI Voice & Speech

CLI Tools

Python

#gpt-sovits #text-to-speech #tts

innnky/emotional-vits

An emotion-controllable text-to-speech model for vibe coders, built on the VITS framework.

1.4K

Archived

Jupyter Notebook

AI Voice & Speech

Jupyter Notebook

#text-to-speech #emotion-control #ai-voice

Voine/ChatWaifu_Mobile

A mobile app that enables developers to create 2D anime-style AI companions using ChatGPT and Live2D.

1.4K

Archived

C++

Animation & Motion

LLM Frameworks

Compose

#chatgpt #live2d #lipsync

lightly-ai/lightly-train

All-in-one training for vision models with pretraining, fine-tuning, and distillation capabilities.

1.3K

Active

Python

Computer Vision

Fine-tuning

PyTorch

#computer-vision #deep-learning #pretrained-models

BR-IDL/PaddleViT

PaddleViT is a state-of-the-art Visual Transformer and MLP model library for the PaddlePaddle 2.0+ deep learning framework.

1.2K

Archived

Python

Computer Vision

ML Ops

PaddlePaddle

#computer-vision #deep-learning #transformer

PlayVoice/vits_chinese

A best-practice TTS system based on BERT and VITS, with some features from Microsoft's NaturalSpeech; supports ONNX streaming output.

1.2K

Archived

Python

AI Voice & Speech

API Frameworks

#tts #bert #vits

yitu-opensource/T2T-ViT

A Tokens-to-Token Vision Transformer (T2T-ViT) model for training Vision Transformers from scratch on ImageNet.

1.2K

Archived

Jupyter Notebook

Computer Vision

Frontend Frameworks

Jupyter Notebook

#t2t-transformer #vision-transformer #vit

PriesiaMioShirakana/DragonianVoice

A C++ inference library for various SVC/TTS models, including DiffSinger, DiffSVC, HiFiGAN, and VITS.

1.1K

Experimental

AI Voice & Speech

API Frameworks

#speech-synthesis #text-to-speech #voice-conversion

baofff/U-ViT

A PyTorch implementation of the paper 'All are Worth Words: A ViT Backbone for Diffusion Models'.

1.1K

Archived

Jupyter Notebook

LLM Frameworks

Computer Vision

PyTorch

#diffusion-models #vision-transformers #computer-vision

THU-MIG/RepViT

A PyTorch library for training and deploying efficient computer vision models using ViT-based architectures.

1.1K

Archived

Jupyter Notebook

Computer Vision

API Frameworks

PyTorch

#computer-vision #efficient-models #ViT

jacobgil/vit-explain

A library for explaining the decisions made by Vision Transformers, a type of AI model used for computer vision tasks.

1.1K

Archived

Python

Explainable AI

ML Ops

PyTorch

#explainable-ai #vision-transformer #computer-vision

Artrajz/vits-simple-api

A simple API for the VITS text-to-speech model, with additional features for vibe coders.

1.0K

Stable

Python

AI Voice & Speech

API Clients & Testing

#tts #vits #text-to-speech
