A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.
Official implementation of a controllable virtual try-on system using latent diffusion models.
A transformer-based time series forecasting library.
DiffSinger is a singing voice synthesis (SVS) system using a shallow diffusion mechanism, enabling efficient text-to-speech (TTS) and SVS.
A real-time, trimap-free portrait matting solution using AI and computer vision techniques.
Official implementation of a paper on using transformers for time series forecasting.
A PyTorch-based framework for reproducible deep learning studies with 26 knowledge distillation methods.
A curated list of resources for deep reinforcement learning and the future of AI.
A graph convolutional network for text classification, an NLP task.
An implementation of a pose-guided text-to-video generation model using the LAION Pose Dataset.
A Python library for using diffusion models to segment and reconstruct medical images.
An interactive tool for generating customizable human images with flexible garment, pose, and scene control for virtual dressing.
A curated list of resources for relation extraction, a key NLP task.
AnomalyGPT is a tool for detecting industrial anomalies using large vision-language models.
A graph transformer architecture for developing graph neural networks with attention mechanisms.
A PyTorch-based framework for non-autoregressive text-to-speech synthesis, including PortaSpeech and DiffSpeech models.