Showing 1-20 of 24 projects
A Python library for generating videos from text and images using large language models and AI tools.
A Python library for generating high-quality videos from text and images using diffusion models.
This repository provides a curated collection of resources for Prompt Engineering with a focus on large language models like ChatGPT and GPT-3.
A lightweight video generation model that can create videos from images and text using AI.
Tune-A-Video is a one-shot text-to-video generation tool that fine-tunes image diffusion models.
A powerful text-to-video generation model that can turn prompts into high-quality videos, built for AI-driven developers.
A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Official PyTorch implementation of TokenFlow, a novel diffusion-based method for consistent video editing presented at ICLR 2024.
A Colab notebook for text-to-video synthesis, enabling vibe coders to experiment with AI-powered video generation.
An open-source benchmarking tool for evaluating video generation models.
Phantom is a subject-consistent video generation tool that aligns text and video via cross-modal alignment.
An AI-powered video super-resolution model that enhances real-world videos using text-to-video generation.
An implementation of a pose-guided text-to-video generation model using the LAION Pose Dataset.
MagicTime is a diffusion-based time-lapse video generation model for metamorphic video simulation.
An extension for Auto1111 Stable Diffusion webui that adds text-to-video diffusion models like ModelScope or VideoCrafter.
Generates high-fidelity foley audio with multimodal diffusion and representation alignment.
An AI-powered video agent framework for next-generation video interactions and workflows.
Official server for the MiniMax Model Context Protocol (MCP) that enables powerful AI capabilities like text-to-speech, image generation, and video generation.
A text-to-video generation model that combines pixel-level and latent diffusion approaches.
Allegro is a powerful text-to-video model that generates high-quality videos from text input.
Get weekly updates on trending AI coding tools and projects.