A set of Jupyter Notebooks that combine Grounding DINO, Segment Anything, and Stable Diffusion for automatic detection, segmentation, and generation of anything in images.
PyTorch tutorials and fun projects, including neural talk, neural style, poem writing, and anime generation.
A Python library for downloading photos, videos, and metadata from Instagram.
LAVIS is a comprehensive library for multimodal deep learning, including image captioning, visual question answering, and more.
A simple iOS photo and video browser with grid view, captions and selections.
Automagically synchronize subtitles with video using audio alignment and speech detection.
A Python API to get YouTube video transcripts without an API key or headless browser.
All-in-one WebUI for AI generative image and video creation, captioning, and processing.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation.
A modular deep learning framework for multimodal AI research and applications from Facebook AI Research (FAIR).
Efficient image captioning code in Torch that runs on GPU.
A Keras model that generates HTML code from hand-drawn website mockups using an image captioning architecture.
A Python-based utility to download courses from Udemy for personal offline use across multiple platforms.
An open-source project that enables developers to build chatbots with video understanding using large language models.
InternGPT is an open-source demo platform that showcases various AI models and features, including DragGAN, ChatGPT, ImageBind, and multimodal chat.
A PyTorch tutorial for building an image captioning model using the Show, Attend, and Tell technique.
Translate subtitle files (.srt, .ass, .vtt) with customizable API keys for affordable pricing.
Official repository for the OFA (Unifying Architectures, Tasks, and Modalities) AI model, supporting various vision-language tasks.
A free API toolkit for businesses, creators, and developers to streamline advanced media processing, including video editing, image transformations, and Python code execution.
A curated list of awesome streaming video tools, frameworks, libraries, and learning resources.