Explore Projects

Discover 123 open source projects

Active filters (1):
Search: 2025ร—
Clear all

Showing 61-80 of 123 projects

NoriSte/ui-testing-best-practices

A comprehensive guide to best practices for UI testing, covering tools like Cypress and Puppeteer.

1.7K
Stable
Frontend Frameworks
Testing
React
#ui-testing#best-practices#cypress

bytedance/DreamO

A unified framework for image customization using AI tools, targeting vibe-focused developers.

1.7K
Stable
Python
AI Design-to-Code
Computer Vision
React
#image-editing#design-tools#ai-powered

franciszzj/Leffa

This is a Python library for learning flow fields in attention for controllable person image generation.

1.6K
Stable
Python
Computer Vision
Inference
React
#image-generation#computer-vision#attention-mechanisms

Picsart-AI-Research/StreamingT2V

Generates long videos from text using a consistent and dynamic approach.

1.6K
Experimental
Python
React
#streaming#long-video-generation#AI-powered

zanfranceschi/rinha-de-backend-2025

Rinha de Backend is a Lua-based backend framework and development platform for building AI-powered applications.

1.6K
Stable
Lua
LLM Frameworks
API Frameworks
#lua#backend-framework#ai-tools

Fantasy-AMAP/fantasy-talking

A diffusion-based model for generating realistic talking portrait videos.

1.6K
Active
Python
Computer Vision
AI Image & Video
Python
#diffusion#diffusion-models#diffusion-transformer

Zheng-Chong/CatVTON

A lightweight and efficient virtual try-on diffusion model for fashion applications.

1.6K
Stable
Python
Computer Vision
Animation & Motion
PyTorch
#diffusion-models#fashion#try-on

ZJU4HealthCare/HealthGPT

Official repository for a paper on a large vision-language model for medical applications

1.6K
Stable
Python
LLM Frameworks
Computer Vision
Python
#medical-ai#vision-language-model#icml

FoundationVision/Infinity

A high-performance, autoregressive model for generating high-resolution images from text prompts.

1.5K
Stable
Python
LLM Frameworks
Text-to-Image
Python
#autoregressive-model#generative-ai#text-to-image

opendatalab/OmniDocBench

A comprehensive benchmark for document parsing and evaluation, designed for CVPR 2025.

1.5K
Stable
Python
Computer Vision
Datasets
#computer-vision#document-parsing#benchmark

ShengranHu/ADAS

A Python library for building AI-powered agent-based systems, with a focus on automated design and optimization.

1.5K
Archived
Python
Agents & Orchestration
LLM Frameworks
Python
#AI-powered#agent-based#automated-design

Drexubery/ViewCrafter

A Python-based video diffusion model for high-fidelity novel view synthesis

1.5K
Stable
Python
Computer Vision
ML Ops
Python
#computer-vision#video-synthesis#novel-view-synthesis

facebookresearch/fast3r

Fast3R is a Python library for 3D reconstruction from a large number of images in a single forward pass.

1.5K
Experimental
Python
Computer Vision
CLI Tools
#3d-reconstruction#computer-vision#image-processing

Tencent/DepthCrafter

DepthCrafter is a Python library that generates consistent long depth sequences for open-world videos using AI.

1.5K
Stable
Python
Computer Vision
#computer-vision#depth-estimation#video-processing

pq-yang/MatAnyone

This is an AI-powered video matting library that provides stable and consistent video segmentation.

1.5K
Stable
Python
Computer Vision
#video-matting#computer-vision#ai-model

TIGER-AI-Lab/TheoremExplainAgent

A Python library for creating video-based multimodal explanations for LLM theorem understanding.

1.5K
Experimental
Python
LLM Frameworks
Agents & Orchestration
#llm#explanation#video

NJU-PCALab/STAR

An AI-powered video super-resolution model that enhances real-world videos using text-to-video generation.

1.5K
Experimental
Python
Computer Vision
Animation & Motion
Python
#video-enhancement#text-to-video#super-resolution

NVlabs/describe-anything

An implementation for detailed localized image and video captioning using large multimodal models.

1.5K
Experimental
Python
Computer Vision
LLM Frameworks
Python
#describe-anything#detailed-localized-captioning#large-multimodal-models

Intellindust-AI-Lab/DEIM

DEIM: A real-time object detection system using DETR with improved matching for fast convergence.

1.4K
Stable
Python
React
#real-time#object-detection#DETR

apple/ml-mobileclip

This repository contains the official implementation of the MobileCLIP and MobileCLIP2 research papers, focused on AI-powered mobile app development.

1.4K
Stable
Python
Computer Vision
LLM Frameworks
Python
#computer-vision#llm#ai-powered

Stay in the loop

Get weekly updates on trending AI coding tools and projects.