Explore Projects

Discover 123 open source projects

Active filters (1):
Search: 2025×
Clear all

Showing 41-60 of 123 projects

VITA-MLLM/VITA

A powerful multimodal AI model for real-time vision and speech interaction, built for developers who work with AI tools.

2.5K
Experimental
Python
LLM Frameworks
Agents & Orchestration
Python
#large-language-model#multimodal#video-understanding

OmniSVG/OmniSVG

An end-to-end multimodal SVG generator that leverages pre-trained Vision-Language Models to create complex and detailed SVGs.

2.4K
Active
Python
LLM Frameworks
Animation & Motion
Python
#svg-generation#vision-language-models#multimodal-ai

wgsxm/PartCrafter

PartCrafter is a 3D mesh generation tool that uses compositional latent diffusion transformers to create structured 3D objects.

2.4K
Stable
Python
Computer Vision
3D-Object-Generation
Python
#3d-generation#computer-vision#deep-learning

vladkens/twscrape

Scrapes Twitter API search results and user profiles with authorization support.

2.3K
Experimental
Python
React
#authentication#scraping#authorization

vanshb03/New-Grad-2026

A GitHub repository with a list of 2025 & 2026 new grad full-time roles in SWE, Quant, and PM.

2.2K
Active
Tutorials & Courses
GitHub Profiles
#college#fulltime#jobs

yifan123/flow_grpo

An open-source implementation of a Flow Matching Model for training AI agents via online reinforcement learning.

2.0K
Stable
Python
LLM Frameworks
Agents & Orchestration
Python
#ai-agents#reinforcement-learning#flow-matching

jenndryden/Canadian-Tech-Internships-and-New-Grad-2025

A crowdsourced list of Canadian tech companies hiring interns and new grads for 2025

2.0K
Archived
Tutorials & Courses
Full-Stack Frameworks
#canadian-tech-companies#internships#new-grad

HCPLab-SYSU/Embodied_AI_Paper_List

A comprehensive paper list and resource repository for Embodied AI research and development.

1.9K
Stable
Agents & Orchestration
Computer Vision
#embodied-ai#robotics#computer-vision

Vchitect/Latte

A latent diffusion transformer for generating high-quality videos, suitable for vibe coders building AI-powered applications.

1.9K
Stable
Python
LLM Frameworks
AI Image & Video
Python
#video-generation#latent-diffusion#transformer

LeCAR-Lab/ASAP

A robotics research project focused on aligning simulation and real-world physics for learning agile humanoid whole-body skills.

1.9K
Active
Python
Agents & Orchestration
Reinforcement Learning
Python
#humanoid#reinforcement-learning#robotics

microsoft/Magma

Magma is a foundation model for building multimodal AI agents, enabling next-gen AI applications.

1.9K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#foundation-model#multimodal-ai#computer-vision

Yuanshi9815/OminiControl

OminiControl is a minimal and universal control system for diffusion transformer models like DALL-E and Stable Diffusion.

1.9K
Experimental
Python
LLM Frameworks
Inference
Python
#diffusion-models#computer-vision#image-generation

showlab/Show-o

A powerful multimodal transformer for combining language, vision, and other modalities in AI applications.

1.9K
Active
Python
LLM Frameworks
Multimodal
PyTorch
#multimodal-ai#language-models#vision-models

THUDM/LongWriter

LongWriter is a fine-tuned large language model (LLM) that can generate high-quality long-form text of 10,000+ words from long-form context.

1.8K
Experimental
Python
LLM Frameworks
Fine-tuning
Python
#llm#long-context#long-text

facebookresearch/MetaCLIP

A research project from Facebook that explores multimodal AI models for computer vision and language tasks.

1.8K
Stable
Python
LLM Frameworks
Computer Vision
PyTorch
#multimodal-ai#computer-vision#language-models

chillzhuang/blade-tool

SpringBlade is a commercial-grade microservices platform built with Spring Boot 3.5 and Spring Cloud 2025.

1.8K
Active
Java
Full-Stack Frameworks
API Frameworks
Spring Boot
#microservices#enterprise#multi-tenant

DepthAnything/Video-Depth-Anything

A PyTorch-based library for consistent depth estimation in super-long videos using transformers.

1.8K
Stable
Python
Computer Vision
API Frameworks
PyTorch
#depth-estimation#monocular-depth-estimation#transformer

liyupi/yu-ai-agent

A comprehensive project covering AI-powered coding tools, MCP frameworks, and backend-as-a-service for building AI-driven applications.

1.8K
Active
Java
AI Coding Agents
MCP Frameworks
Spring Boot
#ai-agent#mcp#spring-ai

xtekky/TikTok-ViewBot

A Python bot that automates increasing TikTok video views for content creators.

1.7K
Experimental
Python
Uncategorized
#tiktok#bot#views

showlab/ShowUI

Open-source end-to-end vision-language-action model for GUI agents and computer usage analysis.

1.7K
Active
Python
Agents & Orchestration
Component Libraries (React)
React
#agent#computer-use#gui-agent

Stay in the loop

Get weekly updates on trending AI coding tools and projects.