Explore Projects

Discover 79 open source projects

Active filters (1):
Search: neuripsร—
Clear all

Showing 1-20 of 79 projects

haotian-liu/LLaVA

LLaVA is a visual instruction tuning framework for large language and vision models, enabling GPT-4 level capabilities.

24.5K
Archived
Python
Computer Vision
LLM Frameworks
PyTorch
#llava#gpt-4#instruction-tuning

SWE-agent/SWE-agent

SWE-agent is an AI-powered tool that automatically fixes GitHub issues using large language models.

18.6K
Active
Python
AI Code Agents
Python
#ai#github#issue-fixing

sczhou/CodeFormer

A PyTorch-based library for robust and high-quality blind face restoration using a codebook lookup transformer.

17.8K
Stable
Python
Computer Vision
PyTorch
#face-enhancement#face-restoration#super-resolution

THU-MIG/yolov10

An open-source real-time object detection library powered by the YOLOv10 neural network model.

11.2K
Experimental
Python
Computer Vision
Python
#object-detection#real-time#neural-network

FoundationVision/VAR

An ultra-simple, state-of-the-art codebase for autoregressive image generation using advanced AI models.

8.6K
Stable
Jupyter Notebook
LLM Frameworks
Computer Vision
Jupyter Notebook
#autoregressive-models#diffusion-models#image-generation

DepthAnything/Depth-Anything-V2

A highly capable foundation model for monocular depth estimation, a key component in computer vision.

7.7K
Archived
Python
Computer Vision
NeurIPS Submissions
Python
#depth-estimation#monocular-depth#foundation-model

HVision-NKU/StoryDiffusion

An AI-powered story generation tool for developers interested in vibe coding and creative AI applications.

6.4K
Archived
Jupyter Notebook
LLM Frameworks
Agents & Orchestration
Jupyter Notebook
#story-generation#creative-ai#llm

princeton-nlp/tree-of-thought-llm

A research paper that introduces a novel approach for using large language models to solve complex problems through a tree-based reasoning process.

5.9K
Archived
Python
LLM Frameworks
#large-language-models#prompting#tree-of-thoughts

UX-Decoder/Segment-Everything-Everywhere-All-At-Once

Official implementation of a paper on segmentation models for computer vision tasks.

4.8K
Archived
Python
Computer Vision
Python
#computer-vision#segmentation#neural-networks

stanford-futuredata/ColBERT

State-of-the-art neural search engine

3.8K
Stable
Python
Neural Search
#ColBERT#Natural Language Processing#Search Engine Optimization

OSU-NLP-Group/HippoRAG

HippoRAG is a novel RAG framework that enables LLMs to continuously integrate knowledge across external documents.

3.3K
Stable
Python
LLM Frameworks
RAG & Vector
Python
#llm#knowledge-integration#knowledge-graphs

guandeh17/Self-Forcing

Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion

3.2K
Stable
Python
LLM Frameworks
None
React
#autoregressive video diffusion#self forcing#neurips 2025 spotlight

Pointcept/Pointcept

Pointcept is a codebase for point cloud perception research, featuring the latest works on 3D computer vision.

2.9K
Active
Python
Computer Vision
API Frameworks
PyTorch
#3d-vision#point-cloud#computer-vision

MeiGen-AI/MultiTalk

Multimodal conversational video generation powered by AI, enabling new vibe-coder collaboration experiences.

2.8K
Stable
Python
LLM Frameworks
Agents & Orchestration
Python
#ai-powered#multimodal#conversational

sunsmarterjie/yolov12

Real-time object detector using YOLOv12 with attention-centric architecture

2.8K
Stable
Python
React
#real-time#object-detection#YOLOv12

xlang-ai/OSWorld

A benchmark for multimodal AI agents to tackle open-ended tasks in real computer environments.

2.6K
Active
Python
Agents & Orchestration
Benchmark
Python
#multimodal-ai#agent-benchmarking#open-ended-tasks

VITA-MLLM/VITA

A powerful multimodal AI model for real-time vision and speech interaction, built for developers who work with AI tools.

2.5K
Experimental
Python
LLM Frameworks
Agents & Orchestration
Python
#large-language-model#multimodal#video-understanding

thuml/Autoformer

Autoformer: A deep learning model for long-term time series forecasting, focused on developers building with AI tools.

2.4K
Experimental
Jupyter Notebook
LLM Frameworks
Inference
PyTorch
#time-series-forecasting#deep-learning#transformer

OmniSVG/OmniSVG

An end-to-end multimodal SVG generator that leverages pre-trained Vision-Language Models to create complex and detailed SVGs.

2.4K
Active
Python
LLM Frameworks
Animation & Motion
Python
#svg-generation#vision-language-models#multimodal-ai

wgsxm/PartCrafter

PartCrafter is a 3D mesh generation tool that uses compositional latent diffusion transformers to create structured 3D objects.

2.4K
Stable
Python
Computer Vision
3D-Object-Generation
Python
#3d-generation#computer-vision#deep-learning

Stay in the loop

Get weekly updates on trending AI coding tools and projects.