Explore Projects

Discover 4 open source projects

Active filters (1):
Search: iclr2024ร—
Clear all

Showing 1-4 of 4 projects

xinyu1205/recognize-anything

Open-source, strong foundation image recognition models for developers building with AI tools.

3.6K
Experimental
Jupyter Notebook
Computer Vision
LLM Frameworks
Jupyter Notebook
#image-recognition#object-detection#classification

facebookresearch/MetaCLIP

A research project from Facebook that explores multimodal AI models for computer vision and language tasks.

1.8K
Stable
Python
LLM Frameworks
Computer Vision
PyTorch
#multimodal-ai#computer-vision#language-models

omerbt/TokenFlow

Official PyTorch implementation of TokenFlow, a novel diffusion-based method for consistent video editing presented at ICLR 2024.

1.7K
Experimental
Python
Computer Vision
AI Image & Video
PyTorch
#iclr2024#stable-diffusion#text-to-image

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K
Stable
LLM Frameworks
Speech Recognition
#audio-processing#speech-recognition#video-understanding

Stay in the loop

Get weekly updates on trending AI coding tools and projects.