Explore Projects

Discover 12 open source projects

Active filters (1):

Search: cross-modality×

Clear all

Showing 1-12 of 12 projects

jina-ai/clip-as-service

Scalable embedding, reasoning, ranking for images and sentences with CLIP

12.8K

Archived

Python

React

#authentication#streaming#real-time

zai-org/CogVLM

A state-of-the-art open visual language model for multimodal pretraining and applications.

6.7K

Archived

Python

LLM Frameworks

Computer Vision

Python

#cross-modality#language-model#multi-modal

OFA-Sys/Chinese-CLIP

Chinese version of CLIP for cross-modal retrieval and representation generation

5.8K

Stable

Jupyter Notebook

Computer Vision

LLM Frameworks

PyTorch

#chinese#clip#computer-vision

jina-ai/discoart

A Python library for creating Disco Diffusion artworks using a simple one-line interface.

3.8K

Archived

Python

AI Image & Video

Animation & Motion

#generative-art#disco-diffusion#prompt-engineering

docarray/docarray

A Python library for representing, sending, storing, and searching multimodal data in AI and ML applications.

3.1K

Active

Python

LLM Frameworks

Vector Databases

PyTorch

#cross-modal#multimodal#neural-search

KimMeen/Time-LLM

An official implementation of a time series forecasting model using large language models.

2.5K

Stable

Python

LLM Frameworks

Time Series

Python

#time-series-forecasting#large-language-models#deep-learning

astorfi/lip-reading-deeplearning

Cross-modal lip reading using 3D convolutional neural networks for speech recognition.

1.9K

Archived

Python

Computer Vision

Speech Recognition

TensorFlow

#speech-recognition#computer-vision#deep-learning

shaoxiongji/knowledge-graphs

A comprehensive collection of research on knowledge graphs, covering various applications and techniques.

1.8K

Archived

JavaScript

Knowledge Graph

Databases

JavaScript

#knowledge-graphs#natural-language-processing#information-retrieval

Phantom-video/Phantom

Phantom is a subject-consistent video generation tool that aligns text and video via cross-modal alignment.

1.5K

Stable

Python

Text-to-Video

Video Generation

Python

#text-to-video#video-generation#cross-modal-alignment

zju3dv/MatchAnything

Cross-modal image matching framework with large-scale pre-training for AI-powered coding tools.

1.2K

Stable

Computer Vision

Inference

PyTorch

#cross-modal-matching#computer-vision#ai-powered-coding

OFA-Sys/ONE-PEACE

A general representation model for cross-modal learning across vision, audio, and language.

1.1K

Archived

Python

LLM Frameworks

Representation Learning

Python

#multimodal#contrastive-learning#foundation-models

microsoft/VideoX

VideoX is a collection of video cross-modal models for developers working with AI-powered video tools.

1.1K

Archived

Python

Computer Vision

API Frameworks

Python

#computer-vision#video-processing#cross-modal

Stay in the loop

Get weekly updates on trending AI coding tools and projects.