Explore Projects

Discover 12 open source projects

Active filters (1):
Search: cross-modality

Showing 1-12 of 12 projects

jina-ai/clip-as-service

Scalable embedding, reasoning, ranking for images and sentences with CLIP

12.8K
Archived
Python
React
#authentication#streaming#real-time

zai-org/CogVLM

A state-of-the-art open visual language model for multimodal pretraining and applications.

6.7K
Archived
Python
LLM Frameworks
Computer Vision
Python
#cross-modality#language-model#multi-modal

OFA-Sys/Chinese-CLIP

Chinese version of CLIP for cross-modal retrieval and representation generation

5.8K
Stable
Jupyter Notebook
Computer Vision
LLM Frameworks
PyTorch
#chinese#clip#computer-vision

jina-ai/discoart

A Python library for creating Disco Diffusion artworks using a simple one-line interface.

3.8K
Archived
Python
AI Image & Video
Animation & Motion
#generative-art#disco-diffusion#prompt-engineering

docarray/docarray

A Python library for representing, sending, storing, and searching multimodal data in AI and ML applications.

3.1K
Active
Python
LLM Frameworks
Vector Databases
PyTorch
#cross-modal#multimodal#neural-search

KimMeen/Time-LLM

An official implementation of a time series forecasting model using large language models.

2.5K
Stable
Python
LLM Frameworks
Time Series
Python
#time-series-forecasting#large-language-models#deep-learning

astorfi/lip-reading-deeplearning

Cross-modal lip reading using 3D convolutional neural networks for speech recognition.

1.9K
Archived
Python
Computer Vision
Speech Recognition
TensorFlow
#speech-recognition#computer-vision#deep-learning

shaoxiongji/knowledge-graphs

A comprehensive collection of research on knowledge graphs, covering various applications and techniques.

1.8K
Archived
JavaScript
Knowledge Graph
Databases
JavaScript
#knowledge-graphs#natural-language-processing#information-retrieval

Phantom-video/Phantom

Phantom is a subject-consistent video generation tool that aligns text and video via cross-modal alignment.

1.5K
Stable
Python
Text-to-Video
Video Generation
Python
#text-to-video#video-generation#cross-modal-alignment

zju3dv/MatchAnything

Cross-modal image matching framework with large-scale pre-training.

1.2K
Stable
Computer Vision
Inference
PyTorch
#cross-modal-matching#computer-vision#ai-powered-coding

OFA-Sys/ONE-PEACE

A general representation model for cross-modal learning across vision, audio, and language.

1.1K
Archived
Python
LLM Frameworks
Representation Learning
Python
#multimodal#contrastive-learning#foundation-models

microsoft/VideoX

VideoX is a collection of video cross-modal models for developers working with AI-powered video tools.

1.1K
Archived
Python
Computer Vision
API Frameworks
Python
#computer-vision#video-processing#cross-modal
