Showing 1-4 of 4 projects
A collection of novel deep learning research works using the PaddlePaddle framework for computer vision, NLP, and more.
An AI-powered video super-resolution model that enhances real-world videos using text-to-video generation.
VideoLLaMA 2 is a Python library that advances spatial-temporal modeling and audio understanding in video-based large language models.
An open-source toolkit for urban spatial-temporal data mining and traffic prediction tasks.
Get weekly updates on trending AI coding tools and projects.