Category
Showing 3401-3450 of 6,802 trending projects
A collection of demo projects for running machine learning models on iOS using CoreML, TensorFlow Lite, and ML Kit.
AdvancedEAST is an algorithm for accurate scene text detection, with improvements over the EAST algorithm.
A minimal implementation of DeepMind's Genie world model for AI and machine learning research.
A simple tool to calibrate neural networks and improve their performance.
Kimi-VL is a multimodal AI model for advanced vision-language understanding and reasoning.
A Rust library for image similarity comparison, simulating human perception using multiscale SSIM.
A fast and customizable PyTorch library for audio data augmentation, useful for deep learning applications.
A collection of resources for developers interested in data-centric AI approaches and techniques.
This repository contains a CVPR2024 paper on referring human dance generation using AI tools.
A library that expands natural language instructions for use in AI/ML models and applications.
A high-performance, GPU-accelerated multimedia/video processing framework for transcoding, AI inference, and more.
A comprehensive collection of resources for anomaly detection, including books, papers, videos, and toolboxes.
A tutorial for building Neural Machine Translation models using TensorFlow.
Chinese version of CLIP for cross-modal retrieval and representation generation
Official implementation of DeepLabCut, a markerless pose estimation toolkit for animal behavior analysis using deep learning.
A comprehensive paper list of multi-agent reinforcement learning (MARL) research.
Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets.
A collection of 200+ flashcards covering topics in machine learning, computer vision, and computer science.
A simple, powerful, and efficient API for building deep learning models in Python.
Nvdiffrast is a modular library for high-performance differentiable rendering, useful for AI and graphics applications.
A highly efficient and compact MobileNet-based object detection model for real-time inference on mobile devices.
An open-source diffusion-based multimodal LLM framework for unified understanding and generation.
An open-source framework for detecting, highlighting, and correcting grammatical errors in natural language text.
A toolkit to optimize machine learning models for deployment, including quantization and pruning.
A deep-dive on the entire history of deep-learning, useful for developers interested in AI and machine learning.
A Python library that provides native multimodal models for building world-learning AI systems.
A Python re-implementation of the pi0 vision-language-action (VLA) model for developers working with AI tools.
Distributed compiler based on Triton for parallel systems, focused on AI and high-performance computing.
A low-level C++ library for building AI accelerator kernels and operators for Tenstorrent's METAL platform.
A fast communication-overlapping library for tensor/expert parallelism on GPUs, useful for AI/ML applications.
A simple, readable, and helpful unit testing library optimized for AI-driven development workflows.
A collection of research papers on autonomous agents and large language models (LLMs) updated daily.
A comprehensive introduction to deep learning using TensorFlow, covering a range of AI and machine learning topics.
A WASM vector similarity search library written in Rust for building AI-powered search and recommendation features.
A repository containing research papers on text summarization, focused on language models and NLP.
A repository for a book on understanding the fundamentals of deep learning.
Neural style transfer library in TensorFlow for developers interested in AI-powered image generation.
Real-time person removal from complex backgrounds using TensorFlow.js in the web browser.
This open-source project is a deep learning-based music generation and understanding system for AI-powered music creation.
An AI-powered file management tool that organizes local files while ensuring privacy.
A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
2D Gaussian splatting for geometrically accurate radiance field reconstruction, useful for novel view synthesis.
A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.
This is a React and Electron-based app that runs the FreedomGPT LLM locally for offline and private use.
An open-source Python library that allows controlling any computer using large language models (LLMs) like GPT-4.
An open-source driver assistance system that supports over 350 car makes and models.
A customizable image-to-video model based on HunyuanVideo for developers building AI-powered video applications.
A curriculum for learning about foundation models, from scratch to the frontier for AI-focused developers.
A set of Swift types and functions that make it easier to work with Core ML in iOS apps.
A custom Home Assistant component that uses OpenAI to control devices, enabling conversational AI home automation.
Get weekly updates on trending AI coding tools and projects.