A modern single-header ray tracing kernel for vibe coders building AI-powered graphics apps.
Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.
An open-source text-to-speech server built with Python, useful for adding voice functionality to applications.
Automated, hardware-independent Hand-Eye Calibration tool for robotics and computer vision applications.
Res2Net-PretrainedModels is an official PyTorch implementation of the Res2Net multi-scale backbone architecture for computer vision tasks.
A modular voice assistant app for experimenting with state-of-the-art transcription, response generation, and text-to-speech models.
A C-based toolkit for deploying AI models on Rockchip AI-accelerated hardware.
A Python library for training and using AI models in the ComfyUI AI generation tool.
A zk-SNARK library written in Rust for building cryptographic applications.
A Python package for causal inference in quasi-experimental settings.
A comprehensive collection of SLAM (Simultaneous Localization and Mapping) applications and comparisons for robotics and lidar-based navigation.
A curated list of resources for realistic image composition and object insertion using AI and computer vision techniques.
A project exploring human-machine collaboration using a robotic arm, large language models, and multimodal AI.
A Python API wrapper for Poe.com, providing free access to GPT-4, Claude, Llama, and other AI models.
A frontier multimodal foundation model for advanced image and video understanding tasks.
A collection of samples and tools for building AI-powered Windows applications using frameworks like TensorFlow and PyTorch.
A GPU-accelerated TSDF and ESDF library for robots equipped with RGB-D cameras.
Sample Node.js application for the IBM Watson Speech to Text service.
A Chinese GPT-2 project for generating news article titles, with detailed code annotations.
High-performance TensorFlow Lite library for React Native with GPU acceleration.
A state-of-the-art AI text-to-GIF model for developers building with AI tools and Stable Diffusion XL.
A multi-agent framework for end-to-end film automation in virtual 3D spaces, focused on AI-powered filmmaking.
A fast data versioning system for ML datasets, making it easy to version and track changes like code.
A math OCR model that outputs LaTeX and Markdown, useful for vibe coders working with AI tools.
A flexible and powerful library to develop your own Transformer variants for AI/ML applications.
A repository for learning TensorFlow, a popular open-source machine learning library in Python.
ToRA is a series of Tool-integrated Reasoning LLM Agents for solving mathematical reasoning problems.
MyoSuite is a collection of environments and tasks for musculoskeletal models, simulated with MuJoCo and wrapped in the OpenAI Gym API.
A comprehensive collection of deep learning interview questions for developers building with AI tools.
End-to-end Chinese license plate recognition using MXNet, a machine learning framework.
The final version of an AI-designed keyboard layout, written in C, for vibe coders.
A PyTorch implementation of the Style-Based GAN architecture for generative adversarial networks.
Large-scale 3D scene reconstruction and novel view synthesis using Gaussian representations.
Sample code for the Google Cloud Vision API, a powerful computer vision tool for image analysis.
ChatdollKit enables developers to create virtual AI companions by integrating 3D models and chatbot functionality.
A tightly coupled GNSS-Visual-Inertial system for smooth and consistent state estimation in complex environments.
Portfolio optimization tool using deep learning and PyTorch for finance and wealth management.
A research repository focused on tabular deep learning, providing papers and Python packages.
Implementation of the QuickDraw game, a computer vision and deep learning project focused on image classification.
A collection of resources on controllable generation with text-to-image diffusion models for vibe coders.
Tools to train a generative model on arbitrary audio samples for vibe coders focused on AI tools.
A Python library that provides a single interface to use and evaluate different AI agent frameworks.
Offline Russian voice assistant with plugin-based skills for developers working with AI tools.
An async RL training library for scaling AI model training and deployment.
SETR is a transformer-based approach for rethinking semantic segmentation from a sequence-to-sequence perspective.
A collection of notebooks and recipes showcasing use cases of open-source models with Together AI.
Read Pilot is an AI-powered tool that analyzes articles and generates Q&A cards, built with Next.js and OpenAI.
A collection of scripts and notebooks to help you get started with Numerai's quant finance platform.
AI-powered data enrichment tool that transforms emails into rich datasets with company profiles, funding data, tech stacks, and more.
This Python project provides utilities to convert datasets to COCO and VOC formats for object detection tasks.