Category
Showing 1401-1450 of 6,802 trending projects
PyTorch implementation of the MAR+DiffLoss paper for AI-powered coding tools and frameworks.
A pipeline parallel training script for diffusion models, useful for AI and machine learning researchers.
PyBullet-based Gymnasium environments for reinforcement learning of quadcopter control
This is a curated list of papers related to large language model (LLM) systems, useful for developers working with AI tools.
A hands-on guide to using machine learning for algorithmic trading, published by Packt
A Rust library for interacting with the OpenAI API, enabling vibe coders to leverage AI capabilities in their projects.
A curated list of safety-related resources for large language models (LLMs) to help researchers and practitioners understand the safety implications.
A Python package that provides advanced image background removal and object/face/clothes segmentation using multiple AI models.
SunoAPI allows developers to create music and audio in seconds using AI-powered tools.
A Python library that provides type annotations and runtime checking for JAX/NumPy/PyTorch arrays.
An open-source reinforcement learning system from ByteDance Seed and Tsinghua AIR for AI-driven development tools.
Open-source end-to-end vision-language-action model for GUI agents and computer usage analysis.
A Python tool that automatically converts videos into optimized social media posts for platforms like Xiaohongshu.
RoboVerse is a unified platform, dataset, and benchmark for scalable and generalizable robot learning.
A repository containing solutions to exercises from the machine learning book by Zhou Zhihua.
A Python library for silent face anti-spoofing using deep learning, useful for mobile apps and computer vision projects.
Inference tool for Microsoft's Florence2 Versatile Language Model (VLM), built for vibe coders using AI tools.
Open-source tools for computational pathology and digital pathology research using deep learning and weakly-supervised learning.
An open-source Python framework for implementing the Chanlun technical analysis methodology, supporting trading strategies and visualizations.
A curated list of awesome resources, tools, and other shiny things for LLM prompt engineering.
A Python-based video diffusion model for high-fidelity novel view synthesis
This is a Jupyter Notebook project focused on biological foundation modeling from molecular to genome scale.
Open-source RPA framework for Python and Robot Framework, focused on automation and AI-powered document processing.
A Python library for creating video-based multimodal explanations for LLM theorem understanding.
A comprehensive Python library for time series forecasting using machine learning models.
Stability-AI/sd3.5 is a Python library for the Stable Diffusion 3.5 model, a powerful AI-based image generation tool.
Comprehensive list of 1,500+ resources and tools related to AI agents for developers building with AI tools.
A real-time game translator with OCR that allows developers to build translation features into their games.
DataComp for Language Models is a library for training, evaluating, and deploying large language models.
A Python library and tools for generating and inspecting data for pre-training large language models (LLMs).
This is a dataset of character animation and motion capture data for developers working on AI-powered animation tools.
A Python script that automatically updates daily Computer Vision papers from the ArXiv using GitHub Actions.
Democratizing internet-scale financial data for developers through natural language processing.
Envoy-based API gateway that manages unified access to generative AI services like GPT, DALL-E, and Stable Diffusion.
A Python library for building multi-agent systems with large language models (LLMs) and the LangGraph framework.
A CUDA-accelerated robotics library for motion planning and control using PyTorch.
Step-Audio 2 is an end-to-end multi-modal large language model for industry-strength audio understanding and speech conversation.
UNO is a universal customization method for both single and multi-subject image generation using diffusion models.
A curated list of resources for leveraging visual information in large vision-language models (LVLMs) for complex reasoning, planning, and generation.
FireRedTTS2 is a long-form streaming TTS system for generating multi-speaker dialogue in Python.
This repository is a Jupyter Notebook focused on the Google Health GEMMA project, which is not clearly defined.
This handbook provides practical guidance for researchers and practitioners on the latest Text-to-SQL techniques.
ReCall is a library for training large language models (LLMs) to reason and use tools via reinforcement learning.
A platform for developers to discover, learn, and experiment with state-of-the-art AI models.
Cluely is an AI-powered desktop assistant that provides real-time insights and support during meetings, interviews, and professional conversations.
An open-source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.
A robust, real-time LiDAR-inertial initialization method for robotics and SLAM applications.
A Python library that uses LLMs, computer vision, and speech recognition to analyze video content.
Docling is an API service for running Docling, a platform for building AI-powered applications.
An open-source speech recognition library for the Espressif ESP32 microcontroller platform.
Get weekly updates on trending AI coding tools and projects.