Showing 1051-1100 of 6,802 trending projects
ClearML is an MLOps/LLMOps solution that streamlines AI/ML workloads with experiment management, data management, pipelines, orchestration, and serving.
A PyTorch-based LLM framework for adapting to light novels and visual novels.
High-performance symbolic regression library for Python and Julia, with support for explainable AI.
An open-source codebase for generating high-fidelity podcasts from text using AI models.
This repository provides examples and utilities for fine-tuning large language models (LLMs) using the PEFT library.
An awesome list for Whisper, an open-source AI-powered speech recognition system by OpenAI.
An on-device LLM execution library for React Native, compatible with Vercel AI SDK.
Open R1 is a fully open reproduction of DeepSeek-R1, enabling replication and extension of its reasoning capabilities.
A high-performance gradient boosting framework for machine learning tasks like ranking, classification, and more.
A vision transformer model for image classification tasks, part of the Google Research project.
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model.
A curated list of 120+ LLM libraries categorized for LLM engineers and AI developers.
An optimized library for efficient multi-GPU communication in deep learning applications.
An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.
A comprehensive resource for industrial image anomaly/defect detection papers and datasets.
News and links to material on GPU programming for developers who use AI tools.
A Python library that can generate videos from images at any resolution using a flexible framework.
Run AI models like LLaMA locally on your machine with Node.js bindings for llama.cpp and enforce JSON schema on the output.
A Python library that trains a large language model on data from specific time periods to reduce modern bias.
An AI-powered knowledge base and workflow agent platform for WeChat public accounts, aimed at becoming a leading vertical AI assistant.
A project that helps automate the creation and distribution of short-form and long-form video content across social media platforms.
RWKV is an RNN-based language model with high performance, fast training, and flexible transformer-like architecture.
A PDF version of the renowned MIT Deep Learning Book, a comprehensive resource for studying machine learning and neural networks.
Fay is an agent framework that helps connect digital humans and large language models to business systems.
Official PyTorch implementation of a scalable diffusion model with Transformers for AI-powered applications.
OpenMMLab Pose Estimation Toolbox and Benchmark for developers working on computer vision and AI-powered applications.
An open-ended embodied agent with large language models for exploring and learning in Minecraft environments.
An open-source Python library for democratizing deep learning in drug discovery, quantum chemistry, materials science, and biology.
Open-source full-song music generation foundation model for developers building AI-powered audio applications.
A curated collection of recent diffusion models for video generation, editing, and various other applications.
Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.
A Python and OpenCV-based scene cut/transition detection library for video processing.
Collection of generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
DATAGEN is an AI-driven multi-agent research assistant that automates hypothesis generation, data analysis, and report writing.
A real-time dashboard for monitoring NVIDIA GPU usage and performance metrics.
An OWASP project that provides security guidance for developers building applications with large language models (LLMs).
Object detection toolbox for PyTorch with support for multiple tasks and state-of-the-art models.
Curated computer vision resources for developers.
A set of Jupyter Notebooks that combine Grounding DINO, Segment Anything, and Stable Diffusion for automatic detection, segmentation, and generation of anything in images.
A curated list of awesome tools and projects for the LangChain framework, a popular AI development toolkit.
A Python library for machine learning security, providing tools for adversarial attacks and defenses.
A library for creating lip-synced videos using Stable Diffusion, focused on research and virtual avatars.
TorchGeo is a Python library for working with geospatial data using PyTorch, providing datasets, samplers, transforms, and pre-trained models.
A comprehensive document search and storage platform for building AI applications using Python.
RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.
A Python desktop app for automatically translating comics in various formats and languages using computer vision and machine translation.
An all-in-one productivity app and AI assistant with Tasks, Notes, Calendar, Diary and Bookmarks.
An open-source implementation for fine-tuning Qwen-VL series, a multimodal vision-language model by Alibaba Cloud.
Source separation library for audio processing with pretrained models.
A curated list of resources dedicated to Natural Language Processing (NLP).