Category
Showing 1001-1050 of 6,802 trending projects
Real-time webcam demo with SmolVLM and llama.cpp server for AI-powered coding tools and applications.
An open-source project that provides a state-of-the-art image restoration model using the Swin Transformer architecture.
A multimodal evaluation toolkit for assessing AI models across text, image, video, and audio tasks.
A minimal LLM chat app that runs entirely in your browser, focused on vibe coders using AI tools.
All-in-one AI framework for semantic search, LLM orchestration and language model workflows
An open-source, large language model-based multimodal dialogue system that achieves near-GPT-4o performance.
A Python-based emotional companionship program powered by large language models (LLMs) for building AI-driven chatbots and virtual characters.
An Android app for the Susi AI assistant, a conversational AI platform.
An open-source question-answering tool that leverages large language models to provide answers to any query.
HunyuanVideo is a systematic framework for large-scale video generation using diffusion models and transformers.
A high-performance real-time instance segmentation library for computer vision applications.
Evidently is an open-source ML and LLM observability framework to evaluate, test, and monitor AI-powered systems.
An open-source Python tool to transform multimodal content into captivating multilingual audio podcasts powered by GenAI.
OpenCV bindings for Node.js, enabling computer vision and image processing in Node applications.
Image Restoration Toolbox with PyTorch-based training and testing codes for various AI-powered restoration models.
A modular full-stack reinforcement learning library for large language models (LLMs).
This repository contains an efficient implementation of a vision encoding model for vision-language models.
Visual analysis and diagnostic tools to facilitate machine learning model selection.
An open-source audio wake word detection framework with a focus on performance and simplicity.
An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.
OpenWash is a format and specification for defining and sharing AI model deployment packaging.
Source separation library for audio processing with pretrained models
An open-source autonomous driving framework that focuses on planning-oriented autonomous driving.
A comprehensive AI content generation and publishing system for WeChat, supporting multi-source data collection, intelligent analysis, and automated publishing.
A simple API for the VITS text-to-speech model, with additional features for vibe coders.
This repo is an experiment where ChatGPT manages a real-money micro-cap stock portfolio.
MindSpore is an open-source deep learning framework for mobile, edge, and cloud scenarios.
Universal and transferable attacks on aligned language models, useful for security researchers.
Build effective AI agents using Model Context Protocol and simple workflow patterns in Python.
Official implementation of the paper 'Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection'.
A Go-based tool that uses LLMs and LLM Vision (OCR) to digitize documents powered by AI.
A Kotlin library that runs Stable Diffusion on Android devices with Snapdragon NPU acceleration.
A Python-based LinkedIn automation tool for visiting profiles, connecting, and messaging using AI.
A blueprint for building production-ready RAG systems that minimize hallucination, with switchable pipelines.
AIGCPanel is an all-in-one AI digital human system that simplifies local model management, import, and use.
Interactive machine learning algorithms in Python with Jupyter demos
Open-source toolkit for evaluating large multi-modal AI models, supporting 220+ models and 80+ benchmarks.
A comprehensive library for few-shot learning in Python with PyTorch, featuring state-of-the-art algorithms.
A versatile image inpainting model that supports various AI-powered image editing capabilities.
An open platform for managing, monitoring, and optimizing large language models (LLMs) and AI workflows.
An open-source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them to Markdown.
Text-to-audio model for generating realistic speech and sounds
SHAP explains ML model outputs using Shapley values for interpretability.
AlphaFold 3 is a Python-based inference pipeline for protein structure prediction using deep learning.
This repository provides examples and tutorials to help developers build AI systems using popular AI tools and frameworks.
TimeGPT-1 is a production-ready pre-trained time series foundation model for forecasting and anomaly detection.
A streamlined framework for efficient evaluation and performance benchmarking of large models like LLMs and VLMs.
Reusable skills and extensions for Gemini API, enabling model capabilities and agent interactions
A lightweight PyTorch library with training tools and utilities for deep learning and machine learning developers.
A PyTorch implementation of the TernausNet model for image segmentation, pre-trained on the Kaggle Carvana dataset.
Get weekly updates on trending AI coding tools and projects.