An LLM-based research assistant that lets you converse with research papers.
Open source neural machine translation library in Torch, focused on deep learning for natural language processing.
A Chinese prompt plugin for the Stable Diffusion WebUI, providing a library of prompts for AI-assisted image generation.
TokenOps provides easy token price estimates for over 400 large language models (LLMs).
A PyTorch library for calculating the Structural Similarity Index (SSIM) loss for image analysis and processing tasks.
This GitHub repository appears to be a backup of a vulnerability library, likely for developers working with AI tools.
An open-source deep reinforcement learning library that combines several state-of-the-art improvements.
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Phantom is a subject-consistent video generation tool that aligns text and video via cross-modal alignment.
A Python script that automatically updates a daily list of Computer Vision papers from arXiv using GitHub Actions.
A Python toolbox to investigate neural networks' predictions and understand model behavior.
A platform for creating, using, and sharing ChatGPT prompts, focused on the vibe coder developer community.
Dromedary is a framework for building helpful, ethical, and reliable large language models (LLMs).
A high-performance and lightweight PyTorch-based license plate recognition framework.
Multi-camera live object tracking and traffic counting using YOLO v4, Deep SORT, and Flask.
A curated list of practical natural language processing tools and libraries in Ruby for developers.
Xwin-LM is a powerful, stable, and reproducible LLM alignment library for Python developers.
A study plan for software engineers to become machine learning engineers.
A multi-voice text-to-speech (TTS) system with a focus on high-quality audio output.
A basic machine learning and deep learning library written in Python.
An AI-powered tool for automated test generation and code coverage enhancement.
DouZero is a deep reinforcement learning framework for mastering the Chinese card game DouDizhu.
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Official implementation of a paper on multimodal chain-of-thought reasoning in language models.
Deformable DETR is a state-of-the-art object detection model that uses deformable transformers for end-to-end detection.
A 3D computer vision framework for 3D reconstruction, camera tracking, and photogrammetry.
A Python library for automatic keyword extraction and text summarization from Chinese text using the TextRank algorithm.
Optical character recognition for Japanese manga comics, built with Python and deep learning.
Tacotron-2 is a state-of-the-art text-to-speech model that vibe coders can use to build speech synthesis applications.
Burr is a Python-based framework for building AI-powered applications and agents that can monitor, trace, and execute on your own infrastructure.
SLING is a natural language frame semantics parser written in C++ that can be used for machine learning and natural language processing.
A spaCy pipeline and models for processing scientific/biomedical documents.
A curated list of data mining papers about fraud detection.
SOLO and SOLOv2 are instance segmentation models for computer vision tasks, built with PyTorch.
Reference implementations of MLPerf® training benchmarks for evaluating machine learning performance.
A curated collection of resources for remote sensing and foundation models in machine learning.
A rigorous benchmark for evaluating the code quality and efficiency of large language models like GPT-4.
A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.
The Hylo programming language is an open-source language built around mutable value semantics and generic programming.
This repository contains the source code for a tool to analyze the real context size of long-context language models.
Unsupervised image classification library using contrastive learning and SCAN algorithm.
This repository provides code and models for large-scale computer vision research projects using AI tools.
A large and diverse 3D human motion-language dataset for deep learning and motion generation.
A CUDA-accelerated robotics library for motion planning and control using PyTorch.
A 3D procedural game engine using OpenGL for building procedural environments, terrains, and games.
Real-time facial emotion detection using deep learning and computer vision.
An open-source AI agent that interacts with graphical user interfaces using natural language.
A library for building multi-layer recurrent neural networks (RNNs) for word-level language models in Python using TensorFlow.
An AI-powered video agent framework for next-generation video interactions and workflows.
A new padding scheme for convolutional neural networks using partial convolution-based padding.