Showing 81-100 of 817 projects
Open-source code release for NeRF, a neural radiance field technique for 3D scene representation.
Public facing notes page for the CS231n course on Convolutional Neural Networks.
A fast, local neural text-to-speech system for developers building voice-enabled applications.
This repository contains tutorials, assignments, and competitions for MIT's deep learning courses, covering a wide range of AI and machine learning topics.
A comprehensive repository of resources for 3D machine learning, including papers, datasets, and frameworks.
A collection of techniques for deep learning with satellite and aerial imagery, including object detection and classification.
A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.
A TensorFlow-based neural network library for building AI models.
A deep learning library for generating and manipulating images, including semantic style transfer.
Implementation of Nougat Neural Optical Understanding for Academic Documents
A comprehensive repository for computer vision best practices, code samples, and documentation.
Moshi is an open-source speech-to-text foundation model and dialogue framework for building AI-powered voice apps.
LaMa is a PyTorch-based library for high-resolution image inpainting using Fourier convolutions.
A Python library for outlier and anomaly detection, integrating classical and deep learning techniques.
A high-level deep learning library for TensorFlow, enabling developers to build complex neural networks with ease.
An open-source project that uses deep learning and OCR to translate text in manga/images
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
AutoKeras is an open-source AutoML library for deep learning that automates the model selection and hyperparameter tuning process.
Comprehensive set of TensorFlow tutorials with accompanying YouTube videos for learning deep learning and machine learning.
A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.
Get weekly updates on trending AI coding tools and projects.