A high-performance, zero-overhead, and extensible Python compiler with built-in NumPy support.
A high-performance task-parallel programming system for C++ developers building concurrent and heterogeneous applications.
A header-only, fast and memory-friendly hashmap and btree container library for parallel and concurrent applications.
NVIDIA's CUDA Core Compute Libraries for accelerated computing and GPU programming in C++.
A collection of research papers and tools related to using machine learning for compiler and system optimization.
TornadoVM is a heterogeneous programming framework that enables developers to leverage GPUs and other accelerators for improved performance in Java applications.