Showing 3441-3460 of 5,250 projects
Nessie is a transactional data catalog for data lakes that provides Git-like semantics and functionality.
Scalable data pre processing and curation toolkit for Large Language Models (LLMs)
A high-performance C++ library for neural machine translation, with CUDA support for GPU acceleration.
A Python module to fetch stock data from Yahoo Finance API for financial analysis and trading applications.
Open-source GNSS + inertial navigation simulator for motion trajectory generation and sensor fusion.
A fast, reliable search database written in Rust without the AI hype.
An open-source, multi-tenant, self-building knowledge graph for developers building with AI tools.
Lightning-fast cluster computing in Java, Scala and Python.
tidyr is an R package that provides a set of functions to tidy messy data into a format suitable for analysis.
This repository contains a collection of portfolio projects for a data analyst, not a developer discovery platform.
Type-driven code generation for Go, enabling powerful generic programming.
DataComp for Language Models is a library for training, evaluating, and deploying large language models.
A modern C++ order matching engine for building trading platforms and financial applications.
An R package that provides support for simple features, a standardized way to encode spatial vector data.
Python driver for Apache Cassandra, a distributed database management system.
A magic memoization function in PHP that helps improve performance by caching function results.
Alpakka Kafka connector - a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
A free newsletter with bite-sized R-tips and code tutorials for data scientists and developers.
A Python library and tools for generating and inspecting data for pre-training large language models (LLMs).
Async, Netty-based database drivers for PostgreSQL and MySQL, written in Scala.
Get weekly updates on trending AI coding tools and projects.