Category
Showing 401-450 of 897 trending projects
A Python library for creating easy-to-use, visually appealing data tables and summaries.
Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.
A high-performance B-tree implementation for Go, useful for building database-like applications.
Intake is a lightweight Python package for discovering, investigating, loading and distributing data.
An Awesome List of open-source data engineering projects for developers.
A Python library with most common stock market technical indicators, making it easy to implement quantitative finance and algorithmic trading.
A Python library to access historical market data from the Binance cryptocurrency exchange.
No description provided for this medical data repository.
A collection of solutions to Chinese data competitions, primarily using Python.
Mondrian is an OLAP server that enables real-time analysis of large data sets for business users.
A definition and DDLs for the OMOP Common Data Model (CDM), a data model for healthcare data.
Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.
Programmable CUDA/C++ GPU Graph Analytics library for high-performance parallel graph processing.
An open-source repository for parsing electricity data and powering a comprehensive electricity data platform.
A Rust library that provides persistent data structures for efficient and immutable data management.
An educational distributed SQL database written in Rust, not focused on AI coding tools.
A concise guide to the MongoDB NoSQL database for developers.
NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.
A collection of study notes, ebooks, and resources on big data, machine learning, Linux, and more for developers.
A curated list of resources for time series forecasting, including papers, code, and other materials.
A Rust-based implementation of an LSM-Tree storage engine (database) for developers to build and learn from.
A free database of geographic place names and corresponding geospatial data for developers to use.
A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.
A Python library for analyzing movement trajectory data using GeoPandas.
Modern database IDE for dev & data workflows, supporting MySQL, PostgreSQL & MongoDB.
Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.
A high-performance, concurrent, embedded key-value database written in Rust for vibe coders.
A Python toolbox for gaining geometric insights into high-dimensional data, useful for vibe coders working with AI tools.
A curated list of awesome materials and resources for database development.
A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.
A data quality and observability tool for monitoring and fixing data issues before they become problems.
Redisson is a Java client for Redis and Valkey with distributed objects and services
Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.
Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.
Self-Driving Database Management System from Carnegie Mellon University
A powerful Python package to manage and work with extremely large amounts of data.
A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.
This is a Python library focused on basketball analytics and data processing.
A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.
A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK)
A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.
PaxosStore is a high-performance, distributed database solution built for large-scale applications.
A Python library for calculating customer lifetime value metrics and cohort analysis.
Rust-based bindings for the NumPy C-API, enabling developers to leverage Rust for numerical computing.
A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.
A JavaScript library that allows you to run SQLite on the web, enabling local database functionality for web apps.
Fast in-memory cache library for Go with low GC overhead, optimized for a large number of entries.
A corpus of company names, abbreviations, and brands that can be used for Chinese text segmentation and entity recognition.
An ordered map implementation in Go with amortized O(1) performance for common operations.
Get weekly updates on trending AI coding tools and projects.