Category
Showing 201-250 of 897 trending projects
A lightweight SQLite3 driver for Go that implements the database/sql interface.
A lightweight Python OLAP framework for multi-dimensional data analysis and reporting.
A Rust-based implementation of an LSM-Tree storage engine (database) for developers to build and learn from.
A high-performance GPU DataFrame library for data analysis and machine learning workloads.
OrioleDB is a cloud-native PostgreSQL extension that solves performance and scalability challenges.
A Chinese translation of a popular book on using Python for data analysis with libraries like pandas and numpy.
A free, open-source SQLite database manager for multiple platforms.
A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.
An in-depth tutorial covering mainstream database knowledge for backend developers.
Open source research data repository software built with Java.
Fast, lightweight search backend alternative to Elasticsearch
Distributed SQL database middleware for sharding, scalability, and security
This repository contains code samples for SQL Server, Azure SQL, and related data services from Microsoft.
A database migration and schema management tool for PHP developers, supporting multiple database engines.
ORM for Node.js/TypeScript with multiple database support
GDAL is an open-source library for working with various geospatial data formats, useful for remote sensing and GIS applications.
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
A Redis module that provides a time series data structure for storing and querying time series data.
This repository provides a comprehensive guide on optimizing MySQL performance and solving common database problems.
This repository contains efficient tools for LiDAR processing, focused on working with point cloud data.
Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.
A tutorial and implementation of a disease-centered medical knowledge graph and QA system.
A Python library for data migration and transformation in the Blaze project.
WCDB is a cross-platform database framework developed by WeChat for Android, iOS, Linux, macOS, and Windows.
Lightweight local JSON database for JavaScript/TypeScript apps
PRQL is a modern, powerful, and pipelined SQL replacement for transforming data.
Kedro is a Python toolkit for building production-ready data science and machine learning pipelines.
A specification for storing geospatial vector data (point, line, polygon) in the Parquet file format, enabling efficient cloud-native geospatial data processing.
The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.
A JavaScript library for visualizing and understanding complex data structures.
Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.
An open-source N-body simulation library for astrophysics and planetary science.
An educational relational database management system (RDBMS) implementation in C++.
Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.
A high-performance, concurrent, embedded key-value database written in Rust for vibe coders.
An Awesome List of open-source data engineering projects for developers.
A concise guide to the MongoDB NoSQL database for developers.
A collection of open data sets and tools for data science and machine learning tasks.
Apache Cassandra is a distributed, wide-column store database system designed for high availability, scalability, and performance.
A cross-platform TUI database management tool written in Go for developers working with databases.
OpenRefine is a powerful data cleaning and transformation tool that helps developers work with messy data.
A tutorial for using the popular Python data analysis library Pandas, presented at PyCon 2015.
An exabyte-scale, multi-region distributed file system for developers building AI-powered applications.
A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.
Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.
A free, interactive SQL learning platform with an online SQL editor, real-time query results, and syntax highlighting.
A collection of data science projects in Python using Jupyter Notebook.
A curated list of awesome database tools and resources to make working with databases easier.
Framework for collecting and analyzing prediction market data with comprehensive Polymarket/Kalshi datasets.
Get weekly updates on trending AI coding tools and projects.