Category
Showing 601-650 of 897 trending projects
Python demos for spatial data analytics, geostatistics, and machine learning to support courses.
Tegola is an open-source Mapbox Vector Tile server written in Go, enabling efficient geospatial data visualization.
Diagrams and documentation for InnoDB, the storage engine used by MySQL and MariaDB databases.
A C++ library for reading and writing .npy and .npz files, commonly used in scientific computing.
A Python library that provides common financial risk and performance metrics used in financial analysis.
A Python library providing SQL views for Dune Analytics, a popular blockchain data analysis platform.
A distributed, scalable Prometheus-compatible time series database written in Scala.
A book that teaches the basics of using the Redis in-memory data structure store.
TensorBase is a new big data warehousing solution built with Rust, focused on high-performance analytics.
QueryKit is a simple CoreData query language for Swift and Objective-C developers.
Percona Toolkit is a collection of advanced open source database tools for MySQL, MongoDB, and PostgreSQL.
A powerful suite of sparse matrix algorithms and libraries for scientific and numerical computing.
A frequency word list generator and processed files for text analysis and natural language processing.
Non-native graph database abstraction layer for Node.js and web browsers.
A fast and efficient C++ hash map and hash set implementation using robin hood hashing.
Open source SQL query assistant service for databases and data warehouses
A Python library that provides support for the pgvector vector database, enabling efficient vector search and storage.
Transporter is a powerful ETL tool that allows developers to sync data between various persistence engines.
Build vector tilesets from large collections of GeoJSON features.
CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.
A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.
A collection of simple tools for data cleaning and wrangling in R for data science tasks.
A Python library for performing multivariate exploratory data analysis, including techniques like PCA, CA, MCA, MFA, and FAMD.
A tool for comparing and evaluating databases for time series data.
Python interface for the igraph library, a powerful tool for network analysis and visualization.
A data platform that enables building data pipelines with SQL, Python, and ingesting from various sources.
LibRaw is a C++ library for reading RAW image files from digital cameras.
Nessie is a transactional data catalog for data lakes that provides Git-like semantics and functionality.
A C# library for reading and writing metadata in media files, useful for audio and video processing applications.
tidyr is an R package that provides a set of functions to tidy messy data into a format suitable for analysis.
This repository contains a collection of portfolio projects for a data analyst, not a developer discovery platform.
An R package that provides support for simple features, a standardized way to encode spatial vector data.
A parallel corpus of classical Chinese and modern Chinese texts for language processing and analysis.
A pure Go library for reading and writing Parquet files, a columnar data format.
Graft is an open-source transactional storage engine optimized for lazy, partial, and strongly consistent replication, ideal for edge, offline-first, and distributed applications.
A distributed, Redis-compatible NoSQL database that provides high performance and scalability.
A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL databases seamlessly
This repository provides code examples for Oracle's AI-enabled database features and integrations.
A Python tool that generates Entity Relationship Diagrams (ERDs) from SQLAlchemy models.
Firebird is a relational database management system (RDBMS) suitable for a wide range of applications from desktop to client-server to large databases.
MetPy is a Python library for reading, visualizing, and performing calculations with weather data.
PumpkinDB is an immutable, ordered key-value database engine written in Rust.
A powerful 3D visualization library for scientific data in Python.
A Python library that syncs data from Postgres to Elasticsearch/OpenSearch, enabling real-time data pipelines.
R package for Bayesian generalized multivariate non-linear multilevel models using Stan
A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.
An end-to-end data engineering project example showcasing tools and technologies for building data pipelines.
A Python library for conveniently reading data from the Tongdaxin financial data platform.
First open-source data discovery and observability platform for data practitioners.
This is a book that teaches how to use Apache Spark for lightning-fast data analytics.
Get weekly updates on trending AI coding tools and projects.