Category
Showing 801-850 of 897 trending projects
A functional, type-safe, composable Scala data access library for Postgres databases.
A Python library that provides a Predictive Power Score (PPS) to measure the predictive power between variables.
An R project focused on providing high-performance statistical models, data analysis, and visualization tools.
A Python library that generates fake data for custom test databases.
A collection of articles and source code on using the pandas data analysis library.
Python demos for spatial data analytics, geostatistics, and machine learning to support courses.
Diagrams and documentation for InnoDB, the storage engine used by MySQL and MariaDB databases.
A powerful 3D visualization library for scientific data in Python.
A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.
A Python package for handling messy CSV files with improved dialect detection and a command-line interface.
A PHP library that provides a MySQL backup functionality, similar to the mysqldump CLI tool.
A Python-based image processing framework with plugins for common image processing libraries.
A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.
A PHP library for dumping the contents of a database to a file, supporting multiple database engines.
Contextualise is a powerful tool for organizing diverse information resources in knowledge-intensive projects.
A Python library for comparing data across databases, supporting various database engines.
Feather is a fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow.
A JavaScript library that converts CSV and tab-delimited data to web-friendly formats like JSON and XML.
A fast, hierarchical key-value storage engine written in C++ for applications that require high performance and scalability.
A Rust library that enables querying Excel spreadsheets using SQLite, making data extraction and analysis more efficient.
A library for text mining and natural language processing using tidy data principles in R.
A collection of monthly reports on the internals of Alibaba Cloud's database products.
A repository containing various NLP datasets collected and organized by the owner.
Collaborative offline-first SQLite wrapper for syncing app state across users & devices
A composable data framework for building ambitious web applications using TypeScript.
Open source time series library for Python, useful for statistical analysis and modeling.
A high-performance C++ linear algebra library focused on solvers, sparse matrices, and numerical computing.
TensorBase is a new big data warehousing solution built with Rust, focused on high-performance analytics.
DataLink is a real-time and offline data exchange platform that supports synchronization between heterogeneous data sources.
A data warehouse for COVID-19 time series data, useful for data analysis and visualization.
A versatile ORM for multiple databases including MySQL, SQLite, MariaDB, PostgreSQL, and MongoDB in Deno.
A blazingly fast analytics database built with Rust, optimized for rapidly devouring large amounts of data.
HiBench is a big data benchmark suite for evaluating the performance of different big data frameworks.
A collection of efficient Python tricks and tools for data scientists to improve their productivity.
EJDB2 is an embeddable JSON database engine with a simple XPath-like query language (JQL) for C/C++ applications.
A book that teaches the basics of using the Redis in-memory data structure store.
CarbonData is a high-performance data store solution for big data analytics on Hadoop and Spark.
Quilt is a data mesh for connecting people with actionable data, built with TypeScript.
A Swift extension for RealmSwift that provides reactive programming support using RxSwift.
A pure Python library for reading and writing ESRI Shapefiles, a popular geospatial data format.
A library for calling Python functions from the Ruby language, enabling data science and ML workflows.
FeatureBase is a fast analytical database built on bitmaps, perfect for ML and data-intensive applications.
A space-efficient trie data structure in Go with fast lookup performance.
A columnar storage extension for Postgres built as a foreign data wrapper.
A MongoDB schema analysis tool that helps developers understand and optimize their NoSQL database.
SQLite with Branches - a lightweight, embedded database with version control capabilities.
A distributed, scalable Prometheus-compatible time series database written in Scala.
A tool to easily import CSV and JSON data into PostgreSQL databases.
Anatomy of Matplotlib tutorial for SciPy conference, focused on data visualization for scientific computing.
The LevelDB key-value database in the Go programming language.
Get weekly updates on trending AI coding tools and projects.