Category
Showing 551-600 of 897 trending projects
A framework-agnostic, datastore-agnostic JavaScript ORM built for ease of use and peace of mind.
A Go-based tool for database anonymization and synthetic data generation to help with security, QA, and data masking.
A Python library for pulling current and historical baseball statistics, including Statcast, Baseball Reference, and FanGraphs data.
Concurrent data pipelines in Python for building efficient and scalable data processing workflows.
A curated list of awesome materials and resources for database development.
An open-source, TypeScript-based Entity-Relationship Diagram (ERD) editor for developers working with databases.
A Python library for scraping soccer data from various sources for sports analytics and data science.
Cartopy is a Python library for creating maps and visualizing spatial data with matplotlib support.
Dozer is a real-time data movement tool that leverages CDC to move data between various sources and sinks.
A free database of geographic place names and corresponding geospatial data for developers to use.
A collection of articles and source code on using the pandas data analysis library.
A data quality and observability tool for monitoring and fixing data issues before they become problems.
A fast spatial index library for 2D points and rectangles in JavaScript, useful for geospatial applications.
Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.
A Python library for extracting schema, statistics, and entities from datasets, useful for data profiling and privacy analysis.
A collection of SQL queries to analyze social media datasets.
Agile data preparation workflows made easy with popular Python data science libraries.
cryo is a Rust library for extracting blockchain data to parquet, CSV, JSON, or Python dataframes.
AWS Glue code samples for building data integration and ETL pipelines on AWS.
A Python library for retrieving administrative division codes for China's GB/T 2260 standard.
A fast, embeddable column database written in Go, optimized for AI/ML workloads.
This GitHub repository provides a collection of Bible versions and cross-reference databases, but it does not appear to be related to the given developer discovery platform focused on vibe coders.
MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.
A searchable compilation of Kaggle past solutions for data science and machine learning developers.
Tonbo is an embedded database for serverless and edge runtimes, optimized for offline-first and big data use cases.
A JavaScript library for efficient querying and transformation of array-backed data tables.
A curated collection of resources related to image registration, including books, papers, videos, and toolboxes.
Open source hot backup tool for InnoDB and XtraDB databases
A C++ library for reading and writing large multi-dimensional arrays, useful for scientific and data-intensive applications.
A Go ORM and query builder for interacting with databases in Go applications.
An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.
A concise guide to the MongoDB NoSQL database for developers.
A comprehensive Python library for modeling and forecasting financial time series data using ARCH models.
HiBench is a big data benchmark suite for evaluating the performance of different big data frameworks.
A popular Scala library for parsing and manipulating JSON data in Scala applications.
An offline IP database for developers to look up IP address geolocation information.
A Python library for cleaning and transforming data, inspired by the R package Janitor.
A collection of SQL practice problems for developers to improve their SQL skills.
A data workflow tool for data engineers and analysts, similar to 'Make for data'.
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
A collection of efficient Python tricks and tools for data scientists to improve their productivity.
A lightweight Python OLAP framework for multi-dimensional data analysis and reporting.
Synth is a Rust library for generating realistic, randomized test data for applications and databases.
AgensGraph is a transactional graph database based on PostgreSQL for enterprise-level applications.
A cross-platform way to express data transformation, relational algebra, and standardized record expression and plans.
PySAL is a Python Spatial Analysis Library meta-package for geographical data analysis and modeling.
A Python library for calculating customer lifetime value metrics and cohort analysis.
Dremio is an open-source data analytics platform that simplifies and accelerates big data analysis.
A high-performance compression library written in C for developers working with large data sets.
EJDB2 is an embeddable JSON database engine with a simple XPath-like query language (JQL) for C/C++ applications.
Get weekly updates on trending AI coding tools and projects.