Trending Projects

Discover the fastest growing open source projects

Showing 551-600 of 897 trending projects

#551
js-data/js-data

A framework-agnostic, datastore-agnostic JavaScript ORM built for ease of use and peace of mind.

0
0.0%
1.6K
total stars
#552
GreenmaskIO/greenmask

A Go-based tool for database anonymization and synthetic data generation to help with security, QA, and data masking.

0
0.0%
1.6K
total stars
#553
jldbc/pybaseball

A Python library for pulling current and historical baseball statistics, including Statcast, Baseball Reference, and FanGraphs data.

0
0.0%
1.6K
total stars
#554
cgarciae/pypeln

Concurrent data pipelines in Python for building efficient and scalable data processing workflows.

0
0.0%
1.6K
total stars
#555
huachaohuang/awesome-dbdev

A curated list of awesome materials and resources for database development.

0
0.0%
1.6K
total stars
#556
dineug/erd-editor

An open-source, TypeScript-based Entity-Relationship Diagram (ERD) editor for developers working with databases.

0
0.0%
1.6K
total stars
#557
probberechts/soccerdata

A Python library for scraping soccer data from various sources for sports analytics and data science.

0
0.0%
1.6K
total stars
#558
SciTools/cartopy

Cartopy is a Python library for creating maps and visualizing spatial data with matplotlib support.

0
0.0%
1.6K
total stars
#559
getdozer/dozer

Dozer is a real-time data movement tool that leverages CDC to move data between various sources and sinks.

0
0.0%
1.6K
total stars
#560
delight-im/FreeGeoDB

A free database of geographic place names and corresponding geospatial data for developers to use.

0
0.0%
1.6K
total stars
#561
TomAugspurger/effective-pandas

A collection of articles and source code on using the pandas data analysis library.

0
0.0%
1.6K
total stars
#562
re-data/re-data

A data quality and observability tool for monitoring and fixing data issues before they become problems.

0
0.0%
1.6K
total stars
#563
mourner/flatbush

A fast spatial index library for 2D points and rectangles in JavaScript, useful for geospatial applications.

0
0.0%
1.6K
total stars
#564
narwhals-dev/narwhals

Lightweight and extensible compatibility layer between popular dataframe libraries like pandas, Dask, and PySpark.

0
0.0%
1.5K
total stars
#565
capitalone/DataProfiler

A Python library for extracting schema, statistics, and entities from datasets, useful for data profiling and privacy analysis.

0
0.0%
1.5K
total stars
#566
antontarasenko/smq

A collection of SQL queries to analyze social media datasets.

0
0.0%
1.5K
total stars
#567
hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

0
0.0%
1.5K
total stars
#568
paradigmxyz/cryo

cryo is a Rust library for extracting blockchain data to parquet, CSV, JSON, or Python dataframes.

0
0.0%
1.5K
total stars
#569
aws-samples/aws-glue-samples

AWS Glue code samples for building data integration and ETL pipelines on AWS.

0
0.0%
1.5K
total stars
#570
cn/GB2260

A Python library for retrieving administrative division codes for China's GB/T 2260 standard.

0
0.0%
1.5K
total stars
#571
polarsignals/frostdb

A fast, embeddable column database written in Go, optimized for AI/ML workloads.

0
0.0%
1.5K
total stars
#572
scrollmapper/bible_databases

A collection of Bible versions and cross-reference databases.

0
0.0%
1.5K
total stars
#573
dbt-labs/metricflow

MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.

0
0.0%
1.5K
total stars
#574
EliotAndres/kaggle-past-solutions

A searchable compilation of Kaggle past solutions for data science and machine learning developers.

0
0.0%
1.5K
total stars
#575
tonbo-io/tonbo

Tonbo is an embedded database for serverless and edge runtimes, optimized for offline-first and big data use cases.

0
0.0%
1.5K
total stars
#576
uwdata/arquero

A JavaScript library for efficient querying and transformation of array-backed data tables.

0
0.0%
1.5K
total stars
#577
Awesome-Image-Registration-Organization/awesome-image-registration

A curated collection of resources related to image registration, including books, papers, videos, and toolboxes.

0
0.0%
1.5K
total stars
#578
percona/percona-xtrabackup

An open source hot backup tool for InnoDB and XtraDB databases.

0
0.0%
1.5K
total stars
#579
google/tensorstore

A C++ library for reading and writing large multi-dimensional arrays, useful for scientific and data-intensive applications.

0
0.0%
1.5K
total stars
#580
gobuffalo/pop

A Go ORM and query builder for interacting with databases in Go applications.

0
0.0%
1.5K
total stars
#581
san089/goodreads_etl_pipeline

An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.

0
0.0%
1.5K
total stars
#582
karlseguin/the-little-mongodb-book

A concise guide to the MongoDB NoSQL database for developers.

0
0.0%
1.5K
total stars
#583
bashtage/arch

A comprehensive Python library for modeling and forecasting financial time series data using ARCH models.

0
0.0%
1.5K
total stars
#584
Intel-bigdata/HiBench

HiBench is a big data benchmark suite for evaluating the performance of different big data frameworks.

0
0.0%
1.5K
total stars
#585
json4s/json4s

A popular Scala library for parsing and manipulating JSON data in Scala applications.

0
0.0%
1.5K
total stars
#586
itbdw/ip-database

An offline IP database for developers to look up IP address geolocation information.

0
0.0%
1.5K
total stars
#587
pyjanitor-devs/pyjanitor

A Python library for cleaning and transforming data, inspired by the R package Janitor.

0
0.0%
1.5K
total stars
#588
XD-DENG/SQL-exercise

A collection of SQL practice problems for developers to improve their SQL skills.

0
0.0%
1.5K
total stars
#589
Factual/drake

A data workflow tool for data engineers and analysts, in the style of "make for data".

0
0.0%
1.5K
total stars
#590
locationtech/geomesa

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

0
0.0%
1.5K
total stars
#591
CodeCutTech/Efficient_Python_tricks_and_tools_for_data_scientists

A collection of efficient Python tricks and tools for data scientists to improve their productivity.

0
0.0%
1.5K
total stars
#592
DataBrewery/cubes

A lightweight Python OLAP framework for multi-dimensional data analysis and reporting.

0
0.0%
1.5K
total stars
#593
shuttle-hq/synth

Synth is a Rust library for generating realistic, randomized test data for applications and databases.

0
0.0%
1.5K
total stars
#594
skaiworldwide-oss/agensgraph

AgensGraph is a transactional graph database based on PostgreSQL for enterprise-level applications.

0
0.0%
1.5K
total stars
#595
substrait-io/substrait

A cross-platform way to express data transformation, relational algebra, and standardized record expression and plans.

0
0.0%
1.5K
total stars
#596
pysal/pysal

PySAL is a Python Spatial Analysis Library meta-package for geographical data analysis and modeling.

0
0.0%
1.5K
total stars
#597
CamDavidsonPilon/lifetimes

A Python library for calculating customer lifetime value metrics and cohort analysis.

0
0.0%
1.5K
total stars
#598
dremio/dremio-oss

Dremio is an open-source data analytics platform that simplifies and accelerates big data analysis.

0
0.0%
1.5K
total stars
#599
Cyan4973/FiniteStateEntropy

A high-performance compression library written in C for developers working with large data sets.

0
0.0%
1.5K
total stars
#600
Softmotions/ejdb

EJDB2 is an embeddable JSON database engine with a simple XPath-like query language (JQL) for C/C++ applications.

0
0.0%
1.5K
total stars