Trending Projects

Discover the fastest growing open source projects

Showing 451-500 of 897 trending projects

#451
nullptrlabs/pgmodeler

An open-source data modeling tool designed for PostgreSQL, allowing developers to generate DDL commands visually.

+74
+2.1%
3.5K
total stars
#452
alexkay/spek

An acoustic spectrum analyzer written in C++ for audio analysis and visualization.

+74
+2.4%
3.2K
total stars
#453
delta-io/delta-rs

A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.

+74
+2.4%
3.2K
total stars
#454
chezou/tabula-py

A simple Python wrapper for the Tabula Java library, which extracts tables from PDF files into Pandas DataFrames.

+74
+3.3%
2.3K
total stars
#455
skfolio/skfolio

A Python library for portfolio optimization using scikit-learn and convex optimization techniques.

+74
+4.1%
1.9K
total stars
#456
orlp/slotmap

A slot map data structure for Rust, providing containers with persistent unique keys for accessing stored values.

+74
+6.1%
1.3K
total stars
#457
mybatis/mybatis-3

MyBatis SQL Mapper for Java simplifies database interactions with object mapping.

+73
+0.4%
20.4K
total stars
#458
apache/beam

Apache Beam is a unified programming model for batch and streaming data processing.

+73
+0.9%
8.5K
total stars
#459
OpenRefine/OpenRefine

OpenRefine is a powerful data cleaning and transformation tool that helps developers work with messy data.

+71
+0.6%
11.8K
total stars
#460
eddwebster/football_analytics

A collection of football analytics projects, data, and analysis by Edd Webster (@eddwebster).

+70
+2.9%
2.5K
total stars
#461
enhancedformysql/The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

This repository provides a comprehensive guide on optimizing MySQL performance and solving common database problems.

+70
+3.8%
1.9K
total stars
#462
Kyubyong/numpy_exercises

A repository of NumPy exercises for developers looking to improve their Python and data manipulation skills.

+70
+4.2%
1.7K
total stars
#463
marsupialtail/quokka

A scalable, distributed ETL framework for building data lake analytics pipelines.

+70
+6.3%
1.2K
total stars
#464
pixiedust/pixiedust

A Python helper library for enhancing Jupyter Notebooks with data visualization and analysis capabilities.

+70
+7.2%
1.0K
total stars
#465
sacridini/Awesome-Geospatial

A comprehensive collection of geospatial tools and resources for data analysis, machine learning, and spatial applications.

+69
+1.5%
4.8K
total stars
#466
meltano/meltano

Meltano is a declarative, code-first data integration engine for building and scaling data and ML-powered products.

+69
+3.0%
2.4K
total stars
#467
scrollmapper/bible_databases

A collection of Bible versions and cross-reference databases.

+69
+4.8%
1.5K
total stars
#468
thinh-vu/vnstock

A beginner-friendly Python toolkit for financial data extraction, analysis, and automation.

+69
+6.3%
1.2K
total stars
#469
tidyverse/readr

A fast and flexible R package for reading flat files (CSV, TSV, fixed-width) into R data frames.

+69
+7.2%
1.0K
total stars
#470
msiemens/tinydb

A lightweight, document-oriented database optimized for happiness, used as a Python library.

+68
+0.9%
7.5K
total stars
#471
orioledb/orioledb

OrioleDB is a cloud-native PostgreSQL extension that solves performance and scalability challenges.

+68
+1.7%
4.0K
total stars
#472
DrTimothyAldenDavis/SuiteSparse

A powerful suite of sparse matrix algorithms and libraries for scientific and numerical computing.

+68
+4.9%
1.5K
total stars
#473
felt/tippecanoe

Build vector tilesets from large collections of GeoJSON features.

+68
+4.9%
1.4K
total stars
#474
datacrypt-project/hitchhiker-tree

A high-performance, persistent, off-heap data structure written in Clojure for data-intensive applications.

+68
+5.9%
1.2K
total stars
#475
rougier/scientific-visualization-book

An open-access book on scientific visualization using Python and Matplotlib for data-driven developers.

+67
+0.6%
11.2K
total stars
#476
apache/hamilton

Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.

+67
+2.9%
2.4K
total stars
#477
jstat/jstat

A JavaScript statistical library that provides a wide range of statistical functions for data analysis.

+67
+3.9%
1.8K
total stars
#478
liucongg/NLPDataSet

A repository containing various NLP datasets collected and organized by the owner.

+67
+6.6%
1.1K
total stars
#479
apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.

+66
+2.1%
3.2K
total stars
#480
faroit/awesome-python-scientific-audio

A curated list of Python software and packages for scientific research in audio.

+66
+4.1%
1.7K
total stars
#481
dineug/erd-editor

An open-source, TypeScript-based Entity-Relationship Diagram (ERD) editor for developers working with databases.

+66
+4.3%
1.6K
total stars
#482
san089/goodreads_etl_pipeline

An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.

+66
+4.6%
1.5K
total stars
#483
nicodv/kmodes

Python library for clustering categorical data using k-modes and k-prototypes algorithms.

+66
+5.4%
1.3K
total stars
#484
elixir-explorer/explorer

A fast and elegant data exploration library for Elixir, providing series and dataframes for data science workflows.

+66
+5.5%
1.3K
total stars
#485
TablePlus/DBngin

DBngin is a free, open-source, cross-platform database management tool for developers.

+66
+5.8%
1.2K
total stars
#486
valeriansaliou/sonic

A fast, lightweight search backend alternative to Elasticsearch.

+65
+0.3%
21.2K
total stars
#487
holistics/dbml

A database modeling language (DBML) that helps define and document database structures.

+65
+1.9%
3.5K
total stars
#488
huandu/go-sqlbuilder

A flexible and powerful SQL string builder library plus a zero-config ORM for Go developers.

+65
+4.0%
1.7K
total stars
#489
dbt-labs/metricflow

MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.

+65
+4.5%
1.5K
total stars
#490
jeremycole/innodb_diagrams

Diagrams and documentation for InnoDB, the storage engine used by MySQL and MariaDB databases.

+65
+4.6%
1.5K
total stars
#491
NiuTrans/Classical-Modern

A parallel corpus of classical Chinese and modern Chinese texts for language processing and analysis.

+65
+4.8%
1.4K
total stars
#492
PoloDB/PoloDB

PoloDB is an embedded document database written in Rust for building cross-platform, local-first applications.

+65
+5.8%
1.2K
total stars
#493
MIT-LCP/mimic-code

Open-source repository for sharing code related to the MIMIC family of critical care databases.

+64
+2.1%
3.1K
total stars
#494
lerocha/chinook-database

A sample database for SQL Server, Oracle, MySQL, PostgreSQL, SQLite, and DB2.

+64
+2.7%
2.5K
total stars
#495
DQinYuan/chinese_province_city_area_mapper

A Python module for extracting and mapping Chinese province, city, and district data.

+64
+3.7%
1.8K
total stars
#496
alan-turing-institute/CleverCSV

A Python package for handling messy CSV files with improved dialect detection and a command-line interface.

+64
+5.1%
1.3K
total stars
#497
allegro/bigcache

Efficient in-memory cache in Go for storing and retrieving large amounts of data.

+63
+0.8%
8.1K
total stars
#498
vlcn-io/cr-sqlite

A run-time loadable SQLite extension that provides multi-writer and CRDT support for SQLite databases.

+63
+1.8%
3.6K
total stars
#499
malloydata/malloy

Malloy is an open-source language for describing data relationships and transformations.

+63
+2.7%
2.4K
total stars
#500
feldera/feldera

The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.

+63
+3.6%
1.8K
total stars