Trending Projects

Discover the fastest growing open source projects

Showing 451-500 of 897 trending projects

#451

nullptrlabs/pgmodeler

An open-source data modeling tool designed for PostgreSQL, allowing developers to generate DDL commands visually.

+74

+2.1%

3.5K

total stars

C++

#452

alexkay/spek

An acoustic spectrum analyzer library written in C++ for audio analysis and visualization.

+74

+2.4%

3.2K

total stars

C++

#453

delta-io/delta-rs

A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.

+74

+2.4%

3.2K

total stars

Rust

#454

chezou/tabula-py

A simple Python wrapper for the Tabula Java library, which extracts tables from PDF files into Pandas DataFrames.

+74

+3.3%

2.3K

total stars

Python

#455

skfolio/skfolio

A Python library for portfolio optimization using scikit-learn and convex optimization techniques.

+74

+4.1%

1.9K

total stars

Python

#456

orlp/slotmap

A Rust data structure for efficiently storing and accessing data in a sparse set.

+74

+6.1%

1.3K

total stars

Rust

#457

mybatis/mybatis-3

MyBatis SQL Mapper for Java simplifies database interactions with object mapping.

+73

+0.4%

20.4K

total stars

Java

#458

apache/beam

Apache Beam is a unified programming model for batch and streaming data processing.

+73

+0.9%

8.5K

total stars

Java

#459

OpenRefine/OpenRefine

OpenRefine is a powerful data cleaning and transformation tool that helps developers work with messy data.

+71

+0.6%

11.8K

total stars

Java

#460

eddwebster/football_analytics

A collection of football analytics projects, data, and analysis by Edd Webster (@eddwebster).

+70

+2.9%

2.5K

total stars

Jupyter Notebook

#461

enhancedformysql/The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

This repository provides a comprehensive guide on optimizing MySQL performance and solving common database problems.

+70

+3.8%

1.9K

total stars

#462

Kyubyong/numpy_exercises

A repository of NumPy exercises for developers looking to improve their Python and data manipulation skills.

+70

+4.2%

1.7K

total stars

Python

#463

marsupialtail/quokka

A scalable, distributed ETL framework for building data lake analytics pipelines.

+70

+6.3%

1.2K

total stars

Python

#464

pixiedust/pixiedust

A Python helper library for enhancing Jupyter Notebooks with data visualization and analysis capabilities.

+70

+7.2%

1.0K

total stars

Jupyter Notebook

#465

sacridini/Awesome-Geospatial

A comprehensive collection of geospatial tools and resources for data analysis, machine learning, and spatial applications.

+69

+1.5%

4.8K

total stars

#466

meltano/meltano

Meltano is a declarative, code-first data integration engine for building and scaling data and ML-powered products.

+69

+3.0%

2.4K

total stars

Python

#467

scrollmapper/bible_databases

This GitHub repository provides a collection of Bible versions and cross-reference databases, but it does not appear to be related to the given developer discovery platform focused on vibe coders.

+69

+4.8%

1.5K

total stars

Python

#468

thinh-vu/vnstock

A beginner-friendly Python toolkit for financial data extraction, analysis, and automation.

+69

+6.3%

1.2K

total stars

Python

#469

tidyverse/readr

A fast and flexible R package for reading flat files (CSV, TSV, fixed-width) into R data frames.

+69

+7.2%

1.0K

total stars

#470

msiemens/tinydb

A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.

+68

+0.9%

7.5K

total stars

Python

#471

orioledb/orioledb

OrioleDB is a cloud-native PostgreSQL extension that solves performance and scalability challenges.

+68

+1.7%

4.0K

total stars

#472

DrTimothyAldenDavis/SuiteSparse

A powerful suite of sparse matrix algorithms and libraries for scientific and numerical computing.

+68

+4.9%

1.5K

total stars

#473

felt/tippecanoe

Build vector tilesets from large collections of GeoJSON features.

+68

+4.9%

1.4K

total stars

C++

#474

datacrypt-project/hitchhiker-tree

A high-performance, persistent, off-heap data structure written in Clojure for data-intensive applications.

+68

+5.9%

1.2K

total stars

Clojure

#475

rougier/scientific-visualization-book

An open-access book on scientific visualization using Python and Matplotlib for data-driven developers

+67

+0.6%

11.2K

total stars

Python

#476

apache/hamilton

Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.

+67

+2.9%

2.4K

total stars

Jupyter Notebook

#477

jstat/jstat

A JavaScript statistical library that provides a wide range of statistical functions for data analysis.

+67

+3.9%

1.8K

total stars

JavaScript

#478

liucongg/NLPDataSet

A repository containing various NLP datasets collected and organized by the owner.

+67

+6.6%

1.1K

total stars

#479

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.

+66

+2.1%

3.2K

total stars

Java

#480

faroit/awesome-python-scientific-audio

Curated list of Python software and packages for scientific research in audio

+66

+4.1%

1.7K

total stars

#481

dineug/erd-editor

An open-source, TypeScript-based Entity-Relationship Diagram (ERD) editor for developers working with databases.

+66

+4.3%

1.6K

total stars

TypeScript

#482

san089/goodreads_etl_pipeline

An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.

+66

+4.6%

1.5K

total stars

Python

#483

nicodv/kmodes

Python library for clustering categorical data using k-modes and k-prototypes algorithms.

+66

+5.4%

1.3K

total stars

Python

#484

elixir-explorer/explorer

A fast and elegant data exploration library for Elixir, providing series and dataframes for data science workflows.

+66

+5.5%

1.3K

total stars

Elixir

#485

TablePlus/DBngin

DBngin is a free, open-source, cross-platform database management tool for developers.

+66

+5.8%

1.2K

total stars

#486

valeriansaliou/sonic

Fast, lightweight search backend alternative to Elasticsearch

+65

+0.3%

21.2K

total stars

Rust

#487

holistics/dbml

A database modeling language (DBML) that helps define and document database structures.

+65

+1.9%

3.5K

total stars

JavaScript

#488

huandu/go-sqlbuilder

A flexible and powerful SQL string builder library plus a zero-config ORM for Go developers.

+65

+4.0%

1.7K

total stars

#489

dbt-labs/metricflow

MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.

+65

+4.5%

1.5K

total stars

Python

#490

jeremycole/innodb_diagrams

Diagrams and documentation for InnoDB, the storage engine used by MySQL and MariaDB databases.

+65

+4.6%

1.5K

total stars

#491

NiuTrans/Classical-Modern

A parallel corpus of classical Chinese and modern Chinese texts for language processing and analysis.

+65

+4.8%

1.4K

total stars

Python

#492

PoloDB/PoloDB

PoloDB is an embedded document database written in Rust for building cross-platform, local-first applications.

+65

+5.8%

1.2K

total stars

Rust

#493

MIT-LCP/mimic-code

Open-source repository for sharing code related to the MIMIC family of critical care databases.

+64

+2.1%

3.1K

total stars

Jupyter Notebook

#494

lerocha/chinook-database

Sample database for SQL Server, Oracle, MySQL, PostgreSQL, SQLite, DB2

+64

+2.7%

2.5K

total stars

TSQL

#495

DQinYuan/chinese_province_city_area_mapper

A Python module for extracting and mapping Chinese province, city, and district data.

+64

+3.7%

1.8K

total stars

Python

#496

alan-turing-institute/CleverCSV

A Python package for handling messy CSV files with improved dialect detection and a command-line interface.

+64

+5.1%

1.3K

total stars

Python

#497

allegro/bigcache

Efficient in-memory cache in Go for storing and retrieving large amounts of data.

+63

+0.8%

8.1K

total stars

#498

vlcn-io/cr-sqlite

A Rust library that provides multi-writer and CRDT support for SQLite databases.

+63

+1.8%

3.6K

total stars

Rust

#499

malloydata/malloy

Malloy is an open-source language for describing data relationships and transformations.

+63

+2.7%

2.4K

total stars

TypeScript

#500

feldera/feldera

The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.

+63

+3.6%

1.8K

total stars

Rust

1...911...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.