Trending Projects

Discover the fastest growing open source projects

Showing 201-250 of 897 trending projects

#201
dbt-labs/metricflow

MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.

+3
+0.2%
1.5K
total stars
#202
scrollmapper/bible_databases

This GitHub repository provides a collection of Bible versions and cross-reference databases, but it does not appear to be related to the given developer discovery platform focused on vibe coders.

+3
+0.2%
1.5K
total stars
#203
LibRaw/LibRaw

LibRaw is a C++ library for reading RAW image files from digital cameras.

+3
+0.2%
1.4K
total stars
#204
NiuTrans/Classical-Modern

A parallel corpus of classical Chinese and modern Chinese texts for language processing and analysis.

+3
+0.2%
1.4K
total stars
#205
LongOnly/Quantitative-Notebooks

Educational notebooks on quantitative finance, algorithmic trading, financial modeling, and investment strategy.

+3
+0.2%
1.3K
total stars
#206
openbabel/openbabel

Open Babel is a chemical toolbox for working with chemical data and cheminformatics.

+3
+0.2%
1.3K
total stars
#207
CliMA/Oceananigans.jl

A fast, flexible, ocean-flavored fluid dynamics library for climate and ocean modeling on CPUs and GPUs.

+3
+0.2%
1.3K
total stars
#208
cvxgrp/cvxportfolio

A Python library for portfolio optimization and back-testing in finance.

+3
+0.3%
1.2K
total stars
#209
Mrkuhuo/data-warehouse-learning

Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.

+3
+0.3%
1.1K
total stars
#210
google/leveldb

Fast key-value storage library for C++

+2
+0.0%
38.9K
total stars
#211
apache/flink

Apache Flink is a stream processing framework for real-time and batch data processing.

+2
+0.0%
25.8K
total stars
#212
arangodb/arangodb

ArangoDB is a multi-model database supporting documents, graphs, and key-values for high-performance applications.

+2
+0.0%
14.1K
total stars
#213
dask/dask

Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.

+2
+0.0%
13.8K
total stars
#214
wangzhiwubigdata/God-Of-BigData

A comprehensive collection of resources and learning materials for big data technologies like Flink, Spark, Hadoop, and Hive.

+2
+0.0%
10.4K
total stars
#215
stephencelis/SQLite.swift

A type-safe, Swift-language layer over SQLite3 for building database-backed Swift applications.

+2
+0.0%
10.1K
total stars
#216
orbitdb/orbitdb

OrbitDB is a peer-to-peer database for the decentralized web, enabling developers to build offline-first, distributed applications.

+2
+0.0%
8.7K
total stars
#217
delta-io/delta

An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.

+2
+0.0%
8.6K
total stars
#218
apache/beam

Apache Beam is a unified programming model for batch and streaming data processing.

+2
+0.0%
8.5K
total stars
#219
AlaSQL/alasql

AlaSQL is a JavaScript SQL database for browser and Node.js that handles both relational tables and nested JSON data.

+2
+0.0%
7.3K
total stars
#220
apache/couchdb

An open-source, scalable, and fault-tolerant NoSQL database with a focus on reliability and offline-first design.

+2
+0.0%
6.8K
total stars
#221
rhiever/Data-Analysis-and-Machine-Learning-Projects

A collection of data analysis and machine learning projects and resources for developers.

+2
+0.0%
6.6K
total stars
#222
apache/pinot

Apache Pinot is a realtime distributed OLAP datastore for fast querying of large datasets.

+2
+0.0%
6.0K
total stars
#223
apache/hive

Apache Hive is a data warehouse software built on top of Apache Hadoop for querying and managing large datasets.

+2
+0.0%
6.0K
total stars
#224
tonsky/datascript

Immutable database and Datalog query engine for Clojure, ClojureScript and JS

+2
+0.0%
5.7K
total stars
#225
ujjwalkarn/DataSciencePython

A Python library for common data analysis and machine learning tasks

+2
+0.0%
5.7K
total stars
#226
kakuilan/china_area_mysql

This is a MySQL library containing China's 5-level administrative regions, not a vibe coder tool.

+2
+0.0%
5.3K
total stars
#227
mgramin/awesome-db-tools

A curated list of awesome database tools and resources to make working with databases easier.

+2
+0.0%
5.0K
total stars
#228
biopython/biopython

Biopython is a set of Python modules that provide a wide range of functionality for bioinformatics, including DNA/RNA/protein sequence analysis, phylogenetics, and more.

+2
+0.0%
4.9K
total stars
#229
bukosabino/ta

Technical Analysis Library using Pandas and Numpy for financial data analysis and trading strategies.

+2
+0.0%
4.9K
total stars
#230
cmu-db/bustub

An educational relational database management system (RDBMS) implementation in C++.

+2
+0.0%
4.9K
total stars
#231
ydb-platform/ydb

An open-source distributed SQL database with high availability, scalability, and ACID transactions.

+2
+0.0%
4.7K
total stars
#232
has2k1/plotnine

A grammar of graphics library for creating highly customizable and publication-quality plots in Python.

+2
+0.0%
4.5K
total stars
#233
ravendb/ravendb

A highly scalable, distributed, document-oriented NoSQL database with full-text search, spatial, and time-series support.

+2
+0.1%
3.9K
total stars
#234
xtensor-stack/xtensor

A C++ library for multidimensional array operations with broadcasting and lazy computing.

+2
+0.1%
3.7K
total stars
#235
DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

+2
+0.1%
3.7K
total stars
#236
vlcn-io/cr-sqlite

A Rust library that provides multi-writer and CRDT support for SQLite databases.

+2
+0.1%
3.6K
total stars
#237
nutsdb/nutsdb

A simple, fast, and embeddable key-value store written in Go that supports transactions and data structures.

+2
+0.1%
3.6K
total stars
#238
antonycourtney/tad

A desktop application for viewing and analyzing tabular data, with support for CSV, Parquet, and DuckDB.

+2
+0.1%
3.4K
total stars
#239
gedeck/practical-statistics-for-data-scientists

This is a code repository for a book on practical statistics for data scientists, not a developer discovery platform.

+2
+0.1%
3.2K
total stars
#240
apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.

+2
+0.1%
3.2K
total stars
#241
alexkay/spek

An acoustic spectrum analyzer library written in C++ for audio analysis and visualization.

+2
+0.1%
3.2K
total stars
#242
caj2pdf/caj2pdf

A Python tool to convert CAJ (China Academic Journals) files to PDF for developers who work with academic literature.

+2
+0.1%
3.2K
total stars
#243
synthetichealth/synthea

Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.

+2
+0.1%
3.0K
total stars
#244
hosseinmoein/DataFrame

C++ DataFrame library for statistical, financial, and machine learning analysis.

+2
+0.1%
2.9K
total stars
#245
mourner/rbush

RBush is a high-performance JavaScript R-tree-based 2D spatial index for points and rectangles.

+2
+0.1%
2.7K
total stars
#246
dolthub/go-mysql-server

A MySQL-compatible relational database with a storage agnostic query engine, implemented in Go.

+2
+0.1%
2.6K
total stars
#247
aimeos/upscheme

A database migration and schema management tool for PHP developers, supporting multiple database engines.

+2
+0.1%
2.6K
total stars
#248
GanjinZero/awesome_Chinese_medical_NLP

A curated collection of open-source Chinese medical NLP resources including datasets, models, and more.

+2
+0.1%
2.5K
total stars
#249
eddwebster/football_analytics

A collection of football analytics projects, data, and analysis by Edd Webster (@eddwebster).

+2
+0.1%
2.5K
total stars
#250
hardikkamboj/An-Introduction-to-Statistical-Learning

This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.

+2
+0.1%
2.5K
total stars
1...46...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.