Category
Showing 201-250 of 897 trending projects
MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.
This GitHub repository provides a collection of Bible versions and cross-reference databases, but it does not appear to be related to the given developer discovery platform focused on vibe coders.
LibRaw is a C++ library for reading RAW image files from digital cameras.
A parallel corpus of classical Chinese and modern Chinese texts for language processing and analysis.
Educational notebooks on quantitative finance, algorithmic trading, financial modeling, and investment strategy.
Open Babel is a chemical toolbox for working with chemical data and cheminformatics.
A fast, flexible, ocean-flavored fluid dynamics library for climate and ocean modeling on CPUs and GPUs.
A Python library for portfolio optimization and back-testing in finance.
Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.
Apache Flink is a stream processing framework for real-time and batch data processing.
ArangoDB is a multi-model database supporting documents, graphs, and key-values for high-performance applications.
Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.
A comprehensive collection of resources and learning materials for big data technologies like Flink, Spark, Hadoop, and Hive.
A type-safe, Swift-language layer over SQLite3 for building database-backed Swift applications.
OrbitDB is a peer-to-peer database for the decentralized web, enabling developers to build offline-first, distributed applications.
An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.
Apache Beam is a unified programming model for batch and streaming data processing.
AlaSQL is a JavaScript SQL database for browser and Node.js that handles both relational tables and nested JSON data.
An open-source, scalable, and fault-tolerant NoSQL database with a focus on reliability and offline-first design.
A collection of data analysis and machine learning projects and resources for developers.
Apache Pinot is a realtime distributed OLAP datastore for fast querying of large datasets.
Apache Hive is a data warehouse software built on top of Apache Hadoop for querying and managing large datasets.
Immutable database and Datalog query engine for Clojure, ClojureScript and JS
A Python library for common data analysis and machine learning tasks
This is a MySQL library containing China's 5-level administrative regions, not a vibe coder tool.
A curated list of awesome database tools and resources to make working with databases easier.
Biopython is a set of Python modules that provide a wide range of functionality for bioinformatics, including DNA/RNA/protein sequence analysis, phylogenetics, and more.
Technical Analysis Library using Pandas and Numpy for financial data analysis and trading strategies.
An educational relational database management system (RDBMS) implementation in C++.
An open-source distributed SQL database with high availability, scalability, and ACID transactions.
A grammar of graphics library for creating highly customizable and publication-quality plots in Python.
A highly scalable, distributed, document-oriented NoSQL database with full-text search, spatial, and time-series support.
A C++ library for multidimensional array operations with broadcasting and lazy computing.
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
A Rust library that provides multi-writer and CRDT support for SQLite databases.
A simple, fast, and embeddable key-value store written in Go that supports transactions and data structures.
A desktop application for viewing and analyzing tabular data, with support for CSV, Parquet, and DuckDB.
This is a code repository for a book on practical statistics for data scientists, not a developer discovery platform.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.
An acoustic spectrum analyzer library written in C++ for audio analysis and visualization.
A Python tool to convert CAJ (China Academic Journals) files to PDF for developers who work with academic literature.
Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.
C++ DataFrame library for statistical, financial, and machine learning analysis.
RBush is a high-performance JavaScript R-tree-based 2D spatial index for points and rectangles.
A MySQL-compatible relational database with a storage agnostic query engine, implemented in Go.
A database migration and schema management tool for PHP developers, supporting multiple database engines.
A curated collection of open-source Chinese medical NLP resources including datasets, models, and more.
A collection of football analytics projects, data, and analysis by Edd Webster (@eddwebster).
This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.
Get weekly updates on trending AI coding tools and projects.