Data & Databases

ORMs, query builders, databases, and data pipelines

tikv/tikv

Distributed transactional key-value database, originally created to complement TiDB

16.6K
Active
Rust
Databases
#cncf #consensus #distributed-transactions

argoproj/argo-workflows

Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.

16.5K
Active
Go
ETL & Pipelines
Kubernetes
#kubernetes #pipelines #workflow

tursodatabase/libsql

libSQL is an open-source, open-contribution fork of SQLite, a widely used embedded database.

16.4K
Stable
C
Databases
#database #embedded-database #sqlite

prisma/prisma1

Prisma1 is a database toolkit with an ORM, migrations, and admin UI for Postgres, MySQL, and MongoDB.

16.4K
Archived
Scala
ORMs & Query Builders
GraphQL
#database #orm #migrations

ZhuLinsen/daily_stock_analysis

An LLM-driven stock analysis platform with real-time data, news, and decision-making dashboards.

16.2K
Active
Python
LLM Frameworks
API Frameworks
Python
#agent #quantitative-trading #stock-analysis

FavioVazquez/ds-cheatsheets

A comprehensive collection of data science cheatsheets for developers and data scientists.

16.2K
Archived
Data Science
#datascience #cheatsheet #python

apple/foundationdb

FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.

16.2K
Active
C++
Databases
#acid #distributed-database #key-value-store

dgraph-io/badger

Fast, embeddable key-value database written in Go for building high-performance storage applications.

15.5K
Active
Go
Databases
#database #key-value #ssd

treeverse/dvc

DVC is a data versioning and ML experiment tracking tool that helps developers manage changes to data and models.

15.4K
Active
Python
ETL & Pipelines
Python
#data-versioning #machine-learning #reproducibility

scylladb/scylladb

A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.

15.4K
Active
C++
Databases
#nosql #cassandra #database

apache/doris

Apache Doris is a high-performance, unified analytics database for real-time data processing.

15.1K
Active
Java
Databases
Spark
#database #olap #real-time

dagster-io/dagster

An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.

15.1K
Active
Python
ETL & Pipelines
Python
#data-engineering #data-orchestration #workflow-automation

zhisheng17/flink-learning

A comprehensive learning resource for the Flink stream processing framework, covering concepts, principles, and real-world use cases.

15.1K
Experimental
Java
Databases
#stream-processing #flink #kafka

cayleygraph/cayley

An open-source graph database written in Go, useful for building applications that require linked data and graph-based queries.

15.0K
Stable
Go
Databases
#graph-database #linked-data #go

andkret/Cookbook

A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.

15.0K
Active
Python
ETL & Pipelines
Python
#data-engineering #etl #pipeline

waditu/tushare

A Python library for crawling historical data on Chinese stocks.

14.5K
Archived
Python
Databases
Python
#finance #fintech #stock-data

scipy/scipy

SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.

14.5K
Active
Python
Databases
Python
#scientific-computing #algorithms #data-analysis

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

14.3K
Stable
Databases
#big-data #data-analytics #data-science

arangodb/arangodb

ArangoDB is a multi-model database supporting documents, graphs, and key-values for high-performance applications.

14.1K
Active
C++
Databases
#database #multi-model #nosql

dexie/Dexie.js

Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.

14.1K
Active
TypeScript
Databases
React
#indexeddb #offline-storage #database