Data & Databases

ORMs, query builders, databases, and data pipelines

Showing 21-40 of 5,250 projects

DataExpert-io/data-engineer-handbook

Comprehensive data engineering resource hub with learning paths, books, communities, and tools

40.4K
Stable
Jupyter Notebook
Tutorials & Courses
Awesome Lists
Apache Airflow
#dataengineering#bigdata#apachespark

pingcap/tidb

Cloud-native distributed SQL database for modern applications

39.9K
Active
Go
Databases
Go
#cloud-native#distributed-database#mysql-compatibility

go-gorm/gorm

GORM is a developer-friendly ORM library for Golang, offering features like associations, hooks, and auto migrations.

39.7K
Active
Go
ORMs & Query Builders
GORM
#go#orm#golang

facebookresearch/faiss

Faiss is a library for efficient similarity search and clustering of dense vectors, supporting CPU and GPU acceleration.

39.3K
Active
C++
RAG & Vector
Vector Databases
#similarity-search#vector-clustering#dense-vectors

QuivrHQ/quivr

Opiniated RAG for GenAI integration in apps

39.0K
Experimental
Python
RAG & Vector
Vector Databases
Python
#ai#rag#vector-database

DataTalksClub/data-engineering-zoomcamp

Free 9-week data engineering course with hands-on modules on pipelines, dbt, Kafka, and Spark

38.9K
Active
Jupyter Notebook
Tutorials & Courses
ETL & Pipelines
dbt
#data-engineering#course#dbt

google/leveldb

Fast key-value storage library for C++

38.9K
Archived
C++
Databases
C++
#key-value#storage#C++

mindsdb/mindsdb

Federated query engine for AI with built-in MCP server

38.6K
Active
Python
MCP Servers
Agents & Orchestration
Python
#ai#mcp#agents

microsoft/qlib

AI-powered quantitative investment platform for finance and trading

38.2K
Active
Python
Inference
SaaS Boilerplates
Python
#quantitative-investment#algorithmic-trading#machine-learning

pola-rs/polars

Fast DataFrame query engine in Rust with Python/Rust/Node.js/R frontends

37.6K
Active
Rust
ETL & Pipelines
CLI Tools
Rust
#dataframe#rust#arrow

drawdb-io/drawdb

Database diagram editor and SQL generator

36.8K
Active
JavaScript
ETL & Pipelines
Charts & Visualization
JavaScript
#database-diagram#sql-generator#erd-editor

duckdb/duckdb

High-performance analytical in-process SQL database for developers

36.5K
Active
C++
Databases
#sql#database#analytics

typeorm/typeorm

ORM for TypeScript and JavaScript with support for multiple databases and platforms.

36.4K
Active
TypeScript
ORMs & Query Builders
TypeScript
#typeorm#orm#typescript

SheetJS/sheetjs

SheetJS Spreadsheet Data Toolkit for data extraction and spreadsheet generation.

36.2K
Archived
ETL & Pipelines
General Utilities
#spreadsheet#data-extraction#csv

qishibo/AnotherRedisDesktopManager

Redis desktop manager with GUI for managing Redis databases on Linux, Windows, Mac

34.0K
Stable
JavaScript
Caching
#redis#redis-client#redis-cluster

drizzle-team/drizzle-orm

TypeScript ORM for Node.js, Bun, Deno, and serverless environments

33.1K
Active
TypeScript
ORMs & Query Builders
CLI Tools
Node.js
#orm#typescript#nodejs

vercel/swr

React Hooks for efficient data fetching with caching and real-time updates

32.3K
Active
TypeScript
Component Libraries (React)
Frontend Frameworks
React
#data-fetching#react-hooks#caching

apache/kafka

Distributed event streaming platform for data pipelines and real-time apps

32.1K
Active
Java
ETL & Pipelines
Realtime
Java
#kafka#event-streaming#data-pipelines

cockroachdb/cockroach

Distributed SQL database for cloud-native apps

32.0K
Active
Go
Databases
Go
#cockroachdb#distributed-database#sql

hasura/graphql-engine

Hasura GraphQL Engine provides secure, real-time GraphQL APIs for data sources with access control and event triggers.

31.9K
Active
TypeScript
GraphQL
Realtime
TypeScript
#graphql#realtime-api#access-control
13...263

Stay in the loop

Get weekly updates on trending AI coding tools and projects.