Data & Databases

ORMs, query builders, databases, and data pipelines

Showing 41-60 of 5,250 projects

facebook/rocksdb

Embeddable, persistent key-value store for fast storage with LSM design

31.6K
Active
C++
Databases
#key-value-store#rocksdb#c-plus-plus

numpy/numpy

Fundamental package for scientific computing with Python

31.6K
Active
Python
ETL & Pipelines
Python
#numpy#scientific-computing#python-library

surrealdb/surrealdb

A scalable, distributed, collaborative document-graph database for the realtime web

31.4K
Active
Rust
Cloud-database
Document-database
Rust
#database#cloud#document-graph

influxdata/influxdb

Time-series database for metrics & analytics

31.4K
Active
Rust
Databases
#database#time-series#metrics

microsoft/graphrag

GraphRAG is a modular system for enhancing LLM outputs using knowledge graphs from unstructured text.

31.2K
Active
Python
RAG & Vector
RAG Frameworks
Python
#graphrag#llm#rag

seaweedfs/seaweedfs

Distributed storage system for blobs, files, and data lakes

30.8K
Active
Go
Containerization
Databases
#distributed-storage#blob-storage#cloud-drive

sequelize/sequelize

ORM for Node.js/TypeScript with multiple database support

30.3K
Active
TypeScript
ORMs & Query Builders
Node.js
#orm#nodejs#typescript

dragonflydb/dragonfly

Modern in-memory key-value store for caching and data management

30.1K
Active
C++
Caching
Databases
#in-memory#key-value#cache

alibaba/canal

MySQL binlog incremental subscription and consumption component

29.6K
Active
Java
ETL & Pipelines
#mysql#binlog#data-synchronization

qdrant/qdrant

Vector database for AI applications

29.3K
Active
Rust
Vector Databases
RAG & Vector
#vector-database#ai-search#embeddings-similarity

HKUDS/LightRAG

LightRAG is a fast and simple Retrieval-Augmented Generation (RAG) framework for efficient knowledge retrieval and generation.

29.0K
Active
Python
RAG & Vector
RAG Frameworks
Python
#RAG#retrieval-augmented-generation#knowledge-graph

CSSEGISandData/COVID-19

Real-time global and U.S. data tracking for developers and researchers.

29.0K
Archived
ETL & Pipelines
Admin Dashboards
#covid-19#data-tracking#jhu-csse

donnemartin/data-science-ipython-notebooks

Data science Python notebooks covering deep learning, machine learning, big data, and more.

28.9K
Archived
Python
Computer Vision
ML Ops
TensorFlow
#data-science#deep-learning#machine-learning

academic/awesome-datascience

Comprehensive Data Science learning and resource guide

28.5K
Active
Tutorials & Courses
ML Ops
#data-science#machine-learning#deep-learning

getredash/redash

Redash enables data-driven decisions by connecting to data sources and creating visualizations and dashboards.

28.3K
Active
Python
Analytics & Tracking
Search
Python
#analytics#dashboard#data-visualization

alibaba/druid

Druid is a high-performance database connection pool for Java applications, designed for monitoring and management.

28.2K
Active
Java
Caching
Background Jobs
Java
#database#connection pool#Java

mongodb/mongo

MongoDB database server and tools

28.2K
Active
C++
Databases
#mongodb#nosql#database

Automattic/mongoose

Mongoose is a MongoDB object modeling tool for Node.js and Deno, simplifying database interactions with schemas and models.

27.5K
Active
JavaScript
ORMs & Query Builders
Node.js
#mongodb#orm#nodejs

rethinkdb/rethinkdb

Realtime NoSQL database for web apps

27.0K
Stable
C++
Databases
#nosql#realtime#database

PostgREST/postgrest

PostgREST provides a REST API for any PostgreSQL database, enabling fast and standards-compliant API generation.

26.6K
Active
Haskell
API Documentation
Databases
#rest-api#postgresql#haskell
124...263

Stay in the loop

Get weekly updates on trending AI coding tools and projects.