Data & Databases

ORMs, query builders, databases, and data pipelines

Showing 1341-1360 of 5,250 projects

ClickHouse/clickhouse-go

A Go driver for the ClickHouse analytics database, enabling fast and efficient data processing.

3.3K
Active
Go
Databases
#clickhouse#analytics#database

waymo-research/waymo-open-dataset

Waymo Open Dataset is a large-scale dataset for autonomous driving research and development.

3.3K
Active
Python
Computer Vision
Datasets
Python
#autonomous-driving#dataset#computer-vision

WeBankFinTech/DataSphereStudio

DataSphereStudio is a one-stop data application development and management portal covering data exchange, analysis, and visualization.

3.3K
Stable
Java
ETL & Pipelines
API Frameworks
Spark
#data-management#data-analysis#data-visualization

OSU-NLP-Group/HippoRAG

HippoRAG is a novel RAG framework that enables LLMs to continuously integrate knowledge across external documents.

3.3K
Stable
Python
LLM Frameworks
RAG & Vector
Python
#llm#knowledge-integration#knowledge-graphs

linq2db/linq2db

Linq to database provider for .NET, supporting various database engines.

3.2K
Active
C#
ORMs & Query Builders
#database#orm#linq

rguo12/awesome-causality-algorithms

An index of algorithms for learning causality with data, useful for vibe coders working on AI-powered applications.

3.2K
Archived
ML Ops
API Frameworks
#causality#causal-inference#recommender-system

adelsz/pgtyped

pgTyped provides type-safe SQL queries in TypeScript, helping developers write more reliable and maintainable database-driven applications.

3.2K
Active
TypeScript
API Clients & Testing
ORMs & Query Builders
TypeScript
#type-safe#postgresql#sql

homenc/HElib

HElib is an open-source C++ library for homomorphic encryption, supporting BGV and CKKS schemes.

3.2K
Archived
C++
Privacy Tools
Encryption
#cryptography#encryption#privacy-enhancing-technologies

mljar/mljar-supervised

An AutoML Python package for tabular data with feature engineering, hyperparameter tuning, and automatic documentation.

3.2K
Experimental
Python
ML Ops
Databases
Python
#automated-machine-learning#automl#tabular-data

spring-projects/spring-data-jpa

Simplifies the development of creating a JPA-based data access layer in Java Spring applications.

3.2K
Active
Java
API Frameworks
ORMs & Query Builders
Spring
#java#jpa#spring

codingo/NoSQLMap

An automated NoSQL database enumeration and web application exploitation tool for security researchers.

3.2K
Stable
Python
Security Research
API Frameworks
Python
#nosql#security-tool#penetration-testing

facebookresearch/MUSE

A library for training multilingual word embeddings, useful for NLP tasks across languages.

3.2K
Archived
Python
LLM Frameworks
Vector Databases
Python
#nlp#embeddings#multilingual

gedeck/practical-statistics-for-data-scientists

This is a code repository for a book on practical statistics for data scientists, not a developer discovery platform.

3.2K
Stable
Jupyter Notebook
Data Analysis & Visualization
#statistics#data-science#jupyter-notebook

apache/avro

Apache Avro is a data serialization system for efficient storage and transmission of structured data.

3.2K
Active
Java
Databases
API Clients & Testing
#data-serialization#serialization-framework#big-data

spatie/laravel-analytics

A Laravel package to retrieve pageviews and other data from Google Analytics

3.2K
Active
PHP
API Frameworks
Analytics
Laravel
#analytics#google#statistics

symfony/doctrine-bridge

Provides integration for Doctrine with various Symfony components, enabling efficient database management.

3.2K
Active
PHP
API Frameworks
ORMs & Query Builders
Symfony
#symfony#orm#database

storj/storj

Ongoing development of Storj v3, a decentralized cloud object storage that is affordable, easy to use, private, and secure.

3.2K
Active
Go
File Storage
API Frameworks
#distributed#object-storage#open-source

mongodb/mongo-csharp-driver

The official C# .NET driver for MongoDB, allowing developers to interact with MongoDB databases from C# applications.

3.2K
Active
C#
API Clients & Testing
Databases
#csharp#nosql#database

lakesoul-io/LakeSoul

LakeSoul is a cloud-native, real-time Lakehouse framework for fast data ingestion and analytics on cloud storage.

3.2K
Active
Java
API Frameworks
Databases
#big-data#lakehouse#streaming

magenta/ddsp

A differentiable digital signal processing library for building AI-powered audio applications and models.

3.2K
Active
Python
LLM Frameworks
API Clients & Testing
Python
#audio#signal-processing#differentiable
1...6769...263

Stay in the loop

Get weekly updates on trending AI coding tools and projects.