Category
Showing 151-200 of 897 trending projects
A curated list of data science interview questions and answers for developers.
Sequel is a Ruby library that provides a powerful and flexible object-relational mapping (ORM) for databases.
A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.
Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.
MySQL Connector/J is a JDBC driver that enables Java applications to connect to MySQL databases.
A Python library that helps ensure data quality and reliability through data profiling and testing.
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
A Python library that provides a simple and unified interface for extracting text from any document format.
A PostgreSQL sample database for testing and learning SQL queries.
Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.
The Data Transfer Project enables direct transfer of user data between online service providers.
A toolkit for SQLite databases, focused on application development with a Swift-based API.
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
A fast, scalable, and distributed database for transactional, analytical, and AI workloads.
A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.
A Python library for scraping soccer data from various sources for sports analytics and data science.
Citus is a distributed PostgreSQL database that enables scaling out your Postgres database across multiple nodes.
An in-depth tutorial covering mainstream database knowledge for backend developers.
Open-source graph database optimized for dynamic analytics and streaming data environments.
A comprehensive collection of data science cheatsheets for developers and data scientists.
Titan is a distributed graph database that can be used for building large-scale data-intensive applications.
Open source research data repository software built with Java.
FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.
A Python library that provides efficient, Pythonic data structures for sorted lists, dictionaries, and sets.
A free, open-source SQLite database manager for multiple platforms.
Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.
A Python tool to convert CAJ (China Academic Journals) files to PDF for developers who work with academic literature.
High-performance time-series database for IoT and IIoT
A C# library for reading and writing CSV files, with support for a wide range of CSV file formats.
Extremely fast, easy to use, and fully async NoSQL database for Flutter apps
An open-source multi-tool for exploring and publishing data, focused on simplifying data analysis and sharing.
A high-performance datastore for time series and tick data built on top of MongoDB.
Database manager for multiple database engines, runs as desktop or web app.
A Python library that provides a tour of the wonderland of math with visualizations and algorithms.
RBush is a high-performance JavaScript R-tree-based 2D spatial index for points and rectangles.
Reactive, local-first database for JavaScript apps with real-time sync and flexible storage
A Python library for quantitative trading and stock analysis.
A comprehensive list of learning materials to help developers understand database internals.
This repository contains efficient tools for LiDAR processing, focused on working with point cloud data.
A Python library for data migration and transformation in the Blaze project.
A cross-platform TUI database management tool written in Go for developers working with databases.
A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.
No description provided for this medical data repository.
Blazing-fast data wrangling toolkit for AI and data engineering workflows
A highly scalable, high-performance graph database that supports over 100 billion data points.
A Redis module that provides a time series data structure for storing and querying time series data.
A lightweight, fault-tolerant distributed database built on SQLite, designed for high availability.
An open-source, self-hosted database management tool with a spreadsheet-like interface for Postgres
DuckLake is an integrated data lake and catalog format written in C++.
Get weekly updates on trending AI coding tools and projects.