Category
Showing 551-600 of 897 trending projects
Realm is a mobile database that serves as a replacement for SQLite and ORMs.
A PHP database abstraction layer that provides a simple, consistent API for interacting with different database systems.
A collection of code examples and baselines for common data science and machine learning competitions.
The Go kernel for Jupyter notebooks and nteract, enabling data science and numerical computing in Go.
The Data Transfer Project enables direct transfer of user data between online service providers.
Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.
This is Facebook's branch of the Oracle MySQL database, including the MyRocks storage engine.
This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.
Open source time series library for Python, useful for statistical analysis and modeling.
Apache BookKeeper is a scalable, fault tolerant and low latency storage service optimized for append-only workloads.
A Python package for time series classification, useful for developers working with time-series data.
SchemaCrawler is a free database schema discovery and comprehension tool that supports various database management systems.
A high-performance, highly available, and distributed time series database written in Rust.
Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.
Curated list of Python software and packages for scientific research in audio
Performant probabilistic data structures for processing continuous, unbounded streams in Go.
Cartopy is a Python library for creating maps and visualizing spatial data with matplotlib support.
A JavaScript library for efficient querying and transformation of array-backed data tables.
A curated collection of resources related to image registration, including books, papers, videos, and toolboxes.
A C# library for reading and writing metadata in media files, useful for audio and video processing applications.
A Python library for analyzing movement trajectory data using GeoPandas.
A Python library for extracting, transforming, and loading tabular data.
Index your Gmail account to a SQLite DB and perform custom data analysis on your email.
NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.
Open-source massively parallel processing (MPP) database, an alternative to Greenplum.
A time series forecasting library for R, providing a wide range of models and tools for accurate predictions.
This is a Python library focused on basketball analytics and data processing.
A tutorial for using the popular Python data analysis library Pandas, presented at PyCon 2015.
Intake is a lightweight Python package for discovering, investigating, loading and distributing data.
Cloud-native genomic dataframes and batch computing for bioinformatics and genetics research.
This repository contains efficient tools for LiDAR processing, focused on working with point cloud data.
A Python library for implementing the Louvain community detection algorithm on graphs.
This Python library provides additional linear models for statistical modeling and analysis.
A next-generation curated knowledge sharing platform for data scientists and other technical professionals.
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
A parallel processing library for Pandas that improves performance on multi-core CPUs.
A curated list of tools and datasets for anomaly detection on time-series data.
A fast C++ library for high-performance matrix and vector operations.
A repository for the 100 Knocks of Data Science Preprocessing, focused on structured data processing.
A Python library for creating beautiful visualizations of language differences across document types.
A unified interface for distributed computing on Spark, Dask and Ray without any rewrites.
This is a Python library for financial applications, not a tool for AI-powered vibe coders.
This repository provides comprehensive tutorials and resources for learning data science and machine learning using Python.
A C# in-memory document database with source generator-based embedded typed readonly data.
A Python module for extracting and mapping Chinese province, city, and district data.
Graph and network visualization library for R developers working with tabular data
A .NET Standard library that provides strongly typed exceptions for Entity Framework Core across multiple database providers.
Dozer is a real-time data movement tool that leverages CDC to move data between various sources and sinks.
An offline IP database for developers to look up IP address geolocation information.
A Python library for cleaning and transforming data, inspired by the R package Janitor.
Get weekly updates on trending AI coding tools and projects.