Category
Showing 651-700 of 897 trending projects
A collection of procedures for the Neo4j graph database, providing advanced graph algorithms and utilities.
A high-performance compression library written in C for developers working with large data sets.
A Python library for analyzing movement trajectory data using GeoPandas.
A collection of Jupyter Notebook files for data analysis using Python, including a Chinese translation of the popular 'Python for Data Analysis' book.
A fast spatial index library for 2D points and rectangles in JavaScript, useful for geospatial applications.
A fast and elegant data exploration library for Elixir, providing series and dataframes for data science workflows.
A C++ library for importing OpenStreetMap data into a PostgreSQL/PostGIS database.
A curated list of Google Earth Engine resources for geospatial analysis and remote sensing applications.
Overture Maps Data is a Python library providing access to open-source geographic data.
Ploomber is a fast and versatile tool for building and deploying data pipelines that can be used with a variety of AI and ML tools.
A Python library for creating data processing pipelines using functional programming principles.
Firebird is a relational database management system (RDBMS) suitable for a wide range of applications from desktop to client-server to large databases.
Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.
A full-featured file system for online data storage, built with Python.
NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.
Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.
The Go kernel for Jupyter notebooks and nteract, enabling data science and numerical computing in Go.
A collection of R packages for data science, including tools for data manipulation, visualization, and modeling.
MetPy is a Python library for reading, visualizing, and performing calculations with weather data.
A Python library with data related to Brazilian municipalities, including IBGE codes, latitude, longitude, and more.
An R package that provides customizable and presentation-ready data summary and analytic result tables.
This GitHub repository provides tutorials on effectively using the Pandas library for data analysis.
Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.
Converts MySQL database dumps to SQLite3 compatible formats for easier migration and data portability.
Powerful plotting and data visualization library for the Julia programming language.
A C# library that converts Excel spreadsheets to JSON objects and saves them to a text file.
Percona Server is an enhanced, open-source version of the MySQL database management system.
A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.
A Python package for time series classification, useful for developers working with time-series data.
A Python package for processing earth-observing satellite data with support for common data formats and tools.
A curated list of Twitter datasets and resources for data scientists and social network analysts.
This is a Python library for financial applications, not a tool for AI-powered vibe coders.
Python interface for the igraph library, a powerful tool for network analysis and visualization.
Core database component for the Realm Mobile Database SDKs, a popular NoSQL database for mobile apps.
A Python tool to parse Redis dump.rdb files, analyze memory usage, and export data to JSON.
Zui is a powerful desktop app for exploring and working with data, with support for CSV, JSON, and the Zed data format.
A time series library for Apache Spark that provides a high-level API for working with time series data.
Apache BookKeeper is a scalable, fault tolerant and low latency storage service optimized for append-only workloads.
A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.
An intuitive library to extract features from time series data for data science and machine learning.
This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.
Simple Python interface for Graphviz, a popular open-source data visualization tool.
LuxCore is a high-performance path-tracing render engine for realistic 3D graphics and visualization.
A color palette package in R inspired by works at the Metropolitan Museum of Art in New York.
Sequel is a Ruby library that provides a powerful and flexible object-relational mapping (ORM) for databases.
A Python library that generates fake data for custom test databases.
R kernel for the Jupyter notebook environment, enabling interactive R programming in Jupyter.
An offline IP database for developers to look up IP address geolocation information.
An educational project to build a disk-based key-value store in Python for learning purposes.
A fast C-based implementation of Dynamic Time Warping, a popular algorithm for comparing time series data.
Get weekly updates on trending AI coding tools and projects.