Powerful plotting and data visualization library for the Julia programming language.
Zui is a powerful desktop app for exploring and working with data, with support for CSV, JSON, and the Zed data format.
A Java connector for integrating MongoDB with Hadoop ecosystems for big data processing.
Python interface for the igraph library, a powerful tool for network analysis and visualization.
A Python library with data related to Brazilian municipalities, including IBGE codes, latitude, longitude, and more.
Overture Maps Data is a Python library providing access to open-source geographic data.
GridDB is a fast and scalable open-source database for time-series IoT and big data applications.
AI-native database unifying vector, text, and structured data for hybrid search and in-database AI workflows.
Open-source BI platform for engineers to explore and model large-scale data pipelines.
Simple Python interface for Graphviz, a popular open-source graph visualization tool.
SchemaCrawler is a free database schema discovery and comprehension tool that supports various database management systems.
A collection of PySpark examples covering RDD, DataFrame, and Dataset operations in Python.
An R package that provides customizable and presentation-ready data summary and analytic result tables.
A Python package for processing earth-observing satellite data with support for common data formats and tools.
Documentation for the popular .NET ORM Entity Framework Core and Entity Framework 6.
A C# library for reading and writing metadata in media files, useful for audio and video processing applications.
An R package that provides support for simple features, a standardized way to encode spatial vector data.
Percona Server is an enhanced, open-source version of the MySQL database management system.
NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.
A C# library that converts Excel spreadsheets to JSON objects and saves them to a text file.
A tool for comparing and evaluating time series databases.
Firebird is a relational database management system (RDBMS) suitable for a wide range of applications from desktop to client-server to large databases.
A Python toolbox for seismology and seismological observatories, providing tools for data processing and analysis.
A Python library for reading, manipulating, and writing data in various spreadsheet file formats.
Trill is a single-node query processor for temporal or streaming data.
A high-performance logical replication extension for PostgreSQL that enables fast, cross-version database replication.
Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.
A Ruby library that makes it easy to group temporal data such as time series.
A Python library for technical analysis indicators, with Chinese translation and documentation.
An educational project to build a disk-based key-value store in Python for learning purposes.
Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.
A Python package for time series classification.
A comprehensive resource for developers to learn and get started with data engineering using Python.
esProc SPL is a JVM-based programming language for structured data computation, serving as both a data analysis tool and an embedded computing engine.
An open-source threat hunting platform built on the ELK stack for security researchers and analysts.
This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.
A high-performance, MySQL-compatible vector database that supports structured and unstructured data for AI-driven applications.
ggplot2 is a powerful data visualization library for R that provides elegant and flexible graphics.
A Python library for cleaning and transforming data, inspired by the R package Janitor.
Synth is a Rust library for generating realistic, randomized test data for applications and databases.
MetPy is a Python library for reading, visualizing, and performing calculations with weather data.
Fiona is a Python library for reading and writing geographic data files, with support for CLI usage.
LevelDB key/value database in Go for building high-performance data-intensive applications.
A collection of R packages for data science, including tools for data manipulation, visualization, and modeling.
LuxCore is a high-performance path-tracing render engine for realistic 3D graphics and visualization.
An astronomy visualization project that maps the orbits of asteroids in the solar system.
Ploomber is a fast and versatile tool for building and deploying data pipelines that can be used with a variety of AI and ML tools.
This Scala library provides a high-performance implementation of the node2vec algorithm for embedding graphs.
A curated list of community detection research papers with implementations for data science and network analysis.
A repository of study materials and resources for data analysis and related fields.