Distributed gradient boosting library for fast and accurate data science solutions.
Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.
A high-performance GPU DataFrame library for data analysis and machine learning workloads.
A unified framework for large-scale data computation that scales popular Python data tools like NumPy, Pandas, and Scikit-Learn.
A unified interface for distributed computing that runs the same code on Spark, Dask, and Ray without rewrites.
An interactive tutorial for the Dask distributed computing library, focused on data analysis and manipulation.
A distributed task scheduler for Dask, a popular Python library for parallel and distributed computing.
Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.
Agile data preparation workflows made easy with popular Python data science libraries.
A high-level plotting library for data visualization in Python, built on top of HoloViews.
A scalable machine learning library for time series forecasting in Python.
A Python package for processing earth-observing satellite data with support for common data formats and tools.
Eliot is a Python logging library that provides detailed causality analysis and tracing for complex distributed systems.