Showing 1-8 of 8 projects
Companion repo for Python for Data Analysis book with Jupyter notebooks and data science examples
Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.
A high-performance GPU DataFrame library for data analysis and machine learning workloads.
A collection of Jupyter Notebook files for data analysis using Python, including a Chinese translation of the popular 'Python for Data Analysis' book.
Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.
A Python library for extracting data from a wide range of internet sources into a pandas DataFrame.
A distributed task scheduler for Dask, a popular Python library for parallel and distributed computing.
A Python library for cleaning and transforming data, inspired by the R package Janitor.
Get weekly updates on trending AI coding tools and projects.