Explore Projects

Discover 8 open source projects

Active filters (1):
Search: pydataร—
Clear all

Showing 1-8 of 8 projects

wesm/pydata-book

Companion repo for Python for Data Analysis book with Jupyter notebooks and data science examples

24.4K
Stable
Jupyter Notebook
Books & Guides
#data-science#jupyter-notebooks#python

dask/dask

Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.

13.8K
Active
Python
Databases
Python
#parallel-computing#distributed-data-processing#data-analysis

rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

9.5K
Active
C++
Databases
Python
#data-analysis#data-science#gpu

BrambleXu/pydata-notebook

A collection of Jupyter Notebook files for data analysis using Python, including a Chinese translation of the popular 'Python for Data Analysis' book.

4.7K
Archived
Jupyter Notebook
Jupyter Notebook
Tutorials & Courses
Jupyter
#data-analysis#python#jupyter-notebook

databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

3.4K
Archived
Python
ORMs & Query Builders
Databases
Spark
#big-data#data-science#dataframe

pydata/pandas-datareader

A Python library for extracting data from a wide range of internet sources into a pandas DataFrame.

3.2K
Experimental
Python
Databases
ETL & Pipelines
Python
#data-analysis#data-extraction#pandas

dask/distributed

A distributed task scheduler for Dask, a popular Python library for parallel and distributed computing.

1.7K
Active
Python
API Frameworks
Databases
Python
#distributed-computing#parallel-processing#task-scheduling

pyjanitor-devs/pyjanitor

A Python library for cleaning and transforming data, inspired by the R package Janitor.

1.5K
Active
Python
ETL & Pipelines
CLI Tools
#cleaning-data#data-transformation#pandas-extension

Stay in the loop

Get weekly updates on trending AI coding tools and projects.