Explore Projects

Discover 50 open source projects

Active filters (1):
Search: dataframeร—
Clear all

Showing 1-20 of 50 projects

pola-rs/polars

Fast DataFrame query engine in Rust with Python/Rust/Node.js/R frontends

37.6K
Active
Rust
ETL & Pipelines
CLI Tools
Rust
#dataframe#rust#arrow

Kanaries/pygwalker

Interactive UI for visual analysis of dataframes

15.7K
Stable
Python
React
#data-analysis#visualization#interactive-ui

Data-Centric-AI-Community/ydata-profiling

A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.

13.4K
Active
Python
Data Profiling
Python
#data-profiling#exploratory-data-analysis#data-quality

modin-project/modin

Modin: Scalable Pandas workflows with a single line of code change, enabling distributed data processing.

10.4K
Stable
Python
Databases
Python
#pandas#data-analysis#data-processing

rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

9.5K
Active
C++
Databases
Python
#data-analysis#data-science#gpu

vaexio/vaex

A high-performance Python library for working with large tabular datasets, offering efficient data manipulation and visualization.

8.5K
Stable
Python
Databases
Caching
Python
#bigdata#data-science#dataframe

apache/datafusion

Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.

8.5K
Active
Rust
Databases
ETL & Pipelines
#big-data#dataframe#olap

evidentlyai/evidently

Evidently is an open-source ML and LLM observability framework to evaluate, test, and monitor AI-powered systems.

7.3K
Active
Jupyter Notebook
MLOps
Data Validation
Jupyter Notebook
#data-quality#data-validation#model-monitoring

codebasics/py

This is a repository with sample Python programs for learning Python, covering topics like NumPy, Pandas, and Jupyter Notebooks.

7.3K
Experimental
Jupyter Notebook
Tutorials & Courses
Backend Frameworks
Jupyter
#python#jupyter-notebook#numpy

ibis-project/ibis

Portable Python dataframe library for data analysis and manipulation

6.4K
Active
Python
React
#dataframe#python#analysis

haifengl/smile

Smile is a comprehensive statistical machine learning and data science library for Java developers.

6.3K
Active
Java
ML Ops
Databases
#machine-learning#data-science#statistics

lux-org/lux

Automatically visualize your pandas dataframes with a single print command, enabling quick EDA.

5.4K
Archived
Python
Data Visualization
CLI Tools
Python
#data-science#exploratory-data-analysis#pandas

lk-geimfari/mimesis

Mimesis is a fast Python library for generating fake data in multiple languages for testing and development purposes.

4.8K
Active
Python
Databases
Testing
#data-generation#fake-data#testing

deepchecks/deepchecks

Deepchecks is an open-source solution for thorough testing of ML models and data from research to production.

4.0K
Stable
Python
ML Ops
Data Validation
Python
#machine-learning#data-validation#model-monitoring

jtablesaw/tablesaw

A high-performance Java library for data analysis, visualization, and machine learning.

3.7K
Experimental
Java
Databases
Data Visualization
#data-analysis#data-visualization#machine-learning

databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

3.4K
Archived
Python
ORMs & Query Builders
Databases
Spark
#big-data#data-science#dataframe

sngyai/Sequoia

A Python library for algorithmic trading on the Chinese A-shares stock market, with various technical analysis features.

3.2K
Archived
Python
API Frameworks
ORMs & Query Builders
Python
#a-shares#algorithmic-trading#technical-analysis

pydata/pandas-datareader

A Python library for extracting data from a wide range of internet sources into a pandas DataFrame.

3.2K
Experimental
Python
Databases
ETL & Pipelines
Python
#data-analysis#data-extraction#pandas

delta-io/delta-rs

A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.

3.2K
Active
Rust
ETL & Pipelines
API Frameworks
#delta-lake#etl#data-engineering

quantopian/qgrid

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

3.1K
Archived
Python
Databases
IDE Extensions
Python
#data-visualization#data-manipulation#jupyter-notebook

Stay in the loop

Get weekly updates on trending AI coding tools and projects.