Explore Projects

Discover 111 open source projects

Active filters (1):
Search: pandas

Showing 81-100 of 111 projects

lotus-data/lotus

A Python library that uses LLMs and embeddings to process datasets, with speedups of up to 1000x.

1.6K
Active
Python
LLM Frameworks
ETL & Pipelines
Python
#ai-data-processing #llm #semantic-search

narwhals-dev/narwhals

Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.

1.5K
Active
Python
Databases
CLI Tools
Python
#dataframes #compatibility #pandas

capitalone/DataProfiler

A Python library for extracting schema, statistics, and entities from datasets, useful for data profiling and privacy analysis.

1.5K
Stable
Python
ETL & Pipelines
CLI Tools
Python
#data-profiling #data-analysis #privacy

hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

1.5K
Archived
Python
ETL & Pipelines
API Frameworks
#big-data-cleaning #data-analysis #data-cleaning

pyjanitor-devs/pyjanitor

A Python library for cleaning and transforming data, inspired by the R package janitor.

1.5K
Active
Python
ETL & Pipelines
CLI Tools
#cleaning-data #data-transformation #pandas-extension

SciSharp/NumSharp

High-performance N-dimensional tensor computation library for .NET, similar to NumPy for Python.

1.5K
Stable
C#
ML Ops
Databases
#machine-learning #numerical-computing #tensor-operations

MaxHalford/prince

A Python library for performing multivariate exploratory data analysis, including techniques like PCA, CA, MCA, MFA, and FAMD.

1.4K
Active
Python
ORMs & Query Builders
Data Visualization
Python
#exploratory-data-analysis #dimensionality-reduction #principal-component-analysis

data-forge/data-forge-ts

A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.

1.4K
Stable
TypeScript
Data Transformation & Analysis
Frontend Frameworks
React
#data-transformation #data-analysis #data-manipulation

jupyter-incubator/sparkmagic

Provides Jupyter magics and kernels for working with remote Spark clusters, enabling data scientists to easily interact with Spark from Jupyter Notebooks.

1.4K
Stable
Python
API Frameworks
Databases
Jupyter
#spark #jupyter-notebook #pyspark

yhat/pandasql

pandasql is a Python library that allows developers to use SQL syntax to query Pandas DataFrames.

1.3K
Archived
Python
ORMs & Query Builders
CLI Tools
Python
#sql #pandas #dataframe

holoviz/hvplot

A high-level plotting library for data visualization in Python, built on top of HoloViews.

1.3K
Active
Python
Charts & Visualization
Python
#data-visualization #plotting #pandas

rocketlaunchr/dataframe-go

A data science and machine learning library for Go, providing DataFrame functionality similar to Python's Pandas.

1.3K
Archived
Go
Databases
ML Ops
Go
#data-science #dataframe #machine-learning

wq/django-rest-pandas

Serves up Pandas dataframes via the Django REST Framework for use in client-side visualizations and offline analysis.

1.3K
Experimental
Python
Charts & Visualization
API Frameworks
Django
#chart #csv #dataviz

rsvp/fecon235

Notebooks for financial economics, including analyses of Federal Reserve, GDP, inflation, and more.

1.3K
Archived
Jupyter Notebook
Databases
ETL & Pipelines
Jupyter Notebook
#finance #economics #federal-reserve

JoinQuant/jqdatasdk

A Python package for easy access to financial market data in China for quantitative finance and FinTech applications.

1.2K
Active
Python
Databases
Financial Data
Python
#financial-data #stock-data #stock-market-data

lit26/finvizfinance

A Python library for financial analysis and data scraping from the Finviz platform.

1.2K
Active
Jupyter Notebook
ETL & Pipelines
Backend Frameworks
Jupyter Notebook
#financial-analysis #web-scraping #data-pipeline

python-geeks/Automation-scripts

A repository of Python automation scripts to make a developer's life easier.

1.2K
Stable
Python
CLI Tools
General Utilities
#python #automation #hacktoberfest

sajal2692/data-science-portfolio

A portfolio of data science projects covering machine learning, NLP, and more for personal and academic use.

1.2K
Archived
Jupyter Notebook
Databases
ML Ops
Python
#data-science #machine-learning #nlp

xorbitsai/xorbits

A scalable Python library for data science and machine learning tasks, offering APIs compatible with familiar libraries such as pandas and NumPy and an emphasis on fast performance.

1.2K
Active
Python
ML Ops
Databases
Python
#data-science #machine-learning #scalable

machow/siuba

A Python library for using dplyr-like syntax with pandas and SQL databases.

1.2K
Stable
Python
ORMs & Query Builders
CLI Tools
Python
#data-analysis #pandas #sql
