Explore Projects

Discover 125 open source projects

Active filters (1):
Search: data-analysis

Showing 101-120 of 125 projects

sfirke/janitor

A collection of simple tools for data cleaning and wrangling in R for data science tasks.

1.4K
Archived
R
Data Cleaning & Wrangling
#data-analysis #data-cleaning #data-science

bruin-data/bruin

A data platform for building data pipelines with SQL and Python and for ingesting data from various sources.

1.4K
Active
Go
ETL & Pipelines
API Frameworks
Go
#data-pipelines #data-ingestion #data-transformation

GoogleCloudPlatform/data-science-on-gcp

A repository providing data science tools and examples for the Google Cloud Platform.

1.4K
Stable
Jupyter Notebook
React
#data-science #cloud-computing #google-cloud

andresvourakis/data-scientist-handbook

A curated collection of resources to help aspiring and experienced data scientists in their career journey.

1.4K
Stable
Tutorials & Courses
Books & Guides
#data-science #data-analysis #machine-learning

data-forge/data-forge-ts

A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.

1.4K
Stable
TypeScript
Data Transformation & Analysis
Frontend Frameworks
React
#data-transformation #data-analysis #data-manipulation

amphi-ai/amphi-etl

A visual data preparation tool powered by Python, designed for data analysis and ETL tasks.

1.4K
Active
TypeScript
ETL & Pipelines
Data Analysis
TypeScript
#data-analysis #data-pipelines #data-transformation

hurshd0/must-read-papers-for-ml

A curated collection of must-read papers for Data Science, Machine Learning, and Deep Learning enthusiasts.

1.3K
Archived
Papers
#machine-learning #deep-learning #data-science

uxlfoundation/scikit-learn-intelex

Seamless integration of scikit-learn with the Intel Extension for Scikit-learn for AI inference and machine learning applications.

1.3K
Active
Python
React
#machine-learning #ai-inference #scikit-learn-integration

dongsuo/vue-data-board

A data analysis and visualization board built with Vue.js and ECharts, with support for drag-and-drop and no-code configuration.

1.3K
Experimental
Vue
Charts & Visualization
Component Libraries (Vue/Svelte)
Vue
#data-analysis #data-visualization #drag-and-drop

singer-io/getting-started

A getting started guide to Singer, a data integration framework for ETL and data analysis.

1.3K
Stable
Makefile
Makefile
#authentication #streaming #real-time

alan-turing-institute/CleverCSV

A Python package for handling messy CSV files with improved dialect detection and a command-line interface.

1.3K
Active
Python
ETL & Pipelines
CLI Tools
#csv #data-analysis #data-mining

PatMartin/Dex

Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.

1.3K
Archived
JavaScript
ETL & Pipelines
Charts & Visualization
#data-analysis #data-visualization #data-mining

LongOnly/Quantitative-Notebooks

Educational notebooks on quantitative finance, algorithmic trading, financial modeling, and investment strategy.

1.3K
Archived
Jupyter Notebook
Data Analysis
API Frameworks
Jupyter
#algorithmic-trading #data-science #finance

xinglie/report-designer

A comprehensive platform for designing, visualizing, and printing reports, diagrams, and more.

1.3K
Active
HTML
Component Libraries (React)
CMS & Content
React
#data-visualization #editor #online-design

nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

1.2K
Stable
Python
Data Analysis
Network Security
#network-analysis #traffic-classification #data-mining

apache/cloudberry

Open-source massively parallel processing (MPP) database, an alternative to Greenplum.

1.2K
Active
C
Databases
OLAP
PostgreSQL
#big-data #data-analysis #data-warehouse

machow/siuba

A Python library for using dplyr-like syntax with pandas and SQL databases.

1.2K
Stable
Python
ORMs & Query Builders
CLI Tools
Python
#data-analysis #pandas #sql

predict-idlab/plotly-resampler

A Python library that helps visualize large time series data using the Plotly data visualization library.

1.2K
Stable
Python
Charts & Visualization
ETL & Pipelines
Python
#data-visualization #time-series #plotly

ChawlaAvi/Daily-Dose-of-Data-Science

A collection of code snippets and tutorials for data science and data analysis in Python.

1.2K
Experimental
Jupyter Notebook
Databases
ETL & Pipelines
Jupyter
#data-analysis #data-science #jupyter-notebook

apachecn/pyda-2e-zh

A Chinese translation of the book 'Python for Data Analysis' 2nd Edition, covering NumPy, Pandas, and other data analysis tools.

1.1K
Archived
CSS
Databases
ETL & Pipelines
#data-analysis #numpy #pandas
