Explore Projects

Discover 29 open source projects

Active filters (1):
Search: datascienceร—
Clear all

Showing 1-20 of 29 projects

academic/awesome-datascience

Comprehensive Data Science learning and resource guide

28.5K
Active
Tutorials & Courses
ML Ops
#data-science#machine-learning#deep-learning

Avaiga/taipy

Taipy is a Python library that helps developers turn data and AI algorithms into production-ready web apps quickly.

19.1K
Active
Python
Agents & Orchestration
Python
#data-engineering#data-ops#data-visualization

FavioVazquez/ds-cheatsheets

A comprehensive collection of data science cheatsheets for developers and data scientists.

16.2K
Archived
Data Science
#datascience#cheatsheet#python

virgili0/Virgilio

A comprehensive learning resource for data science and machine learning, covering a wide range of topics and tools.

14.3K
Stable
Jupyter Notebook
Tutorials & Courses
#data-science#machine-learning#ai

modin-project/modin

Modin: Scalable Pandas workflows with a single line of code change, enabling distributed data processing.

10.4K
Stable
Python
Databases
Python
#pandas#data-analysis#data-processing

Netflix/metaflow

Build, manage and deploy AI/ML systems with Metaflow

9.9K
Active
Python
Next.js
#metaflow#ai#ml

firmai/industry-machine-learning

A curated collection of practical machine learning and data science notebooks and libraries across different industries.

7.4K
Archived
Jupyter Notebook
Machine Learning Ops
Databases
Jupyter Notebook
#data-science#machine-learning#jupyter-notebook

traceloop/openllmetry

Open-source observability tool for GenAI and LLM applications, based on OpenTelemetry

6.9K
Active
Python
LLM Frameworks
Monitoring
Python
#llm#observability#monitoring

holoviz/panel

A powerful data exploration and web app framework for Python, with rich visualization and interactive features.

5.6K
Active
Python
Charts & Visualization
ORMs & Query Builders
Bokeh
#data-visualization#data-exploration#interactive-dashboards

sreeharierk/datascience

A comprehensive collection of free resources for learning and practicing data science, including AI and ML tools.

5.1K
Experimental
Machine Learning Algorithms
Computer Vision
#data-science#machine-learning#deep-learning

Nyandwi/machine_learning_complete

Comprehensive machine learning repository with 30+ notebooks covering various ML concepts and techniques.

5.0K
Archived
Jupyter Notebook
Computer Vision
Data Science
Jupyter Notebook
#machine-learning#data-science#computer-vision

lk-geimfari/mimesis

Mimesis is a fast Python library for generating fake data in multiple languages for testing and development purposes.

4.8K
Active
Python
Databases
Testing
#data-generation#fake-data#testing

theOehrly/Fast-F1

A Python package for accessing and analyzing Formula 1 racing data, including results, schedules, timing, and telemetry.

4.5K
Active
Python
Databases
Data Science
#formula1#motorsport#data-analysis

whoiskatrin/sql-translator

A TypeScript-based tool for converting natural language queries into SQL using AI.

4.3K
Experimental
TypeScript
LLM Wrappers & SDKs
Databases
TypeScript
#data-analysis#data-engineering#dataquery

underlines/awesome-ml

Curated list of useful LLM, analytics, and data science resources for developers working with AI tools.

2.6K
Experimental
LLM Frameworks
Databases
#machine-learning#data-science#analytics

EntilZha/PyFunctional

A Python library for creating data processing pipelines using functional programming principles.

2.5K
Experimental
Python
ETL & Pipelines
CLI Tools
Python
#data-pipeline#functional-programming#python-library

hardikkamboj/An-Introduction-to-Statistical-Learning

This repository provides Python implementations of exercises from the book 'An Introduction to Statistical Learning'.

2.5K
Archived
Jupyter Notebook
Data Science
Machine Learning
Python
#data-science#machine-learning#statistical-learning

PizzaDeDados/datascience-pizza

A repository for collecting study materials and resources related to data analysis and related fields.

2.4K
Archived
Data Science
Tutorials & Courses
#data-science#data-analysis#learning-resources

IndrajeetPatil/ggstatsplot

ggstatsplot is an R library that enhances ggplot2 visualizations with statistical analysis and hypothesis testing.

2.2K
Active
R
Data Visualization
Testing
R
#ggplot2#statistics#data-analysis

chris1610/pbpython

A collection of Python code, notebooks, and examples for practical business data analysis and visualization.

2.0K
Archived
Jupyter Notebook
Data Analysis
Data Visualization
#data-analysis#data-visualization#pandas
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.