Explore Projects

Discover 54 open source projects

Active filters (1):
Search: scientist

Showing 21-40 of 54 projects

spotify/chartify

A Python library that makes it easy for data scientists to create charts and visualizations.

3.6K
Archived
Python
Charts & Visualization
Data Visualization
#data-visualization #python #charts

fastai/fastpages

An easy-to-use blogging platform with enhanced support for Jupyter Notebooks, ideal for data scientists and AI developers.

3.5K
Archived
Jupyter Notebook
Static Site Generators
Databases
Jekyll
#jupyter-notebooks #data-science #blogging

databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

3.4K
Archived
Python
ORMs & Query Builders
Databases
Spark
#big-data #data-science #dataframe

gedeck/practical-statistics-for-data-scientists

Code repository for a book on practical statistics for data scientists.

3.2K
Stable
Jupyter Notebook
Data Analysis & Visualization
#statistics #data-science #jupyter-notebook

bfortuner/ml-glossary

A comprehensive machine learning glossary and cheatsheet for data scientists and AI developers.

3.1K
Archived
Python
Cheatsheets
Tutorials & Courses
Python
#machine-learning #deep-learning #data-science

weijie-chen/Linear-Algebra-With-Python

Lecture notes on linear algebra with Python for data scientists and quantitative analysts.

2.6K
Archived
Jupyter Notebook
React
#linear-algebra #python #data-science

oegedijk/explainerdashboard

Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.

2.5K
Active
Python
Explainer
Charts & Visualization
Dash
#explainable-ai #interactive-dashboards #model-interpretability

mckinsey/causalnex

A Python library to help data scientists infer causation from data rather than just observing correlation.

2.4K
Archived
Python
Machine Learning Ops
Caching
Python
#causal-inference #bayesian-networks #data-science

PizzaDeDados/datascience-pizza

A repository for collecting study materials and resources related to data analysis and related fields.

2.4K
Archived
Data Science
Tutorials & Courses
#data-science #data-analysis #learning-resources

apache/hamilton

Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.

2.4K
Active
Jupyter Notebook
ETL & Pipelines
MLOps
Python
#etl #data-engineering #data-science

DeepInsight-AI/DeepBI

An AI-driven data application platform that redefines business intelligence with LLMs and AI-native tools.

2.3K
Experimental
Python
LLM Frameworks
Databases
Python
#ai #data-analysis #business-intelligence

cdeweyx/DS-Career-Resources

Compilation of resources for aspiring data scientists, including career and internship information.

2.1K
Archived
Python
Tutorials & Courses
Coding Challenges
Python
#data-science #career #internships

zarr-developers/zarr-python

An efficient and compressed N-dimensional array library for Python, useful for data scientists and ML engineers.

1.9K
Active
Python
Databases
CLI Tools
Python
#compressed #ndimensional-arrays #data-science

krishnaik06/6-Months-Data-Science-Roadmap-

A 6-month data science roadmap for developers who want to become data scientists.

1.9K
Archived
Tutorials & Courses
Cheatsheets
#data-science #roadmap #tutorials

allisonhorst/stats-illustrations

A collection of illustrated R and statistics resources for developers and data scientists.

1.8K
Archived
Tutorials & Courses
Databases
#r #statistics #data-visualization

quant-science/sunday-quant-scientist

A newsletter focused on quantitative and algorithmic trading, portfolio analysis, and investing.

1.7K
Stable
HTML
Tutorials & Courses
Backend Frameworks
#finance #trading #quantitative-analysis

scientistproject/Scientist.net

A .NET library for carefully refactoring critical paths, ported from GitHub's Ruby Scientist library.

1.5K
Stable
C#
Testing
API Frameworks
#refactoring #testing #critical-paths

CodeCutTech/Efficient_Python_tricks_and_tools_for_data_scientists

A collection of efficient Python tricks and tools for data scientists to improve their productivity.

1.5K
Experimental
Jupyter Notebook
Data Science
Jupyter Notebook
#data-science #python #jupyter

code-kern-ai/refinery

An open-source tool for scaling, assessing, and maintaining natural language data for AI/ML models.

1.5K
Archived
Python
Data Labeling
Search-as-a-Service
Python
#active-learning #data-labeling #natural-language-processing

business-science/awesome-generative-ai-data-scientist

A curated list of 100+ resources for building and deploying generative AI, focused on helping you become a Generative AI Data Scientist.

1.4K
Experimental
LLM Frameworks
LLM Wrappers & SDKs
React
#generative-ai #data-science #machine-learning
