Explore Projects

Discover 380 open source projects

Active filters (1):
Search: data-scienceร—
Clear all

Showing 101-120 of 380 projects

afshinea/stanford-cs-230-deep-learning

A collection of cheatsheets for the Stanford CS 230 Deep Learning course, covering key concepts and techniques.

6.9K
Archived
ML Ops
Cheatsheets
#deep-learning#convolutional-neural-networks#recurrent-neural-networks

mahmoud/boltons

A collection of Python utility functions and data structures that extend the standard library.

6.9K
Active
Python
General Utilities
CLI Tools
#python#standard-library#utilities

polakowo/vectorbt

A high-performance Python library for backtesting, algorithmic trading, and quantitative research.

6.8K
Active
Python
Quantitative Analysis
API Frameworks
Python
#algorithmic-trading#backtesting#quantitative-finance

flyteorg/flyte

A flexible workflow orchestration platform that seamlessly integrates data, ML, and analytics stacks.

6.8K
Active
Go
ML Ops
API Frameworks
Go
#workflow-orchestration#data-integration#machine-learning

feast-dev/feast

An open-source feature store for AI/ML applications

6.8K
Active
Python
React
#feature-store#open-source#AI/ML

rhiever/Data-Analysis-and-Machine-Learning-Projects

A collection of data analysis and machine learning projects and resources for developers.

6.6K
Archived
Jupyter Notebook
Data Science
Learning & Education
Jupyter Notebook
#data-analysis#machine-learning#jupyter-notebook

qinwf/awesome-R

A curated list of awesome R packages, frameworks and software for data analysis and data science.

6.4K
Stable
R
Databases
ORMs & Query Builders
#r#rstats#data-analysis

dair-ai/ML-Course-Notes

A repository of machine learning course and lecture notes for developers interested in AI and data science.

6.4K
Archived
LLM Frameworks
Tutorials & Courses
#machine-learning#data-science#deep-learning

rushter/data-science-blogs

A curated list of data science blogs

6.4K
Archived
Python
React
#data-science#machine-learning#curated-lists

haifengl/smile

Smile is a comprehensive statistical machine learning and data science library for Java developers.

6.3K
Active
Java
ML Ops
Databases
#machine-learning#data-science#statistics

pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

6.3K
Experimental
Go
ETL & Pipelines
Containerization
Go
#data-pipelines#data-versioning#distributed-systems

lance-format/lance

An open-source data format for building high-performance multimodal AI applications with fast random access, vector indexing, and data versioning.

6.1K
Active
Rust
LLM Frameworks
Databases
Rust
#data-format#data-versioning#vector-index

aimhubio/aim

Aim is an open-source experiment tracker that makes it easy to track and visualize machine learning experiments.

6.0K
Active
Python
ML Ops
CLI Tools
PyTorch
#experiment-tracking#metadata-tracking#visualization

datajuicer/data-juicer

A Python library for processing and analyzing data with foundation models and large language models.

6.0K
Active
Python
LLM Frameworks
ETL & Pipelines
Python
#data-processing#data-analysis#foundation-models

evidence-dev/evidence

A business intelligence platform that allows developers to build interactive data visualizations in SQL and Markdown.

6.0K
Stable
JavaScript
Charts & Visualization
Databases
Svelte
#analytics#business-intelligence#dashboard

snorkel-team/snorkel

A powerful system for quickly generating high-quality training data with weak supervision for AI/ML projects.

5.9K
Archived
Python
LLM Frameworks
Data Pipelines
Python
#data-augmentation#weak-supervision#machine-learning

PriorLabs/TabPFN

A foundation model for tabular data that enables advanced machine learning on structured datasets.

5.8K
Active
Python
LLM Frameworks
ML Ops
Python
#tabular-data#foundation-model#machine-learning

online-ml/river

A Python library for online machine learning, enabling incremental and real-time learning on data streams.

5.7K
Active
Python
ML Ops
Streaming
Python
#concept-drift#incremental-learning#online-learning

ujjwalkarn/DataSciencePython

A Python library for common data analysis and machine learning tasks

5.7K
Archived
Python
Databases
ML Ops
Python
#data-science#machine-learning#python-tutorial

biolab/orange3

Interactive data analysis tool with machine learning, visualization, and decision tree capabilities

5.6K
Active
Python
ML Ops
Data Visualization
Python
#data-visualization#machine-learning#data-mining
1...57...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.