Explore Projects

Discover 703 open source projects

Active filters (1):
Search: sciences

Showing 101-120 of 703 projects

microsoft/RD-Agent

An R&D agent that automates high-value generic R&D processes to let AI drive data-driven AI.

11.5K
Active
Python
Agents & Orchestration
Python
#ai #automation #data-mining

0voice/expert_readed_books

A curated list of recommended books for engineers covering computer science, software, startups, and more.

11.5K
Experimental
Books & Guides
#books #programming #computer-science

cleanlab/cleanlab

An open-source library for data-centric AI with tools for data quality and machine learning on messy, real-world data.

11.4K
Active
Python
Data Quality
Python
#data-centric-ai #data-quality #data-cleaning

statsmodels/statsmodels

Statsmodels is a Python library for statistical modeling and econometrics, providing tools for data analysis and prediction.

11.3K
Active
Python
Data Science
Python
#data-analysis #statistics #econometrics

great-expectations/great_expectations

A Python library that helps ensure data quality and reliability through data profiling and testing.

11.2K
Active
Python
ETL & Pipelines
#data-quality #data-testing #data-profiling

aws/amazon-sagemaker-examples

A collection of Jupyter notebooks showcasing how to build and deploy machine learning models with Amazon SageMaker.

10.9K
Active
Jupyter Notebook
ML Ops
Jupyter Notebook
#machine-learning #deep-learning #data-science

wandb/wandb

The AI developer platform to train and fine-tune models, and manage models from experimentation to production.

10.9K
Active
Python
ML Ops
PyTorch
#ai #machine-learning #model-versioning

1c7/Crash-Course-Computer-Science-Chinese

Chinese subtitle translations for the Crash Course Computer Science video series.

10.8K
Archived
JavaScript
React
#computer-science #crash-course #cs

kedro-org/kedro

Kedro is a Python toolkit for building production-ready data science and machine learning pipelines.

10.8K
Active
Python
ETL & Pipelines
Python
#machine-learning #data-engineering #pipeline

fastai/numerical-linear-algebra

A free online textbook of Jupyter notebooks for the fast.ai Computational Linear Algebra course.

10.7K
Archived
Jupyter Notebook
Linear-Algebra
Python
#linear-algebra #data-science #machine-learning

Yorko/mlcourse.ai

An open-source machine learning course focused on practical algorithms and data analysis in Python.

10.5K
Active
Python
Algorithms
Python
#machine-learning #data-science #algorithms

voxel51/fiftyone

Refine high-quality datasets and visual AI models with this Python library for active learning and data curation.

10.4K
Active
Python
Computer Vision
Python
#active-learning #data-curation #data-quality

lexfridman/mit-deep-learning

This repository contains tutorials, assignments, and competitions for MIT's deep learning courses, covering a wide range of AI and machine learning topics.

10.4K
Archived
Jupyter Notebook
Deep Learning
#deep-learning #machine-learning #ai

modin-project/modin

Modin scales pandas workflows with a single-line code change, enabling distributed data processing.

10.4K
Stable
Python
Databases
Python
#pandas #data-analysis #data-processing

SSHeRun/CS-Xmind-Note

A collection of mind maps and notes for fundamental computer science courses.

10.3K
Archived
Tutorials & Courses
#computer-science #education #study-materials

wolverinn/Waking-Up

A comprehensive resource for computer science interview preparation, covering topics like networking, OS, databases, and Git.

10.2K
Archived
Interview Prep
Python
#interview-questions #computer-science #networking

chiphuyen/machine-learning-systems-design

A booklet on machine learning systems design with exercises for developers interested in MLOps and production ML.

10.1K
Archived
HTML
MLOps
#machine-learning #systems-design #mlops

autogluon/autogluon

A fast and accurate machine learning framework that can build models in just 3 lines of Python code.

10.1K
Active
Python
ML Ops
PyTorch
#automated-machine-learning #computer-vision #tabular-data

EpistasisLab/tpot

A Python Automated Machine Learning tool that optimizes ML pipelines using genetic programming.

10.0K
Stable
Jupyter Notebook
AutoML
scikit-learn
#automated-machine-learning #hyperparameter-optimization #model-selection

HugoBlox/kit

An open-source Copilot for data scientists for building high-performance portfolios, lab sites, and docs in Markdown and Jupyter.

9.9K
Active
HTML
AI Code Editors
Hugo
#data-science #open-source #static-site-generator