Explore Projects

Discover 380 open source projects

Active filters (1):
Search: data-scienceร—
Clear all

Showing 341-360 of 380 projects

DeepWisdom/AutoDL

Automated Deep Learning without any human intervention, the first solution for the AutoDL challenge@NeurIPS.

1.2K
Archived
Python
AutoML
ML Ops
PyTorch
#automated-machine-learning#automl#data-science

JuliaStats/Distributions.jl

A comprehensive Julia library for probability distributions and related statistical functions.

1.2K
Active
Julia
Databases
CLI Tools
#probability-distributions#statistics#data-science

predict-idlab/plotly-resampler

A Python library that helps visualize large time series data using the Plotly data visualization library.

1.2K
Stable
Python
Charts & Visualization
ETL & Pipelines
Python
#data-visualization#time-series#plotly

run-house/kubetorch

Distribute and run AI workloads on Kubernetes with a Python-based infrastructure toolkit like PyTorch.

1.2K
Active
Python
ML Ops
Containerization
PyTorch
#kubernetes#distributed-computing#data-science

ChawlaAvi/Daily-Dose-of-Data-Science

A collection of code snippets and tutorials for data science and data analysis in Python.

1.2K
Experimental
Jupyter Notebook
Databases
ETL & Pipelines
Jupyter
#data-analysis#data-science#jupyter-notebook

zinggAI/zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

1.2K
Active
Java
ETL & Pipelines
ML Ops
#identity-resolution#entity-resolution#data-deduplication

cleanlab/cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

1.2K
Active
Python
Computer Vision
Data Exploration
Python
#computer-vision#data-quality#data-profiling

WecoAI/aideml

AIDE is an AI-driven machine learning engineering agent that automates AI R&D tasks for developers.

1.1K
Stable
Python
Agents & Orchestration
LLM Frameworks
Python
#ai-r-d#machine-learning#automation

daochenzha/data-centric-AI

A curated list of resources for data-centric AI development, including tools, frameworks, and best practices.

1.1K
Archived
LLM Frameworks
Databases
#data-centric-ai#machine-learning#data-science

Shujian2015/FreeML

A curated list of free data science and machine learning resources for developers.

1.1K
Archived
Machine Learning
Tutorials & Courses
#data-science#machine-learning#natural-language-processing

compdemocracy/polis

Open-source AI-powered platform for large-scale participatory democracy and civic feedback

1.1K
Active
JavaScript
Agents & Orchestration
API Frameworks
JavaScript
#civic-tech#participatory-democracy#deliberative-democracy

areed1192/sigma_coding_youtube

A collection of code for tutorials covering a wide range of data science, API, and productivity tools.

1.1K
Archived
Jupyter Notebook
Tutorials & Courses
API Clients & Testing
Python
#data-science#api-development#productivity-tools

Oxen-AI/Oxen

A fast data versioning system for ML datasets, making it easy to version and track changes like code.

1.1K
Active
Rust
Data & Databases
Version Control
Rust
#data-versioning#machine-learning#version-control

qri-io/qri

An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.

1.1K
Archived
Go
Databases
CLI Tools
#dataset#ipfs#p2p

red-data-tools/pycall.rb

A library for calling Python functions from the Ruby language, enabling data science and ML workflows.

1.1K
Stable
C
ORMs & Query Builders
CLI Tools
Ruby
#data-science#pycall#python-integration

novak-99/MLPP

A C++ library for building machine learning applications, revitalizing C++ as a ML front-end.

1.1K
Archived
C++
LLM Frameworks
API Frameworks
#cpp#data-science#deep-learning

alishobeiri/thread-notebook

AI-powered Jupyter Notebook that can generate, edit, and debug code cells, and chat with your data

1.1K
Stable
JavaScript
AI Code Generation
LLM Frameworks
React
#ai-code-generation#jupyter-notebooks#llm

shaypal5/awesome-twitter-data

A curated list of Twitter datasets and resources for data scientists and social network analysts.

1.1K
Archived
Datasets
Social Network Analysis
#twitter#social-media#data-science

caserec/Datasets-for-Recommender-Systems

A high-quality dataset repository for building recommender systems, useful for vibe coders working on AI-powered applications.

1.1K
Archived
Jupyter Notebook
Datasets
#data-science#recommender-systems#public-data

dataquestio/project-walkthroughs

A collection of data science, machine learning, and web development project code for Dataquest's YouTube channel.

1.1K
Archived
Jupyter Notebook
Databases
Data Science
Jupyter Notebook
#data-science#machine-learning#pandas
1...1719

Stay in the loop

Get weekly updates on trending AI coding tools and projects.