Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 161-180 of 3,485 projects

TheAlgorithms/C

Educational C algorithm implementations

21.8K
Archived
C
Coding Challenges
Documentation
#c#algorithms#data-structures

recommenders-team/recommenders

Recommenders is a project for prototyping and operationalizing recommendation systems with Jupyter notebooks and best practices.

21.5K
Active
Python
ML Ops
ETL & Pipelines
Python
#recommendation-systems#machine-learning#data-science

vectordotdev/vector

High-performance observability data pipeline for logs and metrics

21.4K
Active
Rust
Monitoring
ETL & Pipelines
#observability#data-pipeline#logs

yjs/yjs

CRDT framework for real-time collaborative editing

21.3K
Active
JavaScript
Collaboration & Real-time
Frontend Frameworks
JavaScript
#collaboration#realtime#crdt

matomo-org/matomo

Open-source analytics platform for website tracking with built-in privacy

21.3K
Active
PHP
Analytics & Tracking
PHP
#analytics#php#privacy

thingsboard/thingsboard

Open-source IoT platform for device management, data collection, and visualization

21.3K
Active
Java
Home Automation
Realtime
Java
#iot-platform#device-management#data-visualization

huggingface/datasets

AI-powered dataset management and preprocessing library for ML projects

21.3K
Active
Python
ML Ops
ETL & Pipelines
HuggingFace
#datasets#ml-ops#data-preprocessing

hapijs/joi

Powerful data validation for JavaScript

21.2K
Stable
JavaScript
Validation
JavaScript
#data-validation#javascript#schema

ossu/data-science

Free self-taught Data Science curriculum with MOOCs

21.0K
Experimental
Tutorials & Courses
#data-science#curriculum#mooc

elastic/kibana

Kibana is an open-source data visualization and management tool for Elasticsearch

21.0K
Active
TypeScript
Search
Charts & Visualization
#elasticsearch#data-visualization#observability

airbytehq/airbyte

Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes

20.8K
Active
Python
ETL & Pipelines
#data-integration#elt#etl

apache/shardingsphere

Distributed SQL database middleware for sharding, scalability, and security

20.7K
Active
Java
Databases
Java
#distributed-sql#database-sharding#data-encryption

airbnb/visx

Reusable visualization components for React and D3

20.6K
Stable
TypeScript
Charts & Visualization
React
#react#d3#visualization

modood/Administrative-divisions-of-China

Chinese administrative divisions data for provinces, cities, counties, towns, and villages.

20.6K
Stable
JavaScript
General Utilities
#china#administrative-divisions#address-data

dolthub/dolt

Dolt is Git for Data, enabling version control for SQL databases with Git-like commands and features.

20.5K
Active
Go
Databases
#data-version-control#database#git-for-data

toml-lang/toml

TOML is a minimal configuration file format designed for easy reading and parsing into data structures.

20.4K
Stable
Validation
#toml#configuration#serialization

bokeh/bokeh

Interactive data visualization library for Python and JavaScript

20.4K
Active
TypeScript
Charts & Visualization
Jupyter
#data-visualization#interactive-plots#python

allinurl/goaccess

Real-time web log analyzer for terminal and browser

20.3K
Active
C
Terminal UIs
Analytics & Tracking
#log-analysis#real-time#terminal

EthicalML/awesome-production-machine-learning

Curated list of open source libraries for deploying, monitoring, and scaling machine learning in production

20.2K
Active
ML Ops
#machine-learning#mlops#ai

facebook/prophet

Time series forecasting with Prophet for multiple seasonality and growth patterns.

20.1K
Active
Python
Inference
ETL & Pipelines
Python
#forecasting#time-series#data-science
1...810...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.