Explore Projects

Discover 3,547 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 41-60 of 3,547 projects

minimaxir/big-list-of-naughty-strings

Collection of strings that can cause issues in user-input data for QA testing

47.6K
Archived
Python
Testing
#naughty-strings#qa-testing#input-validation

run-llama/llama_index

LLamaIndex framework for building LLM-powered agents

47.4K
Active
Python
Agents & Orchestration
Desktop Model Runners
LLM Frameworks
#llamaindex#agents#fine-tuning

jakevdp/PythonDataScienceHandbook

Python Data Science Handbook in Jupyter Notebooks

47.0K
Archived
Jupyter Notebook
Books & Guides
ETL & Pipelines
Jupyter Notebook
#jupyter-notebook#matplotlib#numpy

pixijs/pixijs

PixiJS is a fast 2D HTML5 rendering engine for creating interactive graphics and games with WebGL/WebGPU.

46.7K
Active
TypeScript
Frontend Frameworks
Animation & Motion
TypeScript
#webgl#webgpu#2d-rendering

GokuMohandas/Made-With-ML

Learn to build production-grade ML applications with code and best practices

46.6K
Archived
Jupyter Notebook
ML Ops
Tutorials & Courses
Jupyter Notebook
#machine-learning#mlops#data-science

metabase/metabase

Open-source BI tool for data analysis and visualization

46.3K
Active
Clojure
Search
Analytics & Tracking
Clojure
#analytics#business-intelligence#data-visualization

ClickHouse/ClickHouse

Real-time analytics database for generating data reports

46.2K
Active
C++
Databases
#analytics#big-data#clickhouse

GitHubDaily/GitHubDaily

Curated list of GitHub projects with tutorials, tools, and resources for developers.

45.5K
Stable
Awesome Lists
#github#awesome#resources

bevyengine/bevy

Bevy is a data-driven game engine in Rust for building 2D/3D games.

44.9K
Active
Rust
Full-Stack Frameworks
CLI Tools
Rust
#bevy#game-engine#rust

apache/airflow

Apache Airflow for workflow orchestration

44.5K
Active
Python
ETL & Pipelines
Background Jobs
Python
#airflow#data-pipelines#workflow-orchestration

NaiboWang/EasySpider

Visual code-free web crawler/spider with GUI for data collection and automation

44.0K
Active
JavaScript
Testing
No-Code AI Platforms
#web-crawler#data-collection#gui

streamlit/streamlit

Streamlit is a Python library for building and sharing interactive data apps quickly.

43.7K
Active
Python
CLI Tools
ETL & Pipelines
Python
#data-apps#interactive-visualization#python

AykutSarac/jsoncrack.com

Visualize JSON data into interactive graphs with features like format conversion and code generation.

43.4K
Active
TypeScript
Charts & Visualization
CLI Tools
Next.js
#json-visualization#data-format-conversion#code-generation

apache/spark

Unified analytics engine for large-scale data processing

42.9K
Active
Scala
ETL & Pipelines
Realtime
Apache
#big-data#spark#data-processing

apachecn/ailearning

AI learning repository with tutorials and implementations for machine learning, deep learning, and data analysis

42.1K
Archived
Python
ML Ops
Python
#machine-learning#deep-learning#data-analysis

gradio-app/gradio

Gradio App for building and sharing delightful machine learning apps

41.9K
Active
Python
AI Code Editors
Gradio
#gradio#machine learning#python app

deepspeedai/DeepSpeed

DeepSpeed optimizes deep learning training and inference with distributed computing techniques.

41.7K
Active
Python
ML Ops
Inference
PyTorch
#deep-learning#distributed-training#inference-optimization

ray-project/ray

Ray is a unified framework for scaling AI and Python applications with distributed computing and ML libraries.

41.6K
Active
Python
ML Ops
Containerization
Python
#distributed-computing#ml-ops#ai-framework

hpcaitech/ColossalAI

Colossal-AI optimizes large AI model training and inference with distributed computing and GPU acceleration.

41.4K
Active
Python
ML Ops
Inference
PyTorch
#ai-optimization#distributed-training#gpu-acceleration

ccxt/ccxt

Cryptocurrency trading API library for connecting to 100+ exchanges

41.2K
Active
Python
Crypto Tools
#crypto#trading-api#exchanges
124...178

Stay in the loop

Get weekly updates on trending AI coding tools and projects.