Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 481-500 of 3,485 projects

apache/iceberg

Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.

8.6K
Active
Java
Databases
API Frameworks
Apache
#data-lake#versioning#scalable

blacksmithgu/obsidian-dataview

A powerful data index and query language for Obsidian.md, a note-taking and knowledge management app.

8.6K
Stable
TypeScript
IDE Extensions
Databases
TypeScript
#obsidian-plugin#query-language#data-index

jamiebuilds/itsy-bitsy-data-structures

A library of common data structures in JavaScript for developers to learn and explore.

8.6K
Archived
JavaScript
Learning & Education
Backend & APIs
JavaScript
#data-structures#algorithms#javascript

fengdu78/Data-Science-Notes

A collection of notes and resources for data science, but not specifically focused on vibe coders or AI tools.

8.5K
Archived
Jupyter Notebook
Tutorials & Courses
Books & Guides
#data-science#machine-learning#notebooks

google/osv-scanner

An open-source vulnerability scanner written in Go that uses data from osv.dev to identify security issues.

8.5K
Active
Go
Security Research
CLI Tools
#security#vulnerability-scanner#open-source

apache/beam

Apache Beam is a unified programming model for batch and streaming data processing.

8.5K
Active
Java
ETL & Pipelines
API Frameworks
#batch#streaming#big-data

jackvale/rectg

A curated collection of 10,000+ Telegram channels, groups, and bots to help developers discover resources.

8.5K
Stable
JavaScript
Awesome Lists & Curations
API Clients & Testing
Node.js
#open-data#telegram#telegram-bot

yogeshojha/rengine

An automated reconnaissance framework for web applications focused on highly configurable streamlined recon process.

8.5K
Stable
HTML
Penetration Testing
CLI Tools
#information-gathering#reconnaissance#web-scanning

vaexio/vaex

A high-performance Python library for working with large tabular datasets, offering efficient data manipulation and visualization.

8.5K
Stable
Python
Databases
Caching
Python
#bigdata#data-science#dataframe

open-circle/valibot

A modular and type-safe schema library for validating structural data, focused on developer productivity.

8.5K
Active
TypeScript
API Clients & Testing
CLI Tools
TypeScript
#type-safe#modular#schema

apache/datafusion

Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.

8.5K
Active
Rust
Databases
ETL & Pipelines
#big-data#dataframe#olap

GoogleCloudPlatform/training-data-analyst

A collection of labs and demos for Google Cloud Platform (GCP) training courses.

8.5K
Active
Jupyter Notebook
Learning & Education
API Frameworks
#google-cloud-platform#training-resources#api-development

appbaseio/dejavu

A web UI for Elasticsearch and OpenSearch that allows importing, browsing, and editing data with rich filters and query views.

8.5K
Active
JavaScript
Realtime
Databases
React
#elasticsearch#opensearch#database-gui

kelseyhightower/confd

Manage local application configuration files using templates and data from etcd or consul.

8.4K
Archived
Go
API Frameworks
CLI Tools
#configuration-management#templates#etcd

jupyter/docker-stacks

Docker images containing Jupyter applications for data science and machine learning workflows.

8.4K
Active
Python
Databases
CLI Tools
Python
#jupyter#ipython#notebook

visgl/react-map-gl

A React-friendly wrapper around the Mapbox GL JS library for building interactive maps

8.4K
Active
TypeScript
Component Libraries (React)
Databases
React
#data-visualization#map#mapbox

ericchiang/pup

A command-line tool for parsing HTML, useful for web scraping and data extraction tasks.

8.4K
Archived
HTML
Backend Frameworks
CLI Tools
Node.js
#web-scraping#data-extraction#html-parsing

NorthwoodsSoftware/GoJS

JavaScript diagramming library for interactive flowcharts, org charts, design tools, planning tools, visual languages.

8.4K
Active
HTML
Charts & Visualization
Frontend Frameworks
JavaScript
#visualization#diagrams#charts

thoughtbot/factory_bot

A Ruby library for setting up test data using factories, making it easier to write and maintain tests.

8.4K
Active
Ruby
Testing
API Frameworks
Rails
#testing#factories#rails

crossbeam-rs/crossbeam

Crossbeam is a Rust library that provides tools for concurrent programming, including data structures and synchronization primitives.

8.4K
Active
Rust
API Frameworks
CLI Tools
#concurrency#data-structures#lock-free
1...2426...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.