Explore Projects

Discover 3,485 open source projects

Active filters (1):
Search: dataร—
Clear all

Showing 3261-3280 of 3,485 projects

FrenchYeti/dexcalibur

An all-in-one Android reverse engineering platform focused on dynamic instrumentation automation.

1.1K
Archived
JavaScript
CLI Tools
Android
Node
#android#reverse-engineering#instrumentation-automation

sierra-research/tau-bench

Tau-Bench is a Python library for benchmarking and evaluating AI language models and tools.

1.1K
Stable
Python
LLM Frameworks
CLI Tools
#benchmarking#evaluation#language-models

yaml/libyaml

LibYAML is a C library for parsing and emitting YAML, a popular data serialization format.

1.1K
Archived
C
API Clients & Testing
CLI Tools
#yaml#serialization#parsing

lhotse-speech/lhotse

Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.

1.1K
Active
Python
Speech & Voice
Data Pipelines
PyTorch
#speech-recognition#audio-processing#data-handling

mukunku/ParquetViewer

A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.

1.1K
Active
C#
Databases
CLI Tools
#apache-parquet#big-data#windows-desktop

tdpetrou/Learn-Pandas

This GitHub repository provides tutorials on effectively using the Pandas library for data analysis.

1.1K
Archived
Jupyter Notebook
Databases
Tutorials & Courses
Jupyter Notebook
#pandas#data-analysis#data-science

scratchdata/scratchdata

A Swiss army knife for big data, enabling seamless integration with popular data warehousing solutions.

1.1K
Archived
Go
Databases
CLI Tools
#bigquery#clickhouse#data-warehouse

DamianOsipiuk/vue-query

Hooks for fetching, caching, and updating asynchronous data in Vue applications.

1.1K
Archived
TypeScript
Component Libraries (Vue/Svelte)
GraphQL
Vue
#async#cache#fetch

jblindsay/whitebox-tools

An advanced geospatial data analysis platform for tasks like geomorphology, hydrology, and remote sensing.

1.1K
Experimental
Rust
Databases
CLI Tools
#geospatial#gis#geomorphology

mariusandra/insights

Open-source self-hosted business intelligence platform for data analytics and visualization.

1.1K
Stable
JavaScript
Charts & Visualization
API Frameworks
React
#business-intelligence#dashboard#data-analytics

liuyubobobo/Play-with-Algorithm-Interview

A collection of coding interview preparation materials, including algorithms, data structures, and practice problems.

1.1K
Archived
C++
Coding Challenges
Interview Prep
#algorithms#data-structures#interview-prep

apache/amoro

Apache Amoro is an open-source Lakehouse management system built on big data formats like Flink, Hudi, and Iceberg.

1.1K
Active
Java
Databases
ETL & Pipelines
Flink
#big-data#data-lake#lakehouse

cosmosgl/graph

GPU-accelerated force graph layout and rendering library for visualizing network data.

1.1K
Active
TypeScript
Charts & Visualization
CLI Tools
React
#data-visualization#graph-algorithms#force-layout

Oxen-AI/Oxen

A fast data versioning system for ML datasets, making it easy to version and track changes like code.

1.1K
Active
Rust
Data & Databases
Version Control
Rust
#data-versioning#machine-learning#version-control

tatuylonen/wiktextract

A Python library for parsing and extracting multilingual data from Wiktionary dump files.

1.1K
Active
Python
CLI Tools
Databases
#wiktionary#multilingual#parser

chen310/NeteaseCloudMusicTasks

A Python library for interacting with the Netease Cloud Music API, providing access to music data and user-related functionalities.

1.1K
Archived
Python
API Clients & Testing
Backend Frameworks
Python
#netease-cloud-music#music-api#python-library

Teradata/kylo

Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.

1.1K
Archived
Java
ETL & Pipelines
Realtime
#data-lake#hadoop#spark

grapheco/InteractiveGraph

An interactive visualization and analysis framework for large graph data, with built-in applications for navigation, exploration, and relationship discovery.

1.1K
Experimental
JavaScript
Charts & Visualization
Frontend Frameworks
JavaScript
#graph-visualization#data-exploration#neo4j-integration

hazelcast/hazelcast-jet

Hazelcast Jet is a distributed stream and batch processing engine for high-performance applications.

1.1K
Archived
Java
API Frameworks
Databases
#distributed-processing#batch-processing#stream-processing

qri-io/qri

An open-source platform for building and sharing datasets, focused on trust, privacy, and decentralization.

1.1K
Archived
Go
Databases
CLI Tools
#dataset#ipfs#p2p
1...163165...175

Stay in the loop

Get weekly updates on trending AI coding tools and projects.