Explore Projects

Discover 290 open source projects

Active filters (1):
Search: apache×
Clear all

Showing 141-160 of 290 projects

GoogleCloudPlatform/PerfKitBenchmarker

PerfKit Benchmarker is an open-source tool for measuring and comparing cloud infrastructure performance.

2.0K
Active
Python
CLI Tools
Containerization
#benchmark#performance#cloud

databricks/spark-deep-learning

Deep learning library for Apache Spark that provides high-level APIs and models for building machine learning pipelines.

2.0K
Archived
Python
ML Ops
ETL & Pipelines
Apache Spark
#machine-learning#deep-learning#spark

elyra-ai/elyra

Elyra extends JupyterLab with an AI-centric approach for developing and deploying ML/AI pipelines.

2.0K
Active
Python
ML Ops
MCP Frameworks
JupyterLab
#ai#machine-learning#jupyterlab

apache/bookkeeper

Apache BookKeeper is a scalable, fault tolerant and low latency storage service optimized for append-only workloads.

2.0K
Active
Java
Databases
Realtime
#distributed-systems#big-data#wal

apache/datafusion-ballista

Apache DataFusion Ballista is a distributed query engine for big data analysis, built with Rust and Arrow.

2.0K
Active
Rust
Databases
ETL & Pipelines
#big-data#dataframe#distributed

tinkerpop/gremlin

A Graph Traversal Language for traversing and querying graph data structures.

2.0K
Archived
Java
React
#graph-traversal#querying#data-structures

apache/cassandra-spark-connector

A Scala connector that allows Apache Spark to interact with Apache Cassandra databases.

2.0K
Experimental
Scala
API Frameworks
Databases
Scala
#cassandra#spark#database

apache/trafficserver

A fast, scalable, and extensible HTTP/1.1 and HTTP/2 compliant caching proxy server.

1.9K
Active
C++
API Frameworks
Caching
#http#proxy#caching

rholder/retrying

General-purpose retrying library that simplifies adding retry behavior to Python code

1.9K
Archived
Python
General Utilities
#retrying#backoff#exponential-backoff

feathr-ai/feathr

Feathr is a scalable, unified data and AI engineering platform for enterprises, with features like feature engineering, feature governance, and a feature marketplace.

1.9K
Archived
Scala
Feature Flags
MLOps
Apache Spark
#data-engineering#feature-engineering#feature-governance

Wizcorp/phonegap-facebook-plugin

The official plugin for Facebook integration in Apache Cordova/PhoneGap for mobile app development.

1.9K
Archived
Java
Cross-Platform
API Clients & Testing
Apache Cordova
#facebook-integration#cordova-plugin#phonegap-plugin

oupala/apaxy

A customizable Apache directory listing theme with Docker support

1.9K
Archived
Shell
Icons & Assets
Docker
React
#customizable-theme#docker#apache

apache/servicecomb-pack

Apache ServiceComb Pack is a distributed transaction coordination solution for microservices applications.

1.9K
Archived
Java
API Frameworks
Databases
#microservices#distributed-transactions#tcc

apache/kudu

Apache Kudu is a high-performance, open-source columnar storage engine for large datasets in the Apache Hadoop ecosystem.

1.9K
Active
C++
Databases
API Frameworks
#big-data#cplusplus#open-source

uber/petastorm

Petastorm enables training and evaluation of deep learning models from Apache Parquet datasets.

1.9K
Active
Python
ML Ops
Databases
PyTorch
#deep-learning#machine-learning#data-processing

zhp8341/flink-streaming-platform-web

A real-time streaming platform built on Apache Flink for building scalable and reliable data pipelines.

1.9K
Stable
Java
API Frameworks
Streaming
Java
#flink#sql#streaming

nubskr/walrus

A distributed log streaming engine built from first principles using Rust, designed for high-performance data processing.

1.9K
Active
Rust
Realtime
Databases
#streaming#kafka#nats

apache/polaris

Apache Polaris is an open-source catalog for Apache Iceberg, a high-performance table format for data lakes.

1.9K
Active
Java
API Frameworks
Databases
Apache
#apache#iceberg#data-catalog

awesome-spark/awesome-spark

A curated list of awesome Apache Spark packages and resources for developers.

1.9K
Archived
Shell

dromara/MaxKey

MaxKey is an open-source, leading-edge IAM-IDaaS (Identity and Access Management) product that supports various SSO protocols.

1.8K
Active
Java
Authentication
Analytics & Tracking
Spring
#authentication#sso#iam
1...79...15

Stay in the loop

Get weekly updates on trending AI coding tools and projects.