Explore Projects

Discover 290 open source projects

Active filters (1):
Search: apacheร—
Clear all

Showing 241-260 of 290 projects

astronomer/astronomer-cosmos

Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code.

1.2K
Active
Python
API Frameworks
ETL & Pipelines
Python
#airflow#dbt#workflow

TomRoush/PdfBox-Android

A Java library for working with PDF files on Android devices.

1.2K
Archived
Java
API Frameworks
Android
Android
#android#pdf#document-manipulation

lensacom/sparkit-learn

A Python library that integrates Scikit-learn into the Apache Spark distributed computing framework.

1.2K
Archived
Python
ML Ops
ETL & Pipelines
#apache-spark#scikit-learn#distributed-computing

Thriftpy/thriftpy

Thriftpy is a lightweight, pure-Python implementation of the Apache Thrift RPC framework.

1.2K
Archived
Python
API Frameworks
Python
#rpc#serialization#thrift

apache/datafusion-comet

A Spark accelerator for Apache DataFusion, a SQL query engine written in Rust, aimed at vibe coders.

1.1K
Active
Scala
LLM Frameworks
Databases
Spark
#spark#rust#data-processing

weexteam/hackernews-App-powered-by-Apache-Weex

This is a Hackernews clone built with the Apache Weex framework, likely focused on front-end development.

1.1K
Archived
Java
Component Libraries (Vue/Svelte)
Vue
#hackernews#clone#vue

apache/accumulo

Apache Accumulo is a scalable and robust key-value store that provides a sparse, sorted, distributed, and persistent multi-dimensional table.

1.1K
Active
Java
Databases
API Frameworks
#big-data#distributed-computing#database

graphframes/graphframes

GraphFrames provides DataFrame-based Graphs for Apache Spark, enabling scalable graph analysis and algorithms.

1.1K
Active
Scala
Databases
Caching
#apache-spark#big-data#graph-analysis

apache/cordova-plugin-inappbrowser

Apache Cordova InAppBrowser Plugin allows developers to open URLs inside their app instead of the default browser.

1.1K
Active
Java
Cross-Platform
Component Libraries (React)
React
#android#cordova#ios

confluentinc/cp-all-in-one

Docker-compose files for running the Confluent Platform, an Apache Kafka-based event streaming platform.

1.1K
Stable
Python
API Frameworks
Databases
#apache-kafka#confluent#docker-compose

Parsely/pykafka

High-performance Apache Kafka client library for Python developers with low-level and high-level consumer/producer APIs.

1.1K
Archived
Python
API Frameworks
Databases
#apache-kafka#event-streaming#message-queue

apache/nano

Nano is a JavaScript library for CouchDB, a popular NoSQL database, providing a simple API for interacting with it.

1.1K
Archived
JavaScript
API Clients & Testing
Databases
Node
#couchdb#nosql#database

apache/apisix-ingress-controller

APISIX Ingress Controller for Kubernetes, a high-performance, cloud-native API gateway built on top of Apache APISIX.

1.1K
Active
Go
API Frameworks
Containerization
Go
#api-gateway#kubernetes#cloud-native

mukunku/ParquetViewer

A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.

1.1K
Active
C#
Databases
CLI Tools
#apache-parquet#big-data#windows-desktop

anzhihe/Free-Web-Books

This is a collection of free web development learning resources across various technologies and frameworks.

1.1K
Archived
JavaScript
Full-Stack Frameworks
Frontend Frameworks
Vue
#web-development#frontend#backend

apache/amoro

Apache Amoro is an open-source Lakehouse management system built on big data formats like Flink, Hudi, and Iceberg.

1.1K
Active
Java
Databases
ETL & Pipelines
Flink
#big-data#data-lake#lakehouse

Teradata/kylo

Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.

1.1K
Archived
Java
ETL & Pipelines
Realtime
#data-lake#hadoop#spark

etcd-io/zetcd

Zetcd is a Go library that provides the Apache Zookeeper API by backing it with an etcd cluster.

1.1K
Archived
Go
API Frameworks
Databases
#zookeeper#etcd#distributed-systems

apache/freemarker

Apache Freemarker is a Java-based template engine that provides a flexible way to generate dynamic content.

1.1K
Stable
Java
Backend Frameworks
#template-engine#java#open-source

mahmoudparsian/data-algorithms-book

This repository provides a comprehensive guide and implementations for data algorithms using MapReduce, Spark, Java, and Scala.

1.1K
Archived
Java
Databases
ETL & Pipelines
Apache Hadoop
#data-algorithms#mapreduce#spark
1...121415

Stay in the loop

Get weekly updates on trending AI coding tools and projects.