Showing 241-260 of 290 projects
Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code.
A Java library for working with PDF files on Android devices.
A Python library that integrates Scikit-learn into the Apache Spark distributed computing framework.
Thriftpy is a lightweight, pure-Python implementation of the Apache Thrift RPC framework.
A Spark accelerator for Apache DataFusion, a SQL query engine written in Rust, aimed at vibe coders.
This is a Hackernews clone built with the Apache Weex framework, likely focused on front-end development.
Apache Accumulo is a scalable and robust key-value store that provides a sparse, sorted, distributed, and persistent multi-dimensional table.
GraphFrames provides DataFrame-based Graphs for Apache Spark, enabling scalable graph analysis and algorithms.
Apache Cordova InAppBrowser Plugin allows developers to open URLs inside their app instead of the default browser.
Docker-compose files for running the Confluent Platform, an Apache Kafka-based event streaming platform.
High-performance Apache Kafka client library for Python developers with low-level and high-level consumer/producer APIs.
Nano is a JavaScript library for CouchDB, a popular NoSQL database, providing a simple API for interacting with it.
APISIX Ingress Controller for Kubernetes, a high-performance, cloud-native API gateway built on top of Apache APISIX.
A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.
This is a collection of free web development learning resources across various technologies and frameworks.
Apache Amoro is an open-source Lakehouse management system built on big data formats like Flink, Hudi, and Iceberg.
Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.
Zetcd is a Go library that provides the Apache Zookeeper API by backing it with an etcd cluster.
Apache Freemarker is a Java-based template engine that provides a flexible way to generate dynamic content.
This repository provides a comprehensive guide and implementations for data algorithms using MapReduce, Spark, Java, and Scala.
Get weekly updates on trending AI coding tools and projects.