Explore Projects

Discover 10 open source projects

Active filters (1):
Search: avroร—
Clear all

Showing 1-10 of 10 projects

apache/avro

Apache Avro is a data serialization system for efficient storage and transmission of structured data.

3.2K
Active
Java
Databases
API Clients & Testing
#data-serialization#serialization-framework#big-data

confluentinc/schema-registry

Confluent Schema Registry for Kafka, a central repository for managing and storing Avro, JSON, and Protobuf schemas.

2.4K
Active
Java
API Frameworks
Databases
Java
#kafka#avro#json

getml/reflect-cpp

A C++20 library for fast serialization, deserialization and validation using reflection, supporting multiple data formats.

1.8K
Active
C++
API Frameworks
ORMs & Query Builders
#serialization#deserialization#validation

capitalone/DataProfiler

A Python library for extracting schema, statistics, and entities from datasets, useful for data profiling and privacy analysis.

1.5K
Stable
Python
ETL & Pipelines
CLI Tools
Python
#data-profiling#data-analysis#privacy

OBenner/data-engineering-interview-questions

This GitHub repository contains over 2,000 data engineering interview questions to help developers prepare.

1.5K
Active
Python
Interview Prep
ETL & Pipelines
#data-engineering#interview-questions#interview-prep

mtth/avsc

An Avro serialization library for JavaScript and TypeScript, used for efficient binary data encoding and schema evolution.

1.4K
Experimental
JavaScript
API Clients & Testing
Databases
JavaScript
#avro#serialization#binary-format

pmacct/pmacct

pmacct is a multi-purpose network monitoring tool for passive data collection and analysis

1.2K
Active
C
API Frameworks
Databases
#networking#monitoring#data-analysis

linkedin/goavro

Goavro is a Go library for encoding and decoding Avro data, a binary serialization format.

1.1K
Active
Go
API Frameworks
Databases
#avro#serialization#data-encoding

bigdatagenomics/adam

ADAM is a genomics analysis platform with specialized file formats built using Apache Spark and Apache Parquet.

1.0K
Experimental
Scala
ETL & Pipelines
API Frameworks
Spark
#bioinformatics#genomics#big-data

deviceinsight/kafkactl

Command line tool for managing Apache Kafka, a popular distributed streaming platform.

1.0K
Active
Go
API Frameworks
CLI Tools
Go
#apache-kafka#streaming#cli

Stay in the loop

Get weekly updates on trending AI coding tools and projects.