A Python scraper for public data from the EU and other parliament websites.
Democratizing internet-scale financial data for developers through natural language processing.
A fast and versatile ETL tool that transfers data seamlessly between RDBMS and NoSQL databases.
A Python script that automatically downloads books from Z-Library and uploads them to Google NotebookLM.
A Python library that syncs data from Postgres to Elasticsearch/OpenSearch, enabling real-time data pipelines.
An end-to-end data engineering project example showcasing tools and technologies for building data pipelines.
This is a book that teaches how to use Apache Spark for lightning-fast data analytics.
Effortlessly scrape data from websites using machine learning and HTML examples with this Python library.
An open-source financial data extraction tool that provides easy API access to data scraped from various websites.
Quilt is a data mesh for connecting people with actionable data, built with TypeScript.
This repository provides best practices and examples for building ETL (Extract, Transform, Load) pipelines using Apache Airflow.
A web scraping and data collection tool for the Chinese social media platform Xiaohongshu (Little Red Book).
A visual data preparation tool powered by Python, designed for data analysis and ETL tasks.
Hop is a flexible and extensible open-source data integration platform for building and orchestrating ETL and streaming pipelines.
PDAL is a C++ library for processing point cloud data, similar to GDAL for raster data.
An R-focused pipeline toolkit for reproducibility and high-performance computing.
A Go daemon that syncs MongoDB to Elasticsearch in real time for search-powered applications.
A getting-started guide to Singer, a data integration framework for ETL and data analysis.
A Python package for handling messy CSV files with improved dialect detection and a command-line interface.
Dex is a powerful data visualization tool that enables data exploration and publishing of web visualizations.