Explore Projects

Discover 299 open source projects

Active filters (1):
Search: pipeline

Showing 221-240 of 299 projects

fendevel/Guide-to-Modern-OpenGL-Functions

A guide to using modern OpenGL functions, including Direct State Access (DSA) and other advanced features.

1.4K
Archived
Frontend Frameworks
CLI Tools
#opengl#modern-opengl#direct-state-access

astronomer/dag-factory

Declaratively construct Apache Airflow DAGs with YAML configuration files, simplifying complex data pipeline management.

1.4K
Active
Python
API Frameworks
ETL & Pipelines
Python
#airflow#data-pipelines#etl

sakura-editor/sakura

SAKURA Editor is a Japanese text editor for Windows, with features like regex, macros, and grep.

1.4K
Active
C++
IDE Extensions
Frontend Frameworks
#text-editor#regex#macro

griffithlab/rnaseq_tutorial

An educational resource for learning RNA-seq analysis including cloud computing, data formats, and visualization.

1.4K
Archived
R
Tutorials & Courses
R
#bioinformatics#rna-seq#data-analysis

alelievr/HDRP-Custom-Passes

A collection of custom render passes for Unity's High-Definition Render Pipeline (HDRP).

1.4K
Archived
C#
Unity HDRP
Unity
#unity#hdrp#rendering

GoogleCloudPlatform/data-science-on-gcp

A repository providing data science tools and examples for the Google Cloud Platform.

1.4K
Stable
Jupyter Notebook
React
#data-science#cloud-computing#google-cloud

explosion/spacy-transformers

Use pre-trained transformer models like BERT, GPT-2, and XLNet in the spaCy NLP library.

1.4K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
spaCy
#natural-language-processing#transformers#bert

ReactiveX/IxJS

IxJS implements the Interactive Extensions for JavaScript, allowing for declarative data processing pipelines.

1.4K
Experimental
TypeScript
General Utilities
Backend Frameworks
React
#reactive#data-processing#streams

sylefeb/Silice

Silice is a powerful hardware description language that simplifies designing hardware algorithms with parallelism and pipelines.

1.4K
Active
C++
Arduino & Embedded
CLI Tools
#fpga#hardware-design#parallel-processing

fmind/mlops-python-package

Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

1.4K
Active
Jupyter Notebook
MLOps
CLI Tools
Python
#mlops#data-pipelines#automation

toluaina/pgsync

A Python library that syncs data from Postgres to Elasticsearch/OpenSearch, enabling real-time data pipelines.

1.4K
Active
Python
ETL & Pipelines
Realtime
Python
#change-data-capture#elasticsearch-sync#postgresql

damklis/DataEngineeringProject

An end-to-end data engineering project example showcasing tools and technologies for building data pipelines.

1.4K
Archived
Python
ETL & Pipelines
API Frameworks
Django
#data-engineering#data-pipeline#etl

opendatadiscovery/odd-platform

First open-source data discovery and observability platform for data practitioners.

1.4K
Active
Java
Data Discovery
Data Observability
#data-catalog#data-engineering#data-governance

koaning/scikit-lego

Extra building blocks for pipelines in scikit-learn, a popular machine learning library for Python.

1.4K
Active
Python
MLOps
Backend Frameworks
Python
#machine-learning#scikit-learn#pipelines

aio-libs/aiokafka

An asyncio client for Apache Kafka, a distributed streaming platform for building real-time data pipelines and streaming apps.

1.4K
Active
Python
Realtime
Caching
#kafka#streaming#data-pipelines

explosion/spacy-llm

Integrates Large Language Models (LLMs) into structured NLP pipelines using spaCy.

1.4K
Archived
Python
Prompt Engineering
spaCy
#LLM#NLP#Spacy

rockingdingo/deepnlp

A deep learning NLP pipeline implemented on TensorFlow for natural language processing tasks.

1.4K
Archived
Python
LLM Frameworks
API Frameworks
TensorFlow
#nlp#deep-learning#tensorflow

gtoonstra/etl-with-airflow

This repository provides best practices and examples for building ETL (Extract, Transform, Load) pipelines using Apache Airflow.

1.4K
Archived
Shell
ETL & Pipelines
#etl#airflow#data-pipelines

amphi-ai/amphi-etl

A visual data preparation tool powered by Python, designed for data analysis and ETL tasks.

1.4K
Active
TypeScript
ETL & Pipelines
Data Analysis
TypeScript
#data-analysis#data-pipelines#data-transformation

apache/hop

Hop is a flexible and extensible open-source data integration platform for building and orchestrating ETL and streaming pipelines.

1.3K
Active
Java
ETL & Pipelines
#data-integration#etl#orchestration
