Found 49,556 repositories (showing 30)
apache
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
argoproj
Workflow Engine for Kubernetes
windmill-labs
Open-source developer platform to power your entire infra and turn scripts into webhooks, workflows and UIs. Fastest workflow engine (13x vs Airflow). Open-source alternative to Retool and Temporal.
apache
Apache DolphinScheduler is the modern data orchestration platform. Agile creation of high-performance workflows with low code
orchest
Build data pipelines, the easy way 🛠️
jghoman
Curated list of resources about Apache Airflow
puckel
Docker Apache Airflow
WeBankFinTech
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
mara
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
elyra-ai
Elyra extends JupyterLab with an AI-centric approach.
A few projects related to data engineering, including data modeling, infrastructure setup on cloud, data warehousing, and data lake development.
teamclairvoyant
A series of DAGs/Workflows to help maintain the operation of Airflow
More than 2,000 data engineer interview questions.
san089
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
astronomer
Construct Apache Airflow DAGs Declaratively via YAML configuration files
damklis
Example end to end data engineering project.
gtoonstra
ETL best practices with airflow, with examples
astronomer
Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code
abhishek-ch
A Data Engineering & Machine Learning Knowledge Hub
alanchn31
Personal Data Engineering Projects
tuanavu
Apache Airflow tutorial
iusztinpaul
The Full Stack 7-Steps MLOps Framework | Learn MLE & MLOps for free by designing, building and deploying an end-to-end ML batch system ~ source code + 2.5 hours of reading & video materials
couler-proj
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
ankurchavda
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Code for Data Pipelines with Apache Airflow
This repository provides a command line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) environment locally.
raystack
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
airflow-helm
The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-ready deployments of Airflow on Kubernetes.
mumoshu
A docker image and kubernetes config files to run Airflow on Kubernetes
josephmachado
Beginner data engineering project - batch edition