Found 45 repositories(showing 30)
moritzkoerber
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via Github Actions.
ONSdigital
Data engineering pipeline for the household COVID-19 Infection Survey (CIS)
tangybluff
Scalable data engineering pipeline built for clinical data. Uses Terraform, DLT, GCP (GCS + BigQuery), dbt, and Dagster to ingest, transform, and orchestrate COVID-19 patient data for analytics, visualization, and future ML applications.
ROSHANFAREED
End-to-end Azure Data Engineering pipeline using ADF, Databricks (PySpark), ADLS Gen2, Azure SQL, and Power BI for COVID-19 analytics
SHREYAS-SHETTY-KR
A comprehensive exploration of constructing an Extract, Transform, Load (ETL) pipeline in the AWS Cloud, with a focus on leveraging AWS COVID-19 dataset to showcase best practices in the field of data engineering.
This repository consist of a project to build an ETL pipeline for a data lake hosted on S3 using Spark to assess the impact of covid-19 in the stock market. This project is for Udacity's Data Engineering Nanodegree.
JinXuan-Wong
End-to-end real-time data engineering pipeline using Kafka, Spark, HDFS, MongoDB, and Neo4j for COVID-19 outbreak monitoring and analytics.
lpillmann
Data Pipeline for COVID-19 Vaccination in Brazil | Udacity Data Engineering Nanodegree Capstone Project
amine-akrout
Capstone Project for Udacity's Data Engineering Nanodegree : End-to-end data pipeline to analyze covid-19 effect on airbnb
SidEnigma
Reporting of COVID-19 cases, deaths, hospital and ICU occupancies till week 47 of 2023 using a data engineering pipeline in Azure, visualized in PowerBI
COVID-19 Data Engineering Pipeline
soswal2506
No description available
ctwangwang
No description available
dataguru615
No description available
No description available
macmichael-analytics
No description available
chandrahaasreddyk
COVID-19 ETL Pipeline - Data Engineering Portfolio Project
To analyze COVID-19 datasets using AWS services to identify patterns, trends, and potential risk factors through a full-fledged data engineering pipeline, and visualize insights via BI tools like Tableau, Power BI, or Looker Studio.
sidc3784-dev
End-to-end COVID-19 Data Engineering ETL Pipeline using Python,Pandas,SQLite, and Matplotlib.
shruti8767
I’m excited to share my data engineering project — “Covid-19 Data Engineering Pipeline using Azure Data Factory.” This project focuses on building a scalable, automated, and cloud-based data pipeline to process, transform, and analyze Covid-19 data efficiently.
REinstall-SyS
This is an end to end data engineering pipeline that extracts, processes and visually displays COVID-19 data.
ROSI9979
Production-ready COVID-19 data analysis ETL pipeline with Azure integration and Tableau visualization - Python, Pandas, Data Engineering
MatthewLawrencel
A data engineering pipeline built with Python, Pandas, SQLAlchemy, PostgreSQL, and Matplotlib to process, store, and visualize COVID-19 case data.
zeeyaad
COVID-19 Epidemiology Data Engineering & Analysis Project — A data engineering pipeline that cleans and processes global COVID-19 epidemiological data, engineers key health indicators, and produces an analysis-ready dataset for exploratory analysis of testing, recovery, and mortality trends.
godhaniripal
this is data engineering project which covers the topic ETL pipeline creation of the COVID-19 cased accross the globe
anamikarpp
Data Engineering Project – Azure Data Factory COVID-19 Pipeline Built and deployed end-to-end ETL pipelines in Azure Data Factory to ingest, transform, and store COVID-19 data. Automated data load from external sources to Azure Blob and Azure SQL DB. Exported and documented pipelines in GitHub.
KChand1
🌍 Build an end-to-end Azure Data Engineering pipeline to ingest, transform, and analyze COVID-19 data for effective reporting and insights.
Gajoshana2910
The COVID-19 Data Pipeline is an end-to-end Data Engineering ETL (Extract, Transform, Load) pipeline designed to automate the collection, processing, and storage of COVID-19 data from external sources. The pipeline is built using Python, Apache Airflow, PostgreSQL, and Docker to ensure scalability, automation, and ease of deployment.
mwandikikepha
A production-ready dbt data pipeline analyzing COVID-19 data across African countries. Features automated scheduling, 4-layer transformations, BigQuery integration, and professional documentation. Demonstrates enterprise data engineering practices.
Avi-k-dua
COVID 19 Data Pipeline - Azure Data Engineering: A complete end-to-end ETL pipeline to fetch COVID19 daily and weekly data from API, transform and load to SQL database in Azure using Data Factory, Databricks, Key-Vaults and Data Lake