Back to search
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Stars
322
Forks
146
Watchers
322
Open Issues
5
Overall repository health assessment
4
commits
1
commits
No package.json found
This might not be a Node.js project