Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
Stars
71
Forks
13
Watchers
71
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
23
commits
updated architecture & added notebook version of databricks archive
503ca85View on GitHubadded markdowns, updated data generation, spark streaming script
b8233deView on GitHubadded few markdowns, updated terraform script & kafka docker file
2760a27View on GitHub