Found 5 repositories(showing 5)
A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill using Docker and Cassandra (NoSQL DB) for storage; This allows for for fast feature engineering and data cleaning.
Theeo04
No description available
No description available
No description available
linshimiao
This tutorial uses Java API to communicates with Cassandra DB and performs various CRUD operations.
All 5 repositories loaded