Implementing the Speed Layer of Lambda Architecture

In this post, we will implement the third part of our system, which is the Speed Layer of Lambda architecture. We will use Spark Structured Streaming to read data from Kafka’s “TwitterStreaming” topic and analyze this data in real time.

Implementing the Batch Layer of Lambda Architecture

In the previous post, we implemented the first part of our system (Data collection and storage). In this post, we will implement the second part of our system, which is Batch Layer of Lambda Architecture. The Batch Processing of our

Connecting Spark to Cassandra

In this tutorial, we will write a program that use Spark Cassandra Connector to connect Apache Spark to Cassandra. First, we add the following dependencies to the pom.xml file:

Spark Cassandra Connector allows users using both RDD and DataFrame