Data collection and storage

As briefly explained in Overview of the Project, we will use the Twitter4j library to collect Twitter’s streaming data, publish this data to Kafka, and then save it in a Cassandra database. The details about how to publish Twitter’s data to Kafka can

Real-time Twitter Analysis

In this tutorial, we will use the Twitter Streaming API to get tweets in real time and publish them to a Kafka topic. We then use Spark as a Consumer to read the Twitter data from Kafka and analyse it. To

Publishing Twitter’s data to a Kafka topic

In the previous tutorial (A basic Kafka program), we wrote two Java classes: SimpleProducer and SimpleConsumer. SimpleProducer acts as a Kafka Producer that sends messages (the numbers 0 to 9) to the ‘test’ topic, and SimpleConsumer is a Kafka Consumer that