Kafka
Overview
This course introduces participants to development with Apache Kafka, an open source stream processing platform. It also covers integrating Kafka with Big Data tools, including HDFS, MapReduce, Zookeeper, Storm, and Spark. Objectives of this program:
- Participants will understand Big Data and Real-Time Big Data
- Participants will understand Apache Hadoop
- Participants will learn Zookeeper installation and configuration
- Participants will understand Kafka architecture
Duration
2 Days
Pre-Requisites
Participants must have a basic understanding of Big Data tools.
Course Outline
- Introduction: Big Data & Real-Time Big Data
- 3Vs: Big Data
- Data evolution & characteristics: Big Data Industry Examples
- Apache Hadoop: HDFS & MapReduce
- Real-time Big Data analysis: Storm & Spark
- Introduction: Zookeeper
- Introducing Zookeeper
- Zookeeper Recipes
- Zookeeper installation and configuration
- Demo 1 – Installing Ubuntu VM & connecting with PuTTY
- Kafka Introduction & Architecture
- Introducing Kafka
- Kafka Architecture
- Message Queues
- Publish subscribe
- Kafka partitions
- Kafka Installation, Configuration & Data model
- Kafka Installation
- Kafka configuration
- Kafka cluster setup
- Starting Kafka
- Demo 2 – Installing & configuring Zookeeper and Kafka
- Kafka interfaces
- Java interface to Kafka
- Compiling a Java program for Kafka
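
As a rough preview of the publish-subscribe and partition topics in the outline, the following is a minimal sketch in plain Java. It is not the Kafka client API: `ToyTopic`, `partitionFor`, `publish`, and `poll` are hypothetical names invented for this illustration, and the key-to-partition mapping uses `String.hashCode()` as a simplification rather than Kafka's own partitioner. It only shows the general idea that messages with the same key land in the same partition, and that each consumer group reads a partition independently by tracking its own offset.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Toy in-memory topic. Illustration only, NOT the Kafka client API. */
class ToyTopic {
    // Each partition is an append-only list of messages.
    private final List<List<String>> partitions = new ArrayList<>();
    // Each consumer group keeps its own next-offset per partition.
    private final Map<String, int[]> groupOffsets = new HashMap<>();

    ToyTopic(int partitionCount) {
        for (int i = 0; i < partitionCount; i++) {
            partitions.add(new ArrayList<>());
        }
    }

    /** Maps a key to a partition (assumption: simple hash, not Kafka's partitioner). */
    int partitionFor(String key) {
        return Math.floorMod(key.hashCode(), partitions.size());
    }

    /** Appends a message to the key's partition and returns its offset there. */
    long publish(String key, String value) {
        List<String> log = partitions.get(partitionFor(key));
        log.add(value);
        return log.size() - 1;
    }

    /** Returns the messages this group has not yet read from the partition, then advances its offset. */
    List<String> poll(String group, int partition) {
        int[] offsets = groupOffsets.computeIfAbsent(group, g -> new int[partitions.size()]);
        List<String> log = partitions.get(partition);
        List<String> batch = new ArrayList<>(log.subList(offsets[partition], log.size()));
        offsets[partition] = log.size();
        return batch;
    }
}

class ToyTopicDemo {
    public static void main(String[] args) {
        ToyTopic topic = new ToyTopic(3);
        topic.publish("user-1", "login");
        topic.publish("user-1", "click");
        int p = topic.partitionFor("user-1");

        // Two groups each receive every message; a second poll by the same group returns nothing new.
        System.out.println("billing read:   " + topic.poll("billing", p));
        System.out.println("analytics read: " + topic.poll("analytics", p));
        System.out.println("billing again:  " + topic.poll("billing", p));
    }
}
```

In this sketch both groups see `login` followed by `click`, which mirrors publish-subscribe delivery, while the per-group offset is what distinguishes this model from a classic message queue where a message is consumed once.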
