Apache Storm
Overview
-
- Participants will be able to have complete knowledge of the fundamental concepts and also, know the architecture of Apache Storm, they will also Plan installation and configuration with Apache Storm
- Participants will be able to understand concepts like Ingesting & processing of real-time events with Storm and also Understand the fundamentals of Trident extension to Apache Storm
- Participants will be able to get knowledge thorough understanding of Grouping & Data Insertion in Apache Storm as well as understand the fundamentals of Storm Interfaces with Kafka, Cassandra and Java
3 Days
Pre-Requisites
Participants should have basic knowledge of programming languages.
Course Outline
- Hadoop Distributed Computing
- Legacy Architecture of Real-Time System
- Difference b/w Storm and Hadoop
- The fundamental concept of storm
- Storm Development Environment
- Real Life Storm Project
- Storm Architecture
- Logical Dynamic and Components in Storm
- Topology in Storm
- Storm Execution Components
- Stream Grouping
- Tuple
- Spout
- Reliable versus Unreliable Messages
- Getting Data
- Direct connection
- Enqueued Messages
- DRPC
- Bolt Lifecycle
- Bolt Structure
- Bolt-normalization bolt
- Reliable versus Unreliable Bolts
- Multiple Streams
- Multiple Anchoring
- Using IBasicBolt to Ack Automatically
- Hands-On:
- Creating Storm project in eclipse
- Running Storm bolt and spouts
- Running twitter example using Storm
- Reliable and unreliable messaging
- How to get Data thorugh Direct connection and Enqueued message
- Life cycle of bolt
- Fields Grouping
- All Grouping
- Custom Grouping
- Direct Grouping
- Global Grouping
- None Grouping
- Hands-On:
- Using different grouping techniques in Storm topologies
- Basic Kafka Concepts
- Kafka vs Other Messaging Systems
- An Inside Look at Kafka’s Components
- Use Cases
- Quick Start
- Ecosystem
- Upgrading
- Producer API
- Consumer API
- Streams API
- Connect API
- AdminClient API
- Broker Configs
- Topic Configs
- Producer Configs
- Consumer Configs
- New Consumer Configs
- Motivation
- Persistence
- Efficiency
- The Producer
- The Consumer
- Message Delivery Semantics
- Replication
- Log Compaction
- Kafka Callback and Acknowledgments
- Use of batches in Kafka producer
- Consumer groups
- Network Layer
- Messages
- Message format
- Log
- Distribution
- What is Trident
- Trident Spouts
- Types of Trident Spouts
- Trident Spout components
- Trident spout Interface
- Trident filter, function & Aggregator
- Hands-On:
- Implementing Trident Spouts and Bolts
- Transactional Topologies
- Partitioned Transactional Spouts
- Opaque Transactional Topologies
- Hands-On:
- Implementing transactional system using Transactional topologies
