Cassandra
Overview
In this Cassandra training course we will provide participants with a summary of the fundamentals of Big Data and NoSQL databases, Cassandra’s role in the Hadoop Big Data ecosystem, understanding of Cassandra & its features, architecture & data model, show participants how to do installation, configuration and monitor Cassandra.3 Days
Pre-Requisites
Knowledge on Core Java and Basic understanding on Linux OS (Ubuntu)
Course Outline
- Introducing Cassandra
- Understanding what Cassandra is?
- Learning what Cassandra is used for?
- CAP Theorem
- Cluster Architecture
- Eventual Consistency
- Understanding System Requirements
- Understanding our lab
- Understanding Cassandra as Distributed DB
- Snitch
- Gossip
- Learning How Data gets distributed
- Replication
- Virtual Nodes
- Downloading Cassandra
- Java
- Understanding Cassandra configuration files
- Cassandra foreground and background mode
- Checking Cassandra Status
- Accessing and understanding of Log Structure
- Using CQLSH
- Creating a Database
- Defining a Key space
- Deleting a Key space
- Creating a Table
- Defining Columns and Datatypes
- Defining Primary Key
- Recognizing a Partition Key
- Specifying a descending cluster order
- Understanding ways to write data
- Using INSERT INTO command
- Using COPY command
- How data is stored in Cassandra?
- How data is stored in Disk?
- Understanding Data model
- Understanding where clause criteria in Cassandra
- Loading Bulk Data
- JSON format Import and Export
- Using Primary Index
- Creating a Secondary Index
- Defining a Composite Partition Key
- Understanding Cassandra Drivers
- Exploring the Datastax Java Driver
- Setting up Eclipse Environment
- Creating an Application Webpage
- Acquiring Java Driver Files
- Understanding Packaging using Maven
- Understanding Packaging using Manual Methods
- Connecting to Cassandra Cluster using Webpage
- Executing a Query using Webpage
- Using MVC Pattern Example
- Understanding Cluster Class for load balancing & DC aware policy
- Creating a long-lived Session object
- Batch Operations in Cassandra
- Understanding Token Aware Policy and DCA ware Round Robin Policy
- Updating Data
- Understanding How updating Works
- Deleting Data
- Understanding the role of Tombstones
- Using TT
- Understanding Hardware Choices for production
- Understanding RAM and CPU Recommendations
- Things to be considered while storage selection
- Things to be considered while Deploying in Cloud
- Understanding Cassandra Nodes
- Network Connection Setup
- Specifying Seed Nodes
- Bootstrapping a node
- Cleaning up a node
- Using Cassandra-stress for stress testing cluster
- Understanding Cassandra Monitoring Tools
- Using Node tool
- Using Jconsole
- Learning about OpsCenter
- Understanding Repair
- Repairing Nodes
- Understanding Consistency
- Understanding Hinted Handoff
- Understanding Read Repair
- Using Commit Log Feature for Performance Tuning
- Using Restore Methods
- Storage Strategies and OS tuning
- JVM Tuning
- Caching Strategies
- Compaction and Compression
- Stress Testing Strategies
