Cassandra

Overview

In this Cassandra training course we will provide participants with a summary of the fundamentals of Big Data and NoSQL databases, Cassandra’s role in the Hadoop Big Data ecosystem, understanding of Cassandra & its features, architecture & data model, show participants how to do installation, configuration and monitor Cassandra.
Duration
3 Days

Pre-Requisites
Knowledge on Core Java and Basic understanding on Linux OS (Ubuntu)

Course Outline

  • Introducing Cassandra
  • Understanding what Cassandra is?
  • Learning what Cassandra is used for?
  • CAP Theorem
  • Cluster Architecture
  • Eventual Consistency
  • Understanding System Requirements
  • Understanding our lab
  • Understanding Cassandra as Distributed DB
  • Snitch
  • Gossip
  • Learning How Data gets distributed
  • Replication
  • Virtual Nodes
  • Downloading Cassandra
  • Java
  • Understanding Cassandra configuration files
  • Cassandra foreground and background mode
  • Checking Cassandra Status
  • Accessing and understanding of Log Structure
  • Using CQLSH
  • Creating a Database
  • Defining a Key space
  • Deleting a Key space
  • Creating a Table
  • Defining Columns and Datatypes
  • Defining Primary Key
  • Recognizing a Partition Key
  • Specifying a descending cluster order
  • Understanding ways to write data
  • Using INSERT INTO command
  • Using COPY command
  • How data is stored in Cassandra?
  • How data is stored in Disk?
  • Understanding Data model
  • Understanding where clause criteria in Cassandra
  • Loading Bulk Data
  • JSON format Import and Export
  • Using Primary Index
  • Creating a Secondary Index
  • Defining a Composite Partition Key
  • Understanding Cassandra Drivers
  • Exploring the Datastax Java Driver
  • Setting up Eclipse Environment
  • Creating an Application Webpage
  • Acquiring Java Driver Files
  • Understanding Packaging using Maven
  • Understanding Packaging using Manual Methods
  • Connecting to Cassandra Cluster using Webpage
  • Executing a Query using Webpage
  • Using MVC Pattern Example
  • Understanding Cluster Class for load balancing & DC aware policy
  • Creating a long-lived Session object
  • Batch Operations in Cassandra
  • Understanding Token Aware Policy and DCA ware Round Robin Policy
  • Updating Data
  • Understanding How updating Works
  • Deleting Data
  • Understanding the role of Tombstones
  • Using TT
  • Understanding Hardware Choices for production
  • Understanding RAM and CPU Recommendations
  • Things to be considered while storage selection
  • Things to be considered while Deploying in Cloud
  • Understanding Cassandra Nodes
  • Network Connection Setup
  • Specifying Seed Nodes
  • Bootstrapping a node
  • Cleaning up a node
  • Using Cassandra-stress for stress testing cluster
  • Understanding Cassandra Monitoring Tools
  • Using Node tool
  • Using Jconsole
  • Learning about OpsCenter
  • Understanding Repair
  • Repairing Nodes
  • Understanding Consistency
  • Understanding Hinted Handoff
  • Understanding Read Repair
  • Using Commit Log Feature for Performance Tuning
  • Using Restore Methods
  • Storage Strategies and OS tuning
  • JVM Tuning
  • Caching Strategies
  • Compaction and Compression
  • Stress Testing Strategies