BigData and Cassandra

Products Category: Tech & Programming
Product Code: -


1) Introduction of BigData and Cassandra 2) Cassandra fundamentals 3) Installation and setup 4) Cassandra data modeling 5) Cassandra Architecture 6) Deep dive into Cassandra Database 7) Cassandra monitoring and Administration 8) Cassandra analytics, backu




Introduction to Big Data and Cassandra

  • Introduction to Big Data and the problems it causes for traditional database management systems such as RDBMS.
  • How Cassandra solves these Big Data problems.
  • Relational DB vs. NoSQL
  • NoSQL Database and its types
  • CAP Theorem and Eventual Consistency
  • Common characteristics of NoSQL databases
  • Features of Cassandra

Cassandra fundamentals

  • Introduction to Cassandra, key concepts.
  • Column-oriented database
  • Distributed and Decentralized
  • Elastic Scalability
  • High Availability and Fault Tolerance
  • Tunable Consistency
  • Row-Oriented
  • Schema-Free
  • High Performance

 

Installation and setup

  • Single Node Setup
  • Multi-Node Cluster Setup
  • Key Configurations for Cassandra

Cassandra data modeling

  • Introduction to Database Model
  • Difference between RDBMS and Cassandra Data Model
  • Database Elements: Cluster, Keyspace, Column Family/Table, Column
  • Column Family
  • Columns 
  • Query-first approach
  • Hands-on Session

Cassandra Architecture

  • Explain the Architecture of Cassandra
  • Different Layers of Cassandra Architecture
  • Peer-to-Peer Model
  • Logical Data Model: Keyspace, Column Family/Table, Rows, Columns
  • Cluster Ring design and VNodes
  • Partitioning and Snitches
  • Partitioners: Murmur3, Random (MD5) and Byte Ordered
  • Consistent hashing
  • Gossip and Failure Detection
  • Repairs
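
The ring, virtual node, and consistent hashing topics above can be illustrated with a small Python sketch. It is a toy model, not Cassandra's implementation: the `Ring` class, the `token` helper, the node names, and the choice of 8 virtual nodes per node are all assumptions made for illustration, and MD5 is used only because the Random partitioner is MD5-based.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Map a string to a ring position using MD5 (illustrative only)."""
    return int.from_bytes(hashlib.md5(value.encode("utf-8")).digest(), "big")


class Ring:
    """Toy token ring in which each node owns several virtual-node tokens."""

    def __init__(self, nodes, vnodes_per_node=8):
        self.vnodes_per_node = vnodes_per_node
        self._tokens = []  # sorted ring positions
        self._owner = {}   # ring position -> node name
        for node in nodes:
            self.add_node(node)

    def add_node(self, node: str) -> None:
        # Each node contributes several tokens spread around the ring.
        for i in range(self.vnodes_per_node):
            t = token(f"{node}#vnode{i}")
            self._owner[t] = node
            bisect.insort(self._tokens, t)

    def owner(self, partition_key: str) -> str:
        """Return the node holding the first token at or after the key's token."""
        idx = bisect.bisect_left(self._tokens, token(partition_key))
        if idx == len(self._tokens):
            idx = 0  # wrap around the end of the ring
        return self._owner[self._tokens[idx]]


# Hypothetical keys and node names, used only to exercise the sketch.
keys = [f"user-{i}" for i in range(1000)]
ring = Ring(["node-a", "node-b", "node-c"])
before = {k: ring.owner(k) for k in keys}

ring.add_node("node-d")
after = {k: ring.owner(k) for k in keys}
moved = [k for k in keys if before[k] != after[k]]
print(f"{len(moved)} of {len(keys)} keys moved, all to node-d")
```

In this model, adding a node only takes over the token ranges that its new tokens fall into, so every key that changes owner moves to the new node and the rest stay where they were.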

Key Cassandra Elements

  1. Memtable
  2. Commit log
  3. SSTables
  • Compaction fundamentals to reduce SSTable data files
  • Data Replication, Replication Factor
  • Staged Event-Driven Architecture (SEDA) 
  • Managers and Services 
  • Write path and Read path
  • Consistency level
  • Bloom Filters, Tombstones
  • Indexes and Caches
  • Coordinator node
  • Seed nodes
  • Write/Read consistency levels: Any, One, Two, Three, Quorum
  • Routing Client requests
  • Nodetool commands: gossipinfo, cfstats, describering
  • YAML file fundamentals
  • Operations management web GUI
  • Stress testing Cassandra
  • CQL command fundamentals
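
The relationship between replication factor and the read/write consistency levels listed above comes down to simple arithmetic, sketched below in Python. The function names are hypothetical, and the model deliberately leaves out the write-only level ANY, which can be satisfied by a hint rather than a replica. The rule it encodes is the commonly stated one: a read is expected to see the latest write when the replicas read plus the replicas written exceed the replication factor.

```python
# Fixed-count consistency levels; QUORUM depends on the replication factor.
FIXED_LEVELS = {"ONE": 1, "TWO": 2, "THREE": 3}


def quorum(replication_factor: int) -> int:
    """A quorum is a majority of the replicas."""
    return replication_factor // 2 + 1


def replicas_required(level: str, replication_factor: int) -> int:
    """Number of replicas that must respond for the given level."""
    if level == "QUORUM":
        return quorum(replication_factor)
    return FIXED_LEVELS[level]


def reads_see_latest_write(write_level: str, read_level: str,
                           replication_factor: int) -> bool:
    """True when the read and write replica sets must overlap (R + W > RF)."""
    w = replicas_required(write_level, replication_factor)
    r = replicas_required(read_level, replication_factor)
    return r + w > replication_factor


print(quorum(3))                                       # 2
print(reads_see_latest_write("QUORUM", "QUORUM", 3))   # True
print(reads_see_latest_write("ONE", "ONE", 3))         # False
```

With a replication factor of 3, writing and reading at QUORUM touches 2 + 2 replicas, so at least one replica in every read has the latest write, whereas ONE/ONE trades that guarantee for lower latency.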

 

Deep dive into Cassandra Database

  • Replication Factor
  • Replication Strategy
  • Defining columns and data types
  • Defining a partition key
  • Recognizing a partition key
  • Specifying a descending clustering order
  • Updating data
  • Tombstones
  • Deleting data
  • Using TTL
  • Updating a TTL
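
The interplay of updates, deletes, tombstones, and TTL covered above can be previewed with a toy Python model. It is a sketch under stated assumptions, not how Cassandra stores data: `ToyTable` and `Cell` are hypothetical names, timestamps are plain integers supplied by the caller, and the model keeps only one cell per key. It illustrates three ideas: the write with the newest timestamp wins, a delete is itself a write (a tombstone), and a TTL is changed by writing the value again with a new TTL.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Cell:
    value: Optional[str]        # None marks a tombstone
    written_at: int             # write timestamp in seconds
    ttl: Optional[int] = None   # seconds to live; None means no expiry


class ToyTable:
    """Single-cell-per-key store with last-write-wins, tombstones, and TTL."""

    def __init__(self):
        self._cells = {}

    def _apply(self, key: str, cell: Cell) -> None:
        # Keep whichever cell carries the newest timestamp.
        current = self._cells.get(key)
        if current is None or cell.written_at >= current.written_at:
            self._cells[key] = cell

    def write(self, key: str, value: str, now: int,
              ttl: Optional[int] = None) -> None:
        self._apply(key, Cell(value, now, ttl))

    def delete(self, key: str, now: int) -> None:
        # A delete records a tombstone instead of removing data in place.
        self._apply(key, Cell(None, now))

    def read(self, key: str, now: int) -> Optional[str]:
        cell = self._cells.get(key)
        if cell is None or cell.value is None:
            return None  # never written, or shadowed by a tombstone
        if cell.ttl is not None and now >= cell.written_at + cell.ttl:
            return None  # expired cells read as if deleted
        return cell.value


table = ToyTable()
table.write("session:42", "alice", now=0, ttl=60)
print(table.read("session:42", now=30))    # alice
table.write("session:42", "alice", now=30, ttl=120)  # rewrite to extend the TTL
print(table.read("session:42", now=100))   # alice
table.delete("session:42", now=200)
table.write("session:42", "bob", now=150)  # older timestamp loses to the tombstone
print(table.read("session:42", now=210))   # None
```

The last three lines show why tombstones matter: a write that arrives late but carries an older timestamp cannot resurrect data that was deleted afterwards.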

 

Cassandra monitoring and Administration

  • Cassandra monitoring tools
    1. Logging 
    2. Tailing 
    3. Using Nodetool Utility
    4. Using JConsole
    5. Learning about OpsCenter
    6. Runtime Analysis Tools

 

  • Hands-on with the following tools:
    1. JMX and JConsole
    2. OpsCenter

 

Cassandra analytics, backup, restore and performance tuning

  • Creating a Snapshot
    1. Restoring from a Snapshot
    2. RAM and CPU recommendations
    3. Hardware choices
    4. Selecting storage
    5. Types of Storage to Avoid
    6. Cluster connectivity, security and the factors that affect distributed system performance
    7. End-to-end performance tuning of Cassandra clusters against very large data sets
    8. Load balance and streams

 

  • Hands-on with the following tools:
    1. Creating Snapshots
    2. Integration with Kafka
    3. Integration with Spark
    4. Cassandra Hadoop Integration

 

 


Price: 25,000.00 INR