The Cassandra (C*) database is a massively scalable NoSQL database that provides high availability and fault tolerance, as well as linear scalability when adding new nodes to a cluster. It has many powerful capabilities, such as tunable and eventual consistency, that allow it to meet the needs of modern applications, but also introduce a new paradigm for data modeling that many organizations do not have the expertise to use in the best way. Introduction to Cassandra is a hands-on course designed to teach attendees the basics of how to create good data models with Cassandra.

starstarstarstarstar_outline

* Actual course outline may vary depending on offering center. Contact your sales representative for more information.

Learning Objectives

The goal of this course is to enable technical students new to Cassandra to begin working with Cassandra in an optimal manner. Throughout the course students will learn to:
Understand the Big Data needs that C* addresses
Be familiar with the operation and structure of C*
Be able to install and set up a C* database
Use the C* tools, including cqlsh, nodetool, and ccm (Cassandra Cluster Manager)
Be familiar with the C* architecture, and how a C* cluster is structured
Understand how data is distributed and replicated in a C* cluster
Understand core C* data modeling concepts, and use them to create well-structured data models
Be familiar with the C* eventual consistency model and use it intelligently
Be familiar with consistency mechanisms such as read repair and hinted handoff
Understand and use CQL to create tables and query for data
Know and use the CQL data types (numerical, textual, uuid, etc.)
Be familiar with the various kinds of primary keys available (simple, compound, and composite primary keys)
Be familiar with the C* write and read paths
Understand C* deletion and compaction

1
  • SESSION 1: CASSANDRA OVERVIEW

  • Why We Need Cassandra - Big Data Challenges vs RDBMS

    High level Cassandra Overview

    Cassandra Features

    Optional: Basic Cassandra Installation and Configuration


2
  • SESSION 2: CASSANDRA ARCHITECTURE AND CQL OVERVIEW

  • Cassandra Architecture Overview

    Cassandra Clusters and Rings

    Nodes and Virtual Nodes

    Data Replication in Cassandra

    Introduction to CQL

    Defining Tables with a Single Primary Key

    Using cqlsh for Interactive Querying

    Selecting and Inserting/Upserting Data with CQL

    Data Replication and Distribution

    Basic Data Types (including uuid, timeuuid)


3
  • SESSION 3: DATA MODELING AND CQL CORE CONCEPTS

  • Defining a Compound Primary Key

    CQL for Compound Primary Keys

    Partition Keys and Data Distribution

    Clustering Columns

    Overview of Internal Data Organization

    Overview of Other Querying Capabilities

    ORDER BY, CLUSTERING ORDER BY, UPDATE , DELETE, ALLOW FILTERING

    Batch Queries

    Data Modeling Guidelines

    Denormalization

    Data Modeling Workflow

    Data Modeling Principles

    Primary Key Considerations

    Composite Partition Keys

    Defining with CQL

    Data Distribution with Composite Partition Key

    Overview of Internal Data Organization


4
  • SESSION 4: ADDITIONAL CQL CAPABILITIES

  • Indexing

    Primary/Partition Keys and Pagination with token()

    Secondary Indexes and Usage Guidelines

    Cassandra collections

    Collection Structure and Uses

    Defining and Querying Collections (set, list, and map)

    Materialized View

    Overview

    Usage Guidelines


5
  • SESSION 5: DATA CONSISTENCY IN CASSANDRA

  • Overview of Consistency in Cassandra

    CAP Theorem

    Eventual (Tunable) Consistency in C* - ONE, QUORUM, ALL

    Choosing CL ONE

    Choosing CL QUORUM

    Achieving Immediate Consistency

    Overview of Other Consistency Levels

    Supportive Consistency Mechanisms

    Writing / Hinted Handoff

    Read Repair

    Nodetool repair


6
  • SESSION 6: INTERNAL MECHANISMS

  • Ring Details

    Partitioners

    Gossip Protocol

    Snitches

    Write Path

    Overview / Commit Log

    Memtables and SSTables

    Write Failure

    Unavailable Nodes and Node Failure

    Requirements for Write Operations

    Read Path Overview

    Read Mechanism

    Replication and Caching

    Deletion/Compaction Overview

    Delete Mechanism

    Tombstones and Compaction


7
  • SESSION 7: WORKING WITH INTELLIJ

  • Configuring JDBC Data Source for Cassandra

    Reading Schema Information

    Querying and Editing Tables.


Audience

Language

English

Prerequisites

Attendees should have incoming experience with and knowledge of SQL. Some familiarity with distributed systems is also helpful.

$1,995

Length: 3.0 days (24 hours)

Level:

Not Your Location? Change

Course Schedule:

Schedule select
11
Dec
Monday
10:00 AM ET -
6:00 PM ET
Filling Fast
Available
Loading...