Cloudera University’s four-day training course for Apache HBase enables participants to store and access massive quantities of multi-structured data and perform hundreds of thousands of operations per second.

starstarstarstarstar_half

* Actual course outline may vary depending on offering center. Contact your sales representative for more information.

Learning Objectives

Skills learned on the course include:
The use cases and usage occasions for HBase, Hadoop, and RDBMS
Using the HBase shell to directly manipulate HBase tables
Designing optimal HBase schemas for efficient data storage and recovery
How to connect to HBase using the Java API, configure the HBase cluster, and administer an HBase cluster
Best practices for identifying and resolving performance bottlenecks

1
  • Introduction to Hadoop & HBase

  • What Is Big Data?
    Introducing Hadoop
    Hadoop Components
    What Is HBase?
    Why Use HBase?
    Strengths of HBase
    HBase in Production
    Weaknesses of HBase

2
  • HBase Tables

  • HBase Concepts
    HBase Table Fundamentals
    Thinking About Table Design

3
  • The HBase Shell

  • Creating Tables with the HBase Shell
    Working with Tables
    Working with Table Data

4
  • HBase Architecture Fundamentals

  • HBase Regions
    HBase Cluster Architecture
    HBase and HDFS Data Locality

5
  • HBase Schema Design

  • General Design Considerations
    Application-Centric Design
    Designing HBase Row Keys
    Other HBase Table Features

6
  • Basic Data Access with the HBase API

  • Options to Access HBase Data
    Creating and Deleting HBase Tables
    Retrieving Data with Get
    Retrieving Data with Scan
    Inserting and Updating Data
    Deleting Data

7
  • More Advanced HBase API Features

  • Filtering Scans
    Best Practices
    HBase Coprocessors

8
  • HBase on the Cluster

  • How HBase Uses HDFS
    Compactions and Splits

9
  • HBase Reads & Writes

  • How HBase Writes Data
    How HBase Reads Data
    Block Caches for Reading

10
  • HBase Performance Tuning

  • Column Family Considerations
    Schema Design Considerations
    Configuring for Caching
    Dealing with Time Series and Sequential Data
    Pre-Splitting Regions

11
  • HBase Administration and Cluster Management

  • HBase Daemons
    ZooKeeper Considerations
    HBase High Availability
    Using the HBase Balancer
    Fixing Tables with hbck
    HBase Security

12
  • HBase Replication & Backup

  • HBase Replication
    HBase Backup
    MapReduce and HBase Clusters

13
  • Using Hive & Impala with HBase

  • Using Hive and Impala with HBase

14
  • Appendix A: Accessing Data with Python and Thrift

  • Thrift Usage
    Working with Tables
    Getting and Putting Data
    Scanning Data
    Deleting Data
    Counters
    Filters

15
  • Appendix B: OpenTSDB


Audience

This course is appropriate for developers and administrators who intend to use HBase.

Language

English

Prerequisites

Prerequisites for this course include: experience with databases and data modeling, knowledge of Java, & Cloudera Developer Training for Apache Hadoop

$2,595

Length: 4.0 days (32 hours)

Level:

Not Your Location? Change

Course Schedule:

To request a custom delivery, please chat with an expert.

Loading...