HBase for Developers (HBDEV)

Course Description Schedule Course Outline
 

Course Content

HBase for Developers is a three-day, hands-on course introducing you to HBase – a NoSQL store on top of Hadoop. In this course you will learn HBase architecture and data modelling as well as application development on HBase. This course also covers using MapReduce with HBase, and some administration topics, related to performance optimization.

Who should attend

  • Developers who will be using HBase
  • Administrators who will be managing a HBase cluster

Prerequisites

  • Comfortable with Java programming language
  • Comfortable navigating the Linux command line and editing files using vi or nano

Detailed Course Outline

Module 1: Introduction to Big Data & NoSQL

  • Big Data ecosystem
  • NoSQL overview
  • CAP theorem
  • When is NoSQL appropriate
  • Columnar storage
  • HBase and NoSQL

Module 2: HBase Intro

  • Concepts and design
  • Architecture (HMaster and region server)
  • Data integrity
  • HBase ecosystem
  • Lab: Exploring HBase

Module 3: HBase Data model

  • Namespaces, tables, and regions
  • Rows, columns, column families, versions
  • HBase shell and admin commands
  • Lab

Module 4: Accessing HBase using Java API

  • Introduction to Java API
  • Read / Write path
  • Time series data
  • Scans
  • MapReduce
  • Filters
  • Counters
  • Co-processors
  • Labs (multiple)

Module 5: HBase schema Design : Group session

  • Real world use cases
  • Group design solution development
  • Discussion and critique of the designs

Module 6: HBase Internals

  • Understanding HBase under the hood
  • Memfile / HFile / WAL
  • HDFS storage
  • Compactions
  • Splits
  • Bloom filters
  • Caches
  • Diagnostics

Module 7: HBase installation and configuration

  • Hardware selection
  • Install methods
  • Common configurations
  • Lab: installing HBase

Module 8: HBase eco-system

  • Developing applications using HBase
  • Interacting with other Hadoop stack (MapReduce, Pig, Hive)
  • Frameworks around HBase
  • Advanced concepts (co-processors)

Module 9: Monitoring And Best Practices

  • Monitoring tools and practices
  • Optimizing HBase
  • HBase in the cloud
  • Real world use cases of HBase
Classroom Training

Duration 3 days

Price
  • United States: US$ 2,500
Enroll now
Online Training

Duration 3 days

Price
  • United States: US$ 2,500
Enroll now