Advanced Hadoop using Pig, Hive, and HBase Training

We offer private customized training for groups of 3 or more attendees.

Course Description

 
This course delves into data management in HDFS, advanced Pig, Hive, and Hbase. These advanced programming techniques will be beneficial to experienced Hadoop developers. Course Topics... Data Management in HDFS... Advanced Pig ... Advanced Hive ... Advanced HBase
Course Length: 3 Days
Course Tuition: $1190 (US)

Prerequisites

This course is intended for experienced software developers and architects who know the basics of Hadoop and looking for advanced programming techniques.

Course Outline

 
 
I. Data Management in HDFS
A. Various Data Formats (JSON / Avro / Parquet)
B. Compression Schemes
C. Data Masking
D. Labs
 
II. Advanced Pig
A. User-defined Functions
B. Introduction to Pig Libraries (ElephantBird / Data-Fu)
C. Loading Complex Structured Data using Pig
D. Pig Tuning
E. Labs
 
III. Advanced Hive
A. User-defined Functions
B. Compressed Tables
C. Hive Performance Tuning
D. Labs
IV. HBase
 
A. Advanced Schema Modeling
B. Compression
C. Bulk Data Ingest
D. Wide-table / Tall-table comparison
E. HBase and Pig
F. HBase and Hive
G. HBase Performance Tuning
H. Labs
 
V. Final Project
A. End-to-End Project includes use of Learned Technologies

Course Directory [training on all levels]

Upcoming Classes
Gain insight and ideas from students with different perspectives and experiences.

Interesting Reads Take a class with us and receive a book of your choosing for 50% off MSRP.