Advanced Hadoop using Pig, Hive, and HBase Training in Pasadena

Enroll in or hire us to teach our Advanced Hadoop using Pig, Hive, and HBase class in Pasadena, Texas by calling us @303.377.6176. Like all HSG classes, Advanced Hadoop using Pig, Hive, and HBase may be offered either onsite or via instructor led virtual training. Consider looking at our public training schedule to see if it is scheduled: Public Training Classes
Provided there are enough attendees, Advanced Hadoop using Pig, Hive, and HBase may be taught at one of our local training facilities.
We offer private customized training for groups of 3 or more attendees.

Course Description

 
This course delves into data management in HDFS, advanced Pig, Hive, and Hbase. These advanced programming techniques will be beneficial to experienced Hadoop developers. Course Topics... Data Management in HDFS... Advanced Pig ... Advanced Hive ... Advanced HBase
Course Length: 3 Days
Course Tuition: $1190 (US)

Prerequisites

This course is intended for experienced software developers and architects who know the basics of Hadoop and looking for advanced programming techniques.

Course Outline

 
 
I. Data Management in HDFS
A. Various Data Formats (JSON / Avro / Parquet)
B. Compression Schemes
C. Data Masking
D. Labs
 
II. Advanced Pig
A. User-defined Functions
B. Introduction to Pig Libraries (ElephantBird / Data-Fu)
C. Loading Complex Structured Data using Pig
D. Pig Tuning
E. Labs
 
III. Advanced Hive
A. User-defined Functions
B. Compressed Tables
C. Hive Performance Tuning
D. Labs
IV. HBase
 
A. Advanced Schema Modeling
B. Compression
C. Bulk Data Ingest
D. Wide-table / Tall-table comparison
E. HBase and Pig
F. HBase and Hive
G. HBase Performance Tuning
H. Labs
 
V. Final Project
A. End-to-End Project includes use of Learned Technologies

Interesting Reads Take a class with us and receive a book of your choosing for 50% off MSRP.