Hadoop Programming Training in Santa Rosa

Enroll in or hire us to teach our Hadoop Programming class in Santa Rosa, California by calling us @303.377.6176. Like all HSG classes, Hadoop Programming may be offered either onsite or via instructor led virtual training. Consider looking at our public training schedule to see if it is scheduled: Public Training Classes
Provided there are enough attendees, Hadoop Programming may be taught at one of our local training facilities.
We offer private customized training for groups of 3 or more attendees.

Course Description

 
This intense course is designed to give developers hands-on working knowledge for harnessing the power of Hadoop. The course begins with a discussion of the Hadoop ecosystem and then works its way to hands-on exploration of MapReduce with datasets and live clusters. The course also goes over some common configuration mechanisms, tools and debugging.
Course Length: 3 Days
Course Tuition: $1190 (US)

Prerequisites

This course is intended for Java software developers who need to write applications that use distributed systems and MapReduce.

Course Outline

 
Course Topics
 
• MapReduce Jobs
• Counters
• Distributed Cache
• Combiner/Partitioner
• Configuration
• Debugging
• Input and Output Formats
• Tuning and Optimizing MapReduce
• Joins
• MapReduce Streaming
• Unit Testing
• Workflows
• Library Classes
• Filters and Sorting (if time permits)
• Other Hadoop Tools (if time permits)
• Architecting Solutions (if time permits)
 
Course Objectives
Upon completion of this course attendees will be able to:
• Describe the anatomy of a MapReduce job
• Perform joins by writing MapReduce code in Java
• Implement common algorithms in Hadoop
• Apply best practices for Hadoop development and debugging
• Describe other Hadoop tools including Hive and Pig
 
Course Outline
 
I. DAY 1
A. Review of MapReduce Basics
B. Anatomy of a MapReduce Job
C. MapReduce Web UI
D. Counters
E. Distributed Cache
 
II. DAY 2
A. Combiner/Partition
B. Configuration
C. Debugging
D. Writing Fool-Proof MapReduce Code
E. Input and Output Formats
F. Tuning and Optimizing MapReduce
G. Joins
 
III. DAY 3
A. MapReduce Streaming (using Ruby or Python if possible)
B. Unit Testing
C. Work Flow Tools
D. Useful Library Classes
 
IV. IF TIME PERMITS
A. Filters
B. Sorting
C. Higher Level MapReduce (Pig & Hive) and Other Tools
1. Advanced HQL
2. Serdes
3. Pig Basics
4. Mahout Basics
D. Architecting Solutions with MapReduce and Case Studies

Course Directory [training on all levels]

Upcoming Classes
Gain insight and ideas from students with different perspectives and experiences.

Interesting Reads Take a class with us and receive a book of your choosing for 50% off MSRP.