Hadoop Programming Training
We offer private customized training for groups of 3 or more attendees.
|
||
Course Description |
||
This intense course is designed to give developers hands-on working
knowledge for harnessing the power of Hadoop. The course begins with a
discussion of the Hadoop ecosystem and then works its way to hands-on
exploration of MapReduce with datasets and live clusters. The course
also goes over some common configuration mechanisms, tools and debugging.
Course Length: 3 Days
Course Tuition: $1190 (US) |
Prerequisites |
|
This course is intended for Java software developers who need to write applications that use distributed systems and MapReduce. |
Course Outline |
Course Topics
• MapReduce Jobs
• Counters
• Distributed Cache
• Combiner/Partitioner
• Configuration
• Debugging
• Input and Output Formats
• Tuning and Optimizing MapReduce
• Joins
• MapReduce Streaming
• Unit Testing
• Workflows
• Library Classes
• Filters and Sorting (if time permits)
• Other Hadoop Tools (if time permits)
• Architecting Solutions (if time permits)
Course Objectives
Upon completion of this course attendees will be able to:
• Describe the anatomy of a MapReduce job
• Perform joins by writing MapReduce code in Java
• Implement common algorithms in Hadoop
• Apply best practices for Hadoop development and debugging
• Describe other Hadoop tools including Hive and Pig
Course Outline
I. DAY 1
A. Review of MapReduce Basics
B. Anatomy of a MapReduce Job
C. MapReduce Web UI
D. Counters
E. Distributed Cache
II. DAY 2
A. Combiner/Partition
B. Configuration
C. Debugging
D. Writing Fool-Proof MapReduce Code
E. Input and Output Formats
F. Tuning and Optimizing MapReduce
G. Joins
III. DAY 3
A. MapReduce Streaming (using Ruby or Python if possible)
B. Unit Testing
C. Work Flow Tools
D. Useful Library Classes
IV. IF TIME PERMITS
A. Filters
B. Sorting
C. Higher Level MapReduce (Pig & Hive) and Other Tools
1. Advanced HQL
2. Serdes
3. Pig Basics
4. Mahout Basics
D. Architecting Solutions with MapReduce and Case Studies
|
Course Directory [training on all levels]
Technical Training Courses
Software engineer/architect, System Admin ... Welcome!
- .NET Classes
- Agile/Scrum Classes
- Ajax Classes
- Android and iPhone Programming Classes
- Blaze Advisor Classes
- C Programming Classes
- C# Programming Classes
- C++ Programming Classes
- Cisco Classes
- Cloud Classes
- CompTIA Classes
- Crystal Reports Classes
- Design Patterns Classes
- DevOps Classes
- Foundations of Web Design & Web Authoring Classes
- Git, Jira, Wicket, Gradle, Tableau Classes
- IBM Classes
- Java Programming Classes
- JBoss Administration Classes
- JUnit, TDD, CPTC, Web Penetration Classes
- Linux Unix Classes
- Machine Learning Classes
- Microsoft Classes
- Microsoft Development Classes
- Microsoft SQL Server Classes
- Microsoft Team Foundation Server Classes
- Microsoft Windows Server Classes
- Oracle, MySQL, Cassandra, Hadoop Database Classes
- Perl Programming Classes
- Python Programming Classes
- Ruby Programming Classes
- Security Classes
- SharePoint Classes
- SOA Classes
- Tcl, Awk, Bash, Shell Classes
- UML Classes
- VMWare Classes
- Web Development Classes
- Web Services Classes
- Weblogic Administration Classes
- XML Classes
Business Training Courses
Project Managers, Business Analysts, Paralegals ... Welcome!
Upcoming Classes
Gain insight and ideas from students with different perspectives and experiences.
- Introduction to Spring 5 (2022)
16 December, 2024 - 18 December, 2024 - Introduction to C++ for Absolute Beginners
16 December, 2024 - 17 December, 2024 - See our complete public course listing