HADOOP FOR SYSTEMS ADMINISTRATORS Training in Palo Alto

Enroll in or hire us to teach our HADOOP FOR SYSTEMS ADMINISTRATORS class in Palo Alto, California by calling us @303.377.6176. Like all HSG classes, HADOOP FOR SYSTEMS ADMINISTRATORS may be offered either onsite or via instructor led virtual training. Consider looking at our public training schedule to see if it is scheduled: Public Training Classes
Provided there are enough attendees, HADOOP FOR SYSTEMS ADMINISTRATORS may be taught at one of our local training facilities.
We offer private customized training for groups of 3 or more attendees.

Course Description

 
This course covers the essentials of deploying and managing an Apacheâ?¢ Hadoop cluster. The course is lab intensive with each participant creating their own Hadoop cluster using either the CDH (Cloudera's Distribution, including Apache Hadoop) or Hortonworks Data Platform stacks. Core Hadoop services are explored in depth with emphasis on troubleshooting and recovering from common cluster failures. The fundamentals of related services such as Ambari, Zookeeper, Pig, Hive, HBase, Sqoop, Flume, and Oozie are also covered. The course is approximately 60% lecture and 40% labs.
Course Length: 3 Days
Course Tuition: $1890 (US)

Prerequisites

Qualified participants should be comfortable with the Linux commands and have some systems administration experience, but do not need previous Hadoop experience.

Course Outline

 
  1. HADOOP OVERVIEW
    1. Data Analysis
    2. Big Data
    3. Origins of Hadoop
    4. Hadoop Marketplace
    5. Hadoop Core
    6. Hadoop Ecosystem
    7. Cluster Architecture
    8. Hardware/Software Requirements
    9. Running Commands on Multiple Systems
    LAB TASKS
    1. Running Commands on Multiple Hosts
    2. Preparing to Install Hadoop
  2. HDFS
    1. Design Goals
    2. Design
    3. Blocks
    4. Block Replication
    5. Namenode Daemon
    6. Secondary Namenode Daemon
    7. Datanode Daemon
    8. Accessing HDFS
    9. Permissions and Users
    10. Adding and Removing Datanodes
    11. Balancing
    LAB TASKS
    1. Single Node HDFS
    2. Multi-node HDFS
    3. Files and HDFS
    4. Managing and Maintaining HDFS
  3. YARN
    1. YARN Design Goals
    2. YARN Architecture
    3. Resource Manager
    4. Node Manager
    5. Containers
    6. YARN: Other Important Features
    7. Slider
    LAB TASKS
    1. YARN
  4. MAPREDUCE
    1. MapReduce
    2. Terminology and Data Flow
    LAB TASKS
    1. Mapreduce
  5. INSTALLING HADOOP WITH AMBARI LAB TASKS
    1. CDH Uninstall
    2. Installing Hadoop with Ambari
    3. Tez
  6. DATA INGESTION
    1. Sqoop
    2. Flume
    3. Kafka
    LAB TASKS
    1. Sqoop
  7. DATA LINEAGE AND GOVERNANCE
    1. Falcon
    2. Atlas
    3. Oozie
  8. DATA PROCESSING FRAMEWORKS
    1. The Bane of MapReduce
    2. Tez overview
    3. Pig
    4. Hive
    5. Spark
    6. Storm
    7. Solr
    LAB TASKS
    1. Pig
  9. NOSQL IMPLEMENTATIONS
    1. HBase
    2. Phoenix
  10. CLUSTER MANAGEMENT
    1. Ambari Metrics System (AMS)
    2. Zookeeper

Course Directory [training on all levels]

Upcoming Classes
Gain insight and ideas from students with different perspectives and experiences.

Linux Uses & Stats

Linux is Used For:
Desktop Mainframe Computers Mobile Devices Embedded Devices
Difficulty
Popularity
Year Created
1991/1971
Pros
Performance:
Linux supports many efficient tools and operates them seamlessly. Because it's architecture is lightweight it runs faster than both Windows 8.1 and 10. 
 
Security:
Because Linux is an open-source software,  anyone can contribute code to help enhance the users’ experience i.e., adding features, fixing bugs, reducing security risks, and more.
 
 
Software Development:
The terminal in Linux is a *wild card*. You can do almost anything with it. This includes software installation, application and server configurations, file system management, and etc.
 
Large-scale:
Open-source projects benefit from having an attentive community. As a result, Linux is more secure than Windows. Instead of installing anti viruses to clean malware, you just have to stick to the recommended repositories. 
 
Efficient: 
Developers have the convenience of running servers, training machine learning models, accessing remote machines, and compiling and running scripts from the same terminal window. 
 
Free: 
Linux is free (you can put it on as many systems as you like) and you can change it to suit your needs.
Cons
Learning Curve: 
Linux is not for everyone, there is a learning curve in switching to Ubuntu. To actually learn Linux efficiently would take a user one to several years.
 
No Tech Support:
Unlike Windows, there isn’t a dedicated tech support, so getting help for things is up to you. 
 
Designer Compatabilty:
Linux is not as user friendly as Windows or as ‘straight out of the box design’ As an example for design choices, Adobe hasn’t released any of its products to Linux users. So it’s impossible to run them directly. The Ubuntu alternative is a free software called GIMP. 
 
Gaming Capabilities: 
Most games aren’t available in Linux. But that’s not to say you can’t make it happen, it's just not as easy.   
Linux Job Market
Average Salary
$85k-$105k
Job Count
n/a
Top Job Locations

New York City
Boston
San Francisco 

Complimentary Skills to have along with Linux
The following are types of jobs that may require Linux skills.  The top 15 job titles on Dice.com that mention Linux in their postings are:
- DevOps Engineer
- Software Engineer
- Java Developer
- Systems Engineer
- Systems Administrator
- Senior Software Engineer
- Network Engineer
- Python Developer
- Linux Systems Administrator
- Software Developer
- System Administrator
- Linux Administrator
- Linux Engineer
- Senior Java Developer
- C++ Developer

Interesting Reads Take a class with us and receive a book of your choosing for 50% off MSRP.