This 5-day training course is designed for primarily for systems administrators and platform architects who need to understand HDP cluster capabilities, and manage HDP clusters. Topics include: Understanding HDF capabilities, Apache Hadoop, Apache YARN, HDFS, and other Hadoop ecosystem components. Students will understand how to administer, manage, and monitor HDP clusters.
Student Testimonials
Instructor did a great job, from experience this subject can be a bit dry to teach but he was able to keep it very engaging and made it much easier to focus.
Student
Excellent presentation skills, subject matter knowledge, and command of the environment.
Student
Instructor was outstanding. Knowledgeable, presented well, and class timing was perfect.
Student
Click here to print this page »
Prerequisites
Students should be familiar with server or platform software concepts and have a basic understanding of system administration.
Detailed Class Syllabus
Day 1: An Introduction to Apache Hadoop and HDFS
Big Data, Hadoop and the Hortonworks Data Platform
Installing the Hortonworks Data Platform
Using HDFS Storage
Managing Apache Ambari Users and Groups
Managing Hadoop Services
Day 2: Working with HDFS
Using HDFS Storage
Managing HDFS Storage
Adding, Deleting, and Replacing Worker Nodes
Configuring Rack Awareness
Day 3: Working with Apache YARN
YARN Resource Management
YARN Applications
YARN Capacity Scheduler
Day 4: High Availability, Backups and Configuring Centralized Cache
HDFS and YARN High Availability
Monitoring a Cluster
Protecting a Cluster with Backups
Configuring Heterogenous HDFS Storage
Managing the HDFS NFS Gateway
Configuring HDFS Centralized Cache
Day 5: Performing a Rolling Upgrade
Apache Hive Tuning
Managing Workflows Using Apache Oozie
Integrating Ambari with LDAP
Automating Cluster Provisioning Using Ambari Blueprints
Performing an HDP Rolling Upgrade