HWOPMAN - Apache Hadoop 2.0 Operations Management with the Hortonworks Data Platform

This four-day Apache Hadoop 2.0 training course is designed for administrators who deploy and manage Apache Hadoop 2.0 clusters. Through a combination of lecture and hands-on exercises you will learn how to install, configure, maintain and scale your Hadoop 2.0 environment. At the end of this course you will have a solid understanding of how Hadoop works with Big Data and through the hands-on exercises will have completed the Hadoop deployment lifecycle for a multi-node cluster.

Click here to print this page »


This course is designed for IT administrators and operators responsible for installing, configuring and supporting an Apache Hadoop 2.0 deployment in a Linux environment.

Detailed Class Syllabus

Day 1: Foundation, Planning and Installation

Introduction to Hortonworks Data Platform & Hadoop 2.0
Hadoop Storage: HDFS Architecture
Installation Prerequisites
HDP Management: Ambari
Ambari and the Command Line
Hadoop Operating System (YARN) & MapReduce

Day 2: Configuration / Data Management

Configuring Services
Configuring HDFS
Configuring Hadoop Operating System (YARN) & MapReduce
Configuring ZooKeeper
Configuring Schedulers
Data Integrity
Extract-Load-Transform (ELT) Data Movement
Copying Data Between Clusters

Day 3: Data Management / Hortonworks Data Platform (HDP) 2.0 Operations

HDFS Web Services
Apache Hive Data Warehouse
Transferring data with Sqoop
Moving Log Data with Flume
Setting up the HDFS NFS Gateway
Workflow Management: Ooze
Monitoring HDP 2.0 Services
Commissioning and Decommissioning a Nodes and Services

Day 4: Hortonworks Data Platform (HDP) 2.0 Operations

Rack Awareness and Topology
NameNode Federation Architecture
NameNode High-Availability (HA) Architecture
Backup & Recovery