  • Classroom
  • Online, Instructor-Led
Course Description

This Cloudera developer training course delivers the key concepts and expertise participants need to create robust data processing applications using Apache Hadoop.

Learning Objectives

  • Understand what Hadoop is and what its ecosystem components are
  • Understand Hadoop infrastructure, data management, and job mechanics
  • Query Hadoop and work with Pig, Sqoop, Flume, and Oozie
  • Analyze the benefits and challenges of the HDFS architecture
  • Identify the role of Apache Hadoop classes, interfaces, and methods
  • Understand the roles of the RecordReader, sequence files, and compression
  • Write a MapReduce job to implement a HiveQL statement
  • Write a MapReduce job to query data stored in HDFS

Framework Connections

The materials within this course focus on the Knowledge, Skills, and Abilities (KSAs) identified within the Specialty Areas listed below. Click to view Specialty Area details within the interactive National Cybersecurity Workforce Framework.