• Classroom
  • Online, Instructor-Led
Course Description

Hive and Pig allows the management and manipulation of data in a Hadoop cluster without Java programming experience. Apache Hive is Hadoop's data warehouse infrastructure and it makes multi-structured data accessible to analysts, database administrators, and others without Java programming expertise. Apache Pig applies the fundamentals of familiar scripting languages to the Hadoop cluster. It is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. In this hands-on course, students learn how Apache Pig and Apache Hive enable data transformations and analyses via filters, joins, and user-defined functions. Students learn how to apply data analytics and business intelligence skills to big data, including how to access, manipulate, and analyze complex data sets using SQL and other scripting languages.

Learning Objectives

Learn how Apache Pig and Apache Hive enable data transformations and analyses via filters, joins, and user-defined functions. Students learn how to apply data analytics and business intelligence skills to big data, including how to access, manipulate, and analyze complex data sets using SQL and other scripting languages

Framework Connections

The materials within this course focus on the Knowledge Skills and Abilities (KSAs) identified within the Specialty Areas listed below. Click to view Specialty Area details within the interactive National Cybersecurity Workforce Framework.