• Classroom
• Online, Instructor-Led
Course Description

This Cloudera training course is available online and in person. The three-day, instructor-led course covers traditional data analysis techniques and analytics using SQL and other scripting languages. Students learn to use Apache Pig, Hive, and Cloudera Impala. Attendees gain a stronger grasp of best practices for managing, manipulating, and querying large, complex data sets in real time.

Learning Objectives

The course covers the following topics: Hadoop Fundamentals, Apache Pig, Pig and Data Analysis, Pig and Complex Data Analysis, Pig for Multi-Dataset Operations, Pig for Optimizing and Troubleshooting, Apache Hive, Hive and Relational Data Analysis, Hive Data Management, Processing Text in Hive, Hive Optimization, Extensions for Hive, and Impala and Data Analysis. It also prepares students for the Data Science Essentials exam.

Framework Connections