• Online, Self-Paced
Course Description

Big data analytics with Microsoft Azure requires effective deployment of various frameworks and technologies. This course covers batch processing technologies and data security tools in Azure, and prepares you for exam 70-475.

Learning Objectives

Batch Processing Utilities

  • start the course
  • recognize key features of Apache Sqoop
  • demonstrate how to import data from an RDBMS to the Hadoop Distributed File System
  • manage and monitor clusters with Apache Ambari
  • manage workflows with Oozie
  • use HCatalog for table and storage management in Hadoop
  • recognize the key functions of Apache Zookeeper

Batch Processing Technologies

  • use Apache Pig for data analysis
  • manage large datasets with Apache Hive
  • recognize key features and functionalities of Azure Batch
  • recognize key features and functionalities of Apache Mahout
  • identify key features and data sources of Spark SQL
  • use MapReduce for writing applications
  • use PowerShell to handle big data
  • use SQL Server Analysis Services
  • process large datasets with Data Factory and Batch

Designing Data Security

  • recognize Azure's technical data security capabilities
  • recognize features of role-based and row-based security
  • configure firewall and proxy server settings
  • recognize key functions of Shared Access Signatures

Practice: Batch Processing

  • recognize key features and capabilities of batch processing technologies

Framework Connections

The materials within this course focus on the Knowledge, Skills, and Abilities (KSAs) identified within the Specialty Areas listed below. Click to view Specialty Area details within the interactive National Cybersecurity Workforce Framework.