Big data analytics with Microsoft Azure depends on deploying a range of frameworks and technologies effectively. This course covers batch processing technologies and data security tools in Azure and helps prepare you for exam 70-475.
Learning Objectives
Batch Processing Utilities
- recognize key features of Apache Sqoop
- demonstrate how to import data from an RDBMS into the Hadoop Distributed File System (HDFS)
- manage and monitor clusters with Apache Ambari
- manage workflows with Oozie
- use HCatalog for table and storage management in Hadoop
- recognize the key functions of Apache ZooKeeper
Batch Processing Technologies
- use Apache Pig for data analysis
- manage large datasets with Apache Hive
- recognize key features and functionalities of Azure Batch
- recognize key features and functionalities of Apache Mahout
- identify key features and data sources of Spark SQL
- write applications with MapReduce
- use PowerShell to handle big data
- use SQL Server Analysis Services
- process large datasets with Azure Data Factory and Azure Batch
Designing Data Security
- recognize Azure's technical data security capabilities
- recognize key features of role-based and row-level security
- configure firewall and proxy server settings
- recognize key functions of Shared Access Signatures
Practice: Batch Processing
- recognize key features and capabilities of batch processing technologies