• Online, Self-Paced
Course Description

This course covers various data genres and management tools, the reasons behind the evolving plethora of new big data platforms from the perspective of big data management systems, and analytical tools.

Learning Objectives

Big Data Designing and Modeling

  • start the course
  • define data management
  • recognize important data modeling concepts in Hadoop
  • identify important issues for storing data in Hadoop
  • recognize important considerations when designing HDFS schema
  • recognize important points when designing HDFS schema
  • identify basic concepts of data movement in Hadoop
  • list important factors that need to be considered for importing data into Hadoop
  • identify tools and methods for moving data into Hadoop
  • recognize characteristics of a data stream
  • define how data lakes enable batch processing
  • define data security management and its major domains
  • define Kerberos
  • define basics of authentication in Hadoop using Kerberos
  • identify central issues in processing and management of big data

Practice: Key Features of Data Modeling

  • identify important points in Hadoop data modeling

Framework Connections

The materials within this course focus on the Knowledge Skills and Abilities (KSAs) identified within the Specialty Areas listed below. Click to view Specialty Area details within the interactive National Cybersecurity Workforce Framework.