In this course, the students will implement various data platform technologies into solutions that are in line with business and technical requirements including on-premises, cloud, and hybrid data scenarios incorporating both
relational and No-SQL data. They will also learn how to process data using a range of technologies and languages for both streaming and batch data.
The students will also explore how to implement data security including authentication, authorization, data policies, and standards. They will also define and implement data solution monitoring for both the data storage and data processing activities. Finally, they will manage and troubleshoot Azure data solutions which include the optimization and disaster recovery of big data, batch processing, and streaming data solutions.
The primary audience for this course is data professionals, data architects, and business intelligence professionals who want to learn about the data platform technologies that exist on Microsoft Azure.
The secondary audience for this course is individuals who develop applications that deliver content from the data platform technologies that exist on Microsoft Azure.
In addition to their professional experience, students who take this training should have technical knowledge equivalent to the following courses:
After completing the course delegates will be able to:
Module 1 - Azure for the Data Engineer
This module explores how the world of data has evolved and how cloud data platform technologies are providing new opportunities for business to explore their data in different ways. The student will gain an overview of the various data platform technologies that are available, and how a Data Engineers' role and responsibilities have evolved to work in this new world to an organization's benefit.
Lessons:
Lab: Azure for the Data Engineer
Module 2 - Working with Data Storage
This module teaches the variety of ways to store data in Azure. The Student will learn the basics of storage management in Azure, how to create a Storage Account, and how to choose the right model for the data you want to store in the cloud. They will also understand how data lake storage can be created to support a wide variety of big data analytics solutions with minimal effort.
Lessons:
Lab: Working with Data Storage
Module 3 - Enabling Team Based Data Science with Azure Databricks
This module introduces students to Azure Databricks and how a Data Engineer works with it to enable an organization to perform Team Data Science projects. They will learn the fundamentals of Azure Databricks and Apache Spark notebooks; how to provision the service and workspaces and learn how to perform data preparation tasks that can contribute to the data science project.
Lessons:
Lab: Enabling Team Based Data Science with Azure Databricks
Module 4 - Building Globally Distributed Databases with Cosmos DB
In this module, students will learn how to work with NoSQL data using Azure Cosmos DB. They will learn how to provide the service, and how they can load and interrogate data in the service using Visual Studio Code extensions, and the Azure Cosmos DB .NET Core SDK. They will also learn how to configure the available options so that users are able to access the data from anywhere in the world.
Lessons:
Lab: Building Globally Distributed Databases with Cosmos DB
Module 5 - Working with Relational Data Stores in the Cloud
In this module, students will explore the Azure relational data platform options including SQL Database and SQL Data Warehouse. The student will be able to explain why they would choose one service over another, and how to provision, connect and manage each of the services.
Lessons:
Lab: Working with Relational Data Stores in the Cloud
Module 6 - Performing Real-Time Analytics with Stream Analytics
In this module, students will learn the concepts of event processing and streaming data and how this applies to Events Hubs and Azure Stream Analytics. The students will then set up a stream analytics job to stream data and learn how to query the incoming data to perform analysis of the data. Finally, you will learn how to manage and monitor running jobs.
Lessons:
Lab: Performing Real-Time Analytics with Stream Analytics
Module 7 - Orchestrating Data Movement with Azure Data Factory
In this module, students will learn how the Azure Data factory can be used to orchestrate the data movement and transformation from a wide range of data platform technologies. They will be able to explain the capabilities of the technology and be able to set up an end-to-end data pipeline that ingests and transforms data.
Lessons:
Lab: Orchestrating Data Movement with Azure Data Factory
Module 8 - Securing Azure Data Platforms
In this module, students will learn how Azure Storage provides a multi-layered security model to protect your data. The students will explore how security can range from setting up secure networks and access keys, to defining permission through to monitoring with Advanced Threat Detection.
Lessons:
Lab: Securing Azure Data Platforms
Module 9 - Monitoring and Troubleshooting Data Storage and Processing
In this module, the student will look at the wide range of monitoring capabilities that are available to provide operational support should there be an issue with a data platform architecture. They will explore the data engineering troubleshooting approach and be able to apply this to common data storage and data processing issues.
Lessons:
Lab: Monitoring and Troubleshooting Data Storage and Processing
Module 10 - Integrating and Optimizing Data Platforms
In this module, the student will explore the various ways in which data platforms can be integrated based upon different business requirements. They will also explore the various ways in which data platforms can be optimized from a storage and data processing perspective to improve data loads. Finally, disaster recovery options are revealed to ensure business continuity.
Lessons:
Lab: Integrating and Optimizing Data Platforms
Notifications