Call Us: +32 2 466 00 16

Email: info@u2u.be

Data Engineering on Microsoft Azure

5 days

dp-203

5 days

Upcoming Sessions

Date:

Format:

Price:

Location:

Book now

Date:

Format:

Price:

Location:

Book now

Date:

Format:

Price:

Location:

Book now

Date:

Format:

Price:

Book now

Show fewer Show more

Need a private training for your team? Request a private training

Not ready to book yet? Request an offer

Introduction to data engineering on Azure

This module describes how Microsoft Azure provides a comprehensive platform for data engineering.

What is data engineering
Important data engineering concepts
Data engineering in Microsoft Azure

Introduction to Azure Data Lake Storage Gen2

Discover how Data Lake Storage provides a repository where you can upload and store unstructured data bringing new efficiencies to processing big data analytics.

Understand Azure Data Lake Storage Gen2
Enable Azure Data Lake Storage Gen2 in Azure Storage
Compare Azure Data Lake Store to Azure Blob storage
Understand the stages for processing big data
Use Azure Data Lake Storage Gen2 in data analytics workloads

Introduction to Azure Synapse Analytics

What is Azure Synapse Analytics
How Azure Synapse Analytics works
When to use Azure Synapse Analytics
Exercise - Explore Azure Synapse Analytics

Use Azure Synapse serverless SQL pool to query files in a data lake

Understand Azure Synapse serverless SQL pool capabilities and use cases
Query files using a serverless SQL pool
Create external database objects
Exercise - Query files using a serverless SQL pool

Use Azure Synapse serverless SQL pools to transform data in a data lake

Transform data files with the CREATE EXTERNAL TABLE AS SELECT statement
Encapsulate data transformations in a stored procedure
Include a data transformation stored procedure in a pipeline
Exercise - Transform files using a serverless SQL pool

Create a lake database in Azure Synapse Analytics

Understand lake database concepts
Explore database templates
Create a lake database
Use a lake database
Exercise - Analyze data in a lake database

Analyze data with Apache Spark in Azure Synapse Analytics

Get to know Apache Spark
Use Spark in Azure Synapse Analytics
Analyze data with Spark
Visualize data with Spark
Exercise - Analyze data with Spark

Transform data with Spark in Azure Synapse Analytics

Learn how to use Apache Spark pools in Azure Synapse Analytics to transform data.

Modify and save dataframes
Partition data files
Transform data with SQL
Exercise: Transform data with Spark in Azure Synapse Analytics

Use Delta Lake in Azure Synapse Analytics

Delta Lake is an open source relational storage area for Spark that you can use to implement a data lakehouse architecture in Azure Synapse Analytics.

Understand Delta Lake
Create Delta Lake tables
Create catalog tables
Use Delta Lake with streaming data
Use Delta Lake in a SQL pool
Exercise - Use Delta Lake in Azure Synapse Analytics

Analyze data in a relational data warehouse

Design a data warehouse schema
Create data warehouse tables
Load data warehouse tables
Query a data warehouse
Exercise - Explore a data warehouse

Load data into a relational data warehouse

Learn how to load tables in a relational data warehouse that is hosted in a dedicated SQL pool in Azure Synapse Analytics.

Load staging tables
Load dimension tables
Load time dimension tables
Load slowly changing dimensions
Load fact tables
Perform post load optimization
Exercise - load data into a relational data warehouse

Build a data pipeline in Azure Synapse Analytics

Understand pipelines in Azure Synapse Analytics
Create a pipeline in Azure Synapse Studio
Define data flows
Run a pipeline
Exercise - Build a data pipeline in Azure Synapse Analytics

Use Spark Notebooks in an Azure Synapse Pipeline

This module describes how Apache Spark notebooks can be integrated into an Azure Synapse Analytics pipeline.

Understand Synapse Notebooks and Pipelines
Use a Synapse notebook activity in a pipeline
Use parameters in a notebook
Exercise - Use an Apache Spark notebook in a pipeline

Plan hybrid transactional and analytical processing using Azure Synapse Analytics

Understand hybrid transactional and analytical processing patterns
Describe Azure Synapse Link

Implement Azure Synapse Link with Azure Cosmos DB

Enable Cosmos DB account to use Azure Synapse Link
Create an analytical store enabled container
Create a linked service for Cosmos DB
Query Cosmos DB data with Spark
Query Cosmos DB with Synapse SQL
Exercise - Implement Azure Synapse Link for Cosmos DB

Implement Azure Synapse Link for SQL

What is Azure Synapse Link for SQL?
Configure Azure Synapse Link for Azure SQL Database
Configure Azure Synapse Link for SQL Server 2022
Exercise - Implement Azure Synapse Link for SQL

Get started with Azure Stream Analytics

Understand data streams
Understand event processing
Understand window functions
Exercise - Get started with Azure Stream Analytics

Ingest streaming data using Azure Stream Analytics and Azure Synapse Analytics

Azure Stream Analytics provides a real-time data processing engine that you can use to ingest streaming event data into Azure Synapse Analytics for further analysis and reporting.

Stream ingestion scenarios
Configure inputs and outputs
Define a query to select, filter, and aggregate data
Run a job to ingest data
Exercise - Ingest streaming data into Azure Synapse Analytics

Visualize real-time data with Azure Stream Analytics and Power BI

By combining the stream processing capabilities of Azure Stream Analytics and the data visualization capabilities of Microsoft Power BI, you can create real-time data dashboards.

Use a Power BI output in Azure Stream Analytics
Create a query for real-time visualization
Create real-time data visualizations in Power BI
Exercise - Create a real-time data visualization

Introduction to Microsoft Purview

What is Microsoft Purview?
How Microsoft Purview works
When to use Microsoft Purview

Integrate Microsoft Purview and Azure Synapse Analytics

Learn how to integrate Microsoft Purview with Azure Synapse Analytics to improve data discoverability and lineage tracking.

Catalog Azure Synapse Analytics data assets in Microsoft Purview
Connect Microsoft Purview to an Azure Synapse Analytics workspace
Search a Purview catalog in Synapse Studio
Track data lineage in pipelines
Exercise - Integrate Azure Synapse Analytics and Microsoft Purview

Explore Azure Databricks

Get started with Azure Databricks
Identify Azure Databricks workloads
Understand key concepts
Exercise - Explore Azure Databricks

Use Apache Spark in Azure Databricks

Get to know Spark
Create a Spark cluster
Use Spark in notebooks
Use Spark to work with data files
Visualize data
Exercise - Use Spark in Azure Databricks

Run Azure Databricks Notebooks with Azure Data Factory

Understand Azure Databricks notebooks and pipelines
Create a linked service for Azure Databricks
Use a Notebook activity in a pipeline
Use parameters in a notebook
Exercise - Run an Azure Databricks Notebook with Azure Data Factory

In this course, the student will learn how to implement and manage data engineering workloads on Microsoft Azure, using Azure services such as Azure Synapse Analytics, Azure Data Lake Storage Gen2, Azure Stream Analytics, Azure Databricks, and others. The course focuses on common data engineering tasks such as orchestrating data transfer and transformation pipelines, working with data files in a data lake, creating and loading relational data warehouses, capturing and aggregating streams of real-time data, and tracking data assets and lineage.

The primary audience for this course is data professionals, data architects, and business intelligence professionals who want to learn about data engineering and building analytical solutions using data platform technologies that exist on Microsoft Azure. The secondary audience for this course includes data analysts and data scientists who work with analytical solutions built on Microsoft Azure.

Contact Us

Address:
U2U nv/sa
Z.1. Researchpark 110
1731 Zellik (Brussels)
BELGIUM
Phone: +32 2 466 00 16
Email: info@u2u.be
Monday - Friday: 9:00 - 17:00
Saturday - Sunday: Closed

Developer and IT Training

Data Engineering on Microsoft Azure

dp-203

5 days

Upcoming Sessions

Introduction to data engineering on Azure

Introduction to Azure Data Lake Storage Gen2

Introduction to Azure Synapse Analytics

Use Azure Synapse serverless SQL pool to query files in a data lake

Use Azure Synapse serverless SQL pools to transform data in a data lake

Create a lake database in Azure Synapse Analytics

Analyze data with Apache Spark in Azure Synapse Analytics

Transform data with Spark in Azure Synapse Analytics

Use Delta Lake in Azure Synapse Analytics

Analyze data in a relational data warehouse

Load data into a relational data warehouse

Build a data pipeline in Azure Synapse Analytics

Use Spark Notebooks in an Azure Synapse Pipeline

Plan hybrid transactional and analytical processing using Azure Synapse Analytics

Implement Azure Synapse Link with Azure Cosmos DB

Implement Azure Synapse Link for SQL

Get started with Azure Stream Analytics

Ingest streaming data using Azure Stream Analytics and Azure Synapse Analytics

Visualize real-time data with Azure Stream Analytics and Power BI

Introduction to Microsoft Purview

Integrate Microsoft Purview and Azure Synapse Analytics

Explore Azure Databricks

Use Apache Spark in Azure Databricks

Run Azure Databricks Notebooks with Azure Data Factory

Contact Us

Say Hi