Interested in a private company training? Request it here.
The cloud requires to reconsider some of the choices made for on-premisses data handling. This module introduces the different services in Azure that can be used for data processing, and compares them to the traditional on-premisses data stack.
This module discusses the different types of storage available in Azure Storage as well as data lake storage. Also some of the tools to load and manage files in Azure storage and Data lake storage are covered.
Azure SQL Databases have their limitations in compute power since they run on a single machine, and their size is limited to the Terabyte range. Azure Data Warehouse is a service aiming at an analytical workload on data volumes hundreds of times larger than what Azure SQL databases can handle. Yet at the same time we can keep on using the familiar T-SQL query language, or we can connect traditional applications such as Excel and Management Studio to interact with this service. Both storage and compute can be scaled independently.
Azure Databricks allows us to use the power of Spark without the configuration hassle of Hadoop clusters. Using popular languages such as Python, SQL and R data can be loaded, visualized, transformed and analyzed via interactive notebooks.
Power BI is a product that covers many things: Power BI in Excel, Power BI Desktop, the Power BI online service, ... . In this introduction we set the scene for the rest of the training, introducing the different aspects of the Power BI product.
In this chapter you will learn how to create queries in Power BI Desktop to extract data from source systems like SQL Server, Oracle, Excel files, CSV files,... Applying transformations that can filter, sort and clean the extracted data will also be covered.
Participants learn why and how to create a data model, including data sorting, data types and formatting, hiding tables and columns, creating hierarchies, ... . We will also have a look at the storage of data in import mode, including the techniques that are used for compression. We will see the Vertipaq analyzer as a tool to analyze and reduce storage costs.
This course focusses on developers, administrators and project managers who are developing new data centric applications in the Microsoft Azure cloud. Some familiarity with relational database systems such as SQL Server is handy.