In this course, the student will learn how to implement and manage data engineering workloads on Microsoft Azure, using Azure services such as Azure Synapse Analytics, Azure Data Lake Storage Gen2, Azure Stream Analytics, Azure Databricks, and others. The course focuses on common data engineering tasks such as orchestrating data transfer and transformation pipelines, working with data files in a data lake, creating and loading relational data warehouses, capturing and aggregating streams of real-time data, and tracking data assets and lineage.

What You'll Learn

  • Identify common data engineering tasks
  • Describe common data engineering concepts
  • Identify Azure services for data engineering
  • Describe the key features and benefits of Azure Data Lake Storage Gen2
  • Enable Azure Data Lake Storage Gen2 in an Azure Storage account
  • Compare Azure Data Lake Storage Gen2 and Azure Blob storage
  • Describe where Azure Data Lake Storage Gen2 fits in the stages of analytical processing
  • Describe how Azure Data Lake Storage Gen2 is used in common analytical workloads
  • Identify the business problems that Azure Synapse Analytics addresses.
  • Describe core capabilities of Azure Synapse Analytics.
  • Determine when to use Azure Synapse Analytics.
  • Identify capabilities and use cases for serverless SQL pools in Azure Synapse Analytics
  • Query CSV, JSON, and Parquet files using a serverless SQL pool
  • Create external database objects in a serverless SQL pool
  • Use a CREATE EXTERNAL TABLE AS SELECT (CETAS) statement to transform data.
  • Encapsulate a CETAS statement in a stored procedure.
  • Include a data transformation stored procedure in a pipeline.
  • Understand lake database concepts and components
  • Describe database templates in Azure Synapse Analytics
  • Create a lake database
  • Identify core features and capabilities of Apache Spark.
  • Configure a Spark pool in Azure Synapse Analytics.
  • Run code to load, analyze, and visualize data in a Spark notebook.
  • Use Apache Spark to modify and save dataframes
  • Partition data files for improved performance and scalability.
  • Transform data with SQL
  • Describe core features and capabilities of Delta Lake.
  • Create and use Delta Lake tables in a Synapse Analytics Spark pool.
  • Create Spark catalog tables for Delta Lake data.
  • Use Delta Lake tables for streaming data.
  • Query Delta Lake tables from a Synapse Analytics SQL pool.
  • Design a schema for a relational data warehouse.
  • Create fact, dimension, and staging tables.
  • Use SQL to load data into data warehouse tables.
  • Use SQL to query relational data warehouse tables.
  • Load staging tables in a data warehouse
  • Load dimension tables in a data warehouse
  • Load time dimensions in a data warehouse
  • Load slowly changing dimensions in a data warehouse
  • Load fact tables in a data warehouse
  • Perform post-load optimizations in a data warehouse
  • Describe core concepts for Azure Synapse Analytics pipelines.
  • Create a pipeline in Azure Synapse Studio.
  • Implement a data flow activity in a pipeline.
  • Initiate and monitor pipeline runs.
  • Describe notebook and pipeline integration.
  • Use a Synapse notebook activity in a pipeline.
  • Use parameters with a notebook activity.
  • Describe Hybrid Transactional / Analytical Processing (HTAP) patterns.
  • Identify Azure Synapse Link services for HTAP.
  • Configure an Azure Cosmos DB Account to use Azure Synapse Link.
  • Create an analytical store enabled container.
  • Create a linked service for Azure Cosmos DB.
  • Analyze linked data using Spark.
  • Analyze linked data using Synapse SQL.
  • Understand key concepts and capabilities of Azure Synapse Link for SQL.
  • Configure Azure Synapse Link for Azure SQL Database.
  • Configure Azure Synapse Link for Microsoft SQL Server.
  • Understand data streams.
  • Understand event processing.
  • Understand window functions.
  • Get started with Azure Stream Analytics.
  • Describe common stream ingestion scenarios for Azure Synapse Analytics.
  • Configure inputs and outputs for an Azure Stream Analytics job.
  • Define a query to ingest real-time data into Azure Synapse Analytics.
  • Run a job to ingest real-time data, and consume that data in Azure Synapse Analytics.
  • Configure a Stream Analytics output for Power BI.
  • Use a Stream Analytics query to write data to Power BI.
  • Create a real-time data visualization in Power BI.
  • Evaluate whether Microsoft Purview is appropriate for data discovery and governance needs.
  • Describe how the features of Microsoft Purview work to provide data discovery and governance.
  • Catalog Azure Synapse Analytics database assets in Microsoft Purview.
  • Configure Microsoft Purview integration in Azure Synapse Analytics.
  • Search the Microsoft Purview catalog from Synapse Studio.
  • Track data lineage in Azure Synapse Analytics pipeline activities.
  • Provision an Azure Databricks workspace.
  • Identify core workloads and personas for Azure Databricks.
  • Describe key concepts of an Azure Databricks solution.
  • Describe key elements of the Apache Spark architecture.
  • Create and configure a Spark cluster.
  • Describe use cases for Spark.
  • Use Spark to process and analyze data stored in files.
  • Use Spark to visualize data.
  • Describe how Azure Databricks notebooks can be run in a pipeline.
  • Create an Azure Data Factory linked service for Azure Databricks.
  • Use a Notebook activity in a pipeline.
  • Pass parameters to a notebook.

Who Should Attend

The primary audience for this course is data professionals, data architects, and business intelligence professionals who want to learn about data engineering and building analytical solutions using data platform technologies that exist on Microsoft Azure. The secondary audience for this course is data analysts and data scientists who work with analytical solutions built on Microsoft Azure.

Prerequisites

Please review the prerequisites listed for each module in the course content.

1. Introduction to data engineering on Azure

Microsoft Azure provides a comprehensive platform for data engineering, but what is data engineering? Complete this module to find out.

2. Introduction to Azure Data Lake Storage Gen2

Data lakes are a core element of data analytics architectures. Azure Data Lake Storage Gen2 provides a scalable, secure, cloud-based solution for data lake storage.

3. Introduction to Azure Synapse Analytics

Learn about the features and capabilities of Azure Synapse Analytics, a cloud-based platform for big data processing and analysis.

4. Use Azure Synapse serverless SQL pool to query files in a data lake

With Azure Synapse serverless SQL pool, you can leverage your SQL skills to explore and analyze data in files, without the need to load the data into a relational database.

5. Use Azure Synapse serverless SQL pools to transform data in a data lake

By using a serverless SQL pool in Azure Synapse Analytics, you can use the ubiquitous SQL language to transform data in files in a data lake.

6. Create a lake database in Azure Synapse Analytics

Why choose between working with files in a data lake or a relational database schema? With lake databases in Azure Synapse Analytics, you can combine the benefits of both.

7. Analyze data with Apache Spark in Azure Synapse Analytics

Apache Spark is a core technology for large-scale data analytics. Learn how to use Spark in Azure Synapse Analytics to analyze and visualize data in a data lake.

8. Transform data with Spark in Azure Synapse Analytics

Data engineers commonly need to transform large volumes of data. Apache Spark pools in Azure Synapse Analytics provide a distributed processing platform that they can use to accomplish this goal.

9. Use Delta Lake in Azure Synapse Analytics

Delta Lake is an open-source storage layer that adds relational table semantics to Spark, and that you can use to implement a data lakehouse architecture in Azure Synapse Analytics.

10. Analyze data in a relational data warehouse

Relational data warehouses are a core element of most enterprise Business Intelligence (BI) solutions, and are used as the basis for data models, reports, and analysis.

11. Load data into a relational data warehouse

A core responsibility for a data engineer is to implement a data ingestion solution that loads new data into a relational data warehouse.

12. Build a data pipeline in Azure Synapse Analytics

Pipelines are the lifeblood of a data analytics solution. Learn how to use Azure Synapse Analytics pipelines to build integrated data solutions that extract, transform, and load data across diverse systems.

13. Use Spark Notebooks in an Azure Synapse Pipeline

Apache Spark provides data engineers with a scalable, distributed data processing platform, which can be integrated into an Azure Synapse Analytics pipeline.

14. Plan hybrid transactional and analytical processing using Azure Synapse Analytics

Learn how hybrid transactional / analytical processing (HTAP) can help you perform operational analytics with Azure Synapse Analytics.

15. Implement Azure Synapse Link with Azure Cosmos DB

Azure Synapse Link for Azure Cosmos DB enables HTAP integration between operational data in Azure Cosmos DB and Azure Synapse Analytics runtimes for Spark and SQL.

16. Implement Azure Synapse Link for SQL

Azure Synapse Link for SQL enables low-latency synchronization of operational data in a relational database to Azure Synapse Analytics.

17. Get started with Azure Stream Analytics

Azure Stream Analytics enables you to process real-time data streams and integrate the data they contain into applications and analytical solutions.

18. Ingest streaming data using Azure Stream Analytics and Azure Synapse Analytics

Azure Stream Analytics provides a real-time data processing engine that you can use to ingest streaming event data into Azure Synapse Analytics for further analysis and reporting.

19. Visualize real-time data with Azure Stream Analytics and Power BI

By combining the stream processing capabilities of Azure Stream Analytics and the data visualization capabilities of Microsoft Power BI, you can create real-time data dashboards.

20. Introduction to Microsoft Purview

In this module, you'll evaluate whether Microsoft Purview is the right choice for your data discovery and governance needs.

21. Integrate Microsoft Purview and Azure Synapse Analytics

Learn how to integrate Microsoft Purview with Azure Synapse Analytics to improve data discoverability and lineage tracking.

22. Explore Azure Databricks

Azure Databricks is a cloud service that provides a scalable platform for data analytics using Apache Spark.

23. Use Apache Spark in Azure Databricks

Azure Databricks is built on Apache Spark and enables data engineers and analysts to run Spark jobs to transform, analyze, and visualize data at scale.

24. Run Azure Databricks Notebooks with Azure Data Factory

Using pipelines in Azure Data Factory to run notebooks in Azure Databricks enables you to automate data engineering processes at cloud scale.

Skills Measured

  • Design and implement data storage
  • Develop data processing
  • Secure, monitor, and optimize data storage and data processing

Frequently Asked Questions (FAQs)

  • Why get Microsoft certified?

    Microsoft certifications validate your skills and expertise in Microsoft technologies and solutions, demonstrating your ability to design, implement, and manage cutting-edge technologies.

    These certifications are globally recognized and highly sought after by employers, as they signify your proficiency in using Microsoft products and services to drive innovation and solve business challenges.

    Microsoft-certified professionals are in high demand, opening doors to new career opportunities and higher earning potential.

  • What should I expect from the examination?

    Microsoft certification exams are designed to assess your knowledge and skills in specific Microsoft technologies and solutions.

    Exams typically consist of multiple-choice, multiple-select, and case study questions, and some may include lab simulations to evaluate your practical skills.

    Note: Certification requirements and policies may be updated by Microsoft from time to time. We apologize for any discrepancies; do get in touch with us if you have any questions.

  • How long is a Microsoft certification valid?

    Most Microsoft role-based and specialty certifications are valid for one year from the date of passing the exam.

    To maintain your certification, you will need to renew it annually by passing a free online assessment on Microsoft Learn.

    However, Microsoft Applied Skills credentials and Fundamentals certifications do not expire.

    Note: Certification requirements and policies may be updated by Microsoft from time to time. We apologize for any discrepancies; do get in touch with us if you have any questions.

  • Why take this course with Trainocate?

    Here’s what sets us apart:

    - Global Reach, Localized Accessibility: Benefit from our geographically diverse training hubs in 16 countries (and counting!).

    - Top-Rated Instructors: Our team of subject matter experts (with high average CSAT and MTM scores) is passionate about helping you accelerate your digital transformation.

    - Customized Training Solutions: Choose from on-site, virtual classrooms, or self-paced learning to fit your organization and individual needs.

    - Experiential Learning: Dive into interactive training with our curated lesson plans. Participate in hands-on labs, solve real-world challenges, and take on comprehensive assessments.

    - Learn From The Best: With 30+ authorized training partnerships and countless awards from Microsoft, AWS, and Google, you're guaranteed to learn from the industry's elite.

    - Your Bridge To Success: We provide up-to-date course materials, helpful exam guides, and dedicated support to validate your expertise and elevate your career.

Improve yourself and your career by taking this course.

Ready to Take Your Business from Great to Awesome?

Level-up by partnering with Trainocate. Get in touch today.
