Data Engineering Training course with Azure Data Lake, Synapse, Data Factory and Spark

Reference: SM/EN/203

2,420 € excl. VAT

4 Days

28 h

4‘ Feb. 2027

Remote

1‘ Mar. 2027

Remote

1‒12 Apr. 2027

Remote

29 Mar.‒31 May 2027

Remote

4‒28 Jun. 2027

Remote

8–5 Jul. 2027

Remote

Each session will take place even if only one person is registered (except in cases of force majeure).

Description of this training Microsoft DP-203

With the rise of big data and distributed architectures, companies need engineers who can design, automate, and manage reliable and scalable data pipelines.
This training covers key Azure technologies: Data Lake, Synapse Analytics, Data Factory, Azure SQL, Stream Analytics, and Spark.

Through a project-based approach, you will learn how to ingest, store, transform and secure data, whether in batch or real time. Also discover our Training course SC-300 : Identity and Access Management with Azure AD and Microsoft Entra and our Microsoft 365 Security Training.

Format

Remote (recorded sessions).

GOOD TO KNOW

This training course includes numerous exercises (60% practical) to enhance learning. Each session will take place even if only one person is registered (except in cases of force majeure). A preliminary interview is held between the participant and/or a company representative in order to fully assess the participant’s profile (level, needs, professional context, challenges, etc.).
Assessment : during the training course, the trainer assesses the participants’ progress through multiple-choice questions, role-playing exercises and practical work. Participants receive a certificate of completion at the end of the training course.

This training course is part of our Cloud Computing Training. Discover our other cloud trainings to master architectures, services and best practices on AWS, Microsoft Azure, VMware and Google Cloud Platform.

objectives of this training course Microsoft DP-203

By the end, each participant will be able to :

Understanding the architecture of data solutions in Azure
Designing ingestion pipelines with Azure Data Factory and Synapse
Working with structured and unstructured data (SQL, NoSQL, JSON, Parquet)
Implementing analytics solutions with Azure Synapse and Apache Spark
Managing data security, privacy, and governance

Prerequisites

Experience in data processing, SQL, ETL or BI
Basic knowledge of cloud computing and the Azure platform
Optional but useful : development skills (Python, Spark, Scala)
Because each participant is unique, a personalised interview with our expert allows us to design a training course that is perfectly aligned with their objectives, level and professional challenges.

Target Audience

Data teams, BI developers, cloud consultants, and data solution architects in Azure.

Detailed of this training course Microsoft DP-203

Fundamentals of data architecture on Azure

Overview of services (Synapse, Data Lake, Datalake Gen2, Storage, SQL, Cosmos DB)

Data ingestion with Azure Data Factory

Pipelines, triggers and Mapping Dataflows. Cloud and hybrid integration scenarios. Connectors and self-hosted integration runtimes.

Modelling and transformation with Synapse and Spark

Spark pools, Notebooks, exploratory analysis, cleaning, joins, Parquet/CSV formats.

Data security and governance

RBAC access, Azure Purview, dynamic masking, key management, GDPR compliance.

Real-time and streaming data

Azure Stream Analytics, Event Hub, stream processing, Lambda architecture.

The advantages of this training course

Covers the entire data lifecycle on Azure (ingestion → storage → analysis)
Strong practical focus, geared towards real data and cloud projects
Includes modelling, security, batch and stream processing, all in a single course

FAQ – Microsoft DP-203 Azure Data Engineer Training

What is the DP-203 certification?

DP-203 was Microsoft’s Azure Data Engineer Associate certification covering Azure Data Lake Storage, Synapse Analytics, Data Factory, Spark, and Stream Analytics. As of 2024, Microsoft retired DP-203 and replaced it with DP-700 (Microsoft Fabric Data Engineer Associate), reflecting Fabric’s central role in Microsoft’s modern data stack. MFE-IT trains data engineers on the data engineering technologies originally covered in DP-203 and their evolution to Fabric.

What is Azure Data Lake Storage?

Azure Data Lake Storage Gen2 (ADLS) is Microsoft’s hyperscale data lake service built on Azure Blob Storage with hierarchical namespace and POSIX-style permissions optimized for analytics workloads. It is the standard storage foundation for Synapse, Databricks, and Microsoft Fabric data lakehouse architectures. The MFE-IT DP-203 training covers ADLS architecture, security, and integration with the broader analytics stack.

What is Azure Synapse Analytics?

Azure Synapse Analytics is a unified analytics service combining a SQL data warehouse, Apache Spark pools, data integration (Data Factory pipelines), and analytics tooling in a single workspace. It targets enterprise data warehousing and lakehouse scenarios. Microsoft is gradually steering Synapse customers toward Microsoft Fabric for new projects. Through MFE-IT’s hands-on approach, learners use Synapse end to end and understand the migration path to Fabric.

Is DP-203 still available?

DP-203 was retired by Microsoft in 2024 and replaced by the DP-700 Microsoft Fabric Data Engineer Associate certification, which covers a more modern stack centered on Microsoft Fabric. Our MFE-IT training course originally based on DP-203 has been updated to reflect this transition while still covering the underlying Azure technologies (ADLS, Spark, Synapse) that remain relevant.

Would you like to know about upcoming sessions ?

Would you like to schedule this Training Course on a specific date ? Contact us by email or by filling out the contact form.