MFE-IT

Data Engineering Training course with Azure Data Lake, Synapse, Data Factory and Spark

Reference: SM/EN/203
2,420 € excl. VAT
4 Days
28 h
4‘ Feb. 2027
Remote
1‘ Mar. 2027
Remote
1‒12 Apr. 2027
Remote
29 Mar.‒31 May 2027
Remote
4‒28 Jun. 2027
Remote
8–5 Jul. 2027
Remote

Each session will take place even if only one person is registered (except in cases of force majeure).

Training Microsoft DP-203

Description of this training Microsoft DP-203

With the rise of big data and distributed architectures, companies need engineers who can design, automate, and manage reliable and scalable data pipelines.
This training covers key Azure technologies: Data Lake, Synapse Analytics, Data Factory, Azure SQL, Stream Analytics, and Spark.

Through a project-based approach, you will learn how to ingest, store, transform and secure data, whether in batch or real time. Also discover our Training course SC-300 : Identity and Access Management with Azure AD and Microsoft Entra and our Microsoft 365 Security Training.

Format

Remote (recorded sessions). 

GOOD TO KNOW

This training course includes numerous exercises (60% practical) to enhance learning. Each session will take place even if only one person is registered (except in cases of force majeure). A preliminary interview is held between the participant and/or a company representative in order to fully assess the participant’s profile (level, needs, professional context, challenges, etc.).
Assessment : during the training course, the trainer assesses the participants’ progress through multiple-choice questions, role-playing exercises and practical work. Participants receive a certificate of completion at the end of the training course. 

objectives of this training course Microsoft DP-203

By the end, each participant will be able to :

  • Understanding the architecture of data solutions in Azure
  • Designing ingestion pipelines with Azure Data Factory and Synapse
  • Working with structured and unstructured data (SQL, NoSQL, JSON, Parquet)
  • Implementing analytics solutions with Azure Synapse and Apache Spark
  • Managing data security, privacy, and governance

Prerequisites

  • Experience in data processing, SQL, ETL or BI
  • Basic knowledge of cloud computing and the Azure platform
  • Optional but useful : development skills (Python, Spark, Scala)
  • Because each participant is unique, a personalised interview with our expert allows us to design a training course that is perfectly aligned with their objectives, level and professional challenges.

Target Audience

Data teams, BI developers, cloud consultants, and data solution architects in Azure.

Detailed of this training course Microsoft DP-203

Fundamentals of data architecture on Azure

Overview of services (Synapse, Data Lake, Datalake Gen2, Storage, SQL, Cosmos DB)

Pipelines, triggers and Mapping Dataflows. Cloud and hybrid integration scenarios. Connectors and self-hosted integration runtimes.

Spark pools, Notebooks, exploratory analysis, cleaning, joins, Parquet/CSV formats.

RBAC access, Azure Purview, dynamic masking, key management, GDPR compliance.

Azure Stream Analytics, Event Hub, stream processing, Lambda architecture.

  • Covers the entire data lifecycle on Azure (ingestion → storage → analysis)
  • Strong practical focus, geared towards real data and cloud projects
  • Includes modelling, security, batch and stream processing, all in a single course

FAQ – Microsoft DP-203 Azure Data Engineer Training

DP-203 was Microsoft’s Azure Data Engineer Associate certification covering Azure Data Lake Storage, Synapse Analytics, Data Factory, Spark, and Stream Analytics. As of 2024, Microsoft retired DP-203 and replaced it with DP-700 (Microsoft Fabric Data Engineer Associate), reflecting Fabric’s central role in Microsoft’s modern data stack. MFE-IT trains data engineers on the data engineering technologies originally covered in DP-203 and their evolution to Fabric.

Azure Data Lake Storage Gen2 (ADLS) is Microsoft’s hyperscale data lake service built on Azure Blob Storage with hierarchical namespace and POSIX-style permissions optimized for analytics workloads. It is the standard storage foundation for Synapse, Databricks, and Microsoft Fabric data lakehouse architectures. The MFE-IT DP-203 training covers ADLS architecture, security, and integration with the broader analytics stack.

Azure Synapse Analytics is a unified analytics service combining a SQL data warehouse, Apache Spark pools, data integration (Data Factory pipelines), and analytics tooling in a single workspace. It targets enterprise data warehousing and lakehouse scenarios. Microsoft is gradually steering Synapse customers toward Microsoft Fabric for new projects. Through MFE-IT’s hands-on approach, learners use Synapse end to end and understand the migration path to Fabric.

DP-203 was retired by Microsoft in 2024 and replaced by the DP-700 Microsoft Fabric Data Engineer Associate certification, which covers a more modern stack centered on Microsoft Fabric. Our MFE-IT training course originally based on DP-203 has been updated to reflect this transition while still covering the underlying Azure technologies (ADLS, Spark, Synapse) that remain relevant.

Would you like to know about upcoming sessions ?

Would you like to schedule this Training Course on a specific date ? Contact us by email or by filling out the contact form.