Vendors

This course provides a comprehensive introduction to Lakeflow Connect as a scalable and simplified solution for ingesting data into Databricks from a variety of data sources. You will begin by exploring the different types of connectors within Lakeflow Connect (Standard and Managed), learn about various ingestion techniques, including batch, incremental batch, and streaming, and then review the key benefits of Delta tables and the Medallion architecture.

From there, you will gain practical skills to efficiently ingest data from cloud object storage using Lakeflow Connect Standard Connectors with methods such as CREATE TABLE AS (CTAS), COPY INTO, and Auto Loader, along with the benefits and considerations of each approach. You will then learn how to append metadata columns to your bronze level tables during ingestion into the Databricks data intelligence platform. This is followed by working with the rescued data column, which handles records that don’t match the schema of your bronze table, including strategies for managing this rescued data.

The course also introduces techniques for ingesting and flattening semi-structured JSON data, as well as enterprise-grade data ingestion using Lakeflow Connect Managed Connectors.

Finally, learners will explore alternative ingestion strategies, including MERGE INTO operations and leveraging the Databricks Marketplace, equipping you with foundational knowledge to support modern data engineering ingestion.

 

img-course-overview.jpg

What You'll Learn

  • Cloud Storage Ingestion with LakeFlow Connect Standard Connectors
  • Introduction to Data Engineering in Databricks
  • Enterprise Data Ingestion with LakeFlow Connect Managed Connectors
  • Ingestion Alternatives

Who Should Attend

  • N/A
img-who-should-learn.png

Prerequisites

  • Basic understanding of the Databricks Data Intelligence platform, including Databricks Workspaces, Apache Spark, Delta Lake, the Medallion Architecture and Unity Catalog.
  • Experience working with various file formats (e.g., Parquet, CSV, JSON, TXT).
  • Proficiency in SQL and Python.
  • Familiarity with running code in Databricks notebooks.

Learning Journey

Coming Soon...

Introduction to Data Engineering in Databricks

  • Data Engineering in Databricks
  • What is Lakeflow Connect?
  • Delta Lake Review
  • Exploring the Lab Environment

Cloud Storage Ingestion with LakeFlow Connect Standard Connectors

  • Introduction to Data Ingestion from Cloud Storage
  • Appending Metadata Columns on Ingest
  • Working with the Rescued Data Column
  • Ingesting Semi-Structured Data: JSON

Enterprise Data Ingestion with LakeFlow Connect Managed Connectors

  • Ingesting Enterprise Data into Databricks Overview
  • Enterprise Data Ingestion with Lakeflow Connect

Ingestion Alternatives

  • Ingesting Data with Databricks Marketplace
  • Ingesting into Existing Delta Tables
  • Data Ingestion with MERGE INTO

img-exam-cert

Frequently Asked Questions (FAQs)

None

Keep Exploring

Course Curriculum

Course Curriculum

Training Schedule

Training Schedule

Exam & Certification

Exam & Certification

FAQs

Frequently Asked Questions

img-improve-career.jpg

Improve yourself and your career by taking this course.

img-get-info.jpg

Ready to Take Your Business from Great to Awesome?

Level-up by partnering with Trainocate. Get in touch today.

Name
Email
Phone
I'm inquiring for
Inquiry Details

By submitting this form, you consent to Trainocate processing your data to respond to your inquiry and provide you with relevant information about our training programs, including occasional emails with the latest news, exclusive events, and special offers.

You can unsubscribe from our marketing emails at any time. Our data handling practices are in accordance with our Privacy Policy.