Trending Courses
Vendors
Agile & Scrum
POPULAR COURSES
Read More In Blog
#BeCyberSmart with Microsoft: Cybersecurity Awareness Month 2024
AI & Machine Learning
Analytics & Data Management
Big Data
Business Application
Cloud Computing
Cyber Security
Database Admin & Dev
Data Engineering & Science
DevOps
Digital Transformation
IT Governance
IT Infrastructure
IT Service Management
Networking
Programming & Development
Project Management
Virtualization
Overview
This course explores software engineering best practices and DevOps principles, specifically designed for data engineers working with Databricks. Participants will build a strong foundation in key topics such as code quality, version control, documentation, and testing. The course emphasizes DevOps, covering core components, benefits, and the role of continuous integration and delivery (CI/CD) in optimizing data engineering workflows.You will learn how to apply modularity principles in PySpark to create reusable components and structure code efficiently. Hands-on experience includes designing and implementing unit tests for PySpark functions using the pytest framework, followed by integration testing for Databricks data pipelines with DLT and Workflows to ensure reliability.The course also covers essential Git operations within Databricks, including using Databricks Git Folders to integrate continuous integration practices. Finally, you will take a high level look at various deployment methods for Databricks assets, such as REST API, CLI, SDK, and Databricks Asset Bundles (DABs), providing you with the knowledge of techniques to deploy and manage your pipelines.By the end of the course, you will be proficient in software engineering and DevOps best practices, enabling you to build scalable, maintainable, and efficient data engineering solutions.
Software Engineering Best Practices, DevOps, and CI/CD Fundamentals Continuous Integration (CI) Introduction to Continuous Deployment (CD)
This course is designed for professionals who:
Coming Soon...
Module 1. Software Engineering Best Practices, DevOps, and CI/CD Fundamentals Introduction to Software Engineering (SWE) Best Practices Introduction to Modularizing PySpark Code Modularizing PySpark Code DevOps Fundamentals The Role of CI/CD in DevOps Knowledge Check/Discussion Module 2. Continuous Integration (CI) Planning the Project Project Setup Exploration Introduction to Unit Tests for PySpark Creating and Executing Unit Tests Executing Integration Tests with DLT and Workflows Performing Integration Tests with DLT and Workflows Version Control with Git Overview Module 3. Introduction to Continuous Deployment (CD) Deplyoying Databricks Assets Overview (slides) Deploying the Databricks Project
Module 1. Software Engineering Best Practices, DevOps, and CI/CD Fundamentals
Module 2. Continuous Integration (CI)
Module 3. Introduction to Continuous Deployment (CD)
Level-up by partnering with Trainocate. Get in touch today.
By submitting this form, you consent to Trainocate processing your data to respond to your inquiry and provide you with relevant information about our training programs, including occasional emails with the latest news, exclusive events, and special offers.
You can unsubscribe from our marketing emails at any time. Our data handling practices are in accordance with our Privacy Policy.