Module 2: Delta Lake 1.2 Tutorial with Jacek Laskowski

Delta Lake
Tue, May 31, 9:30 AM (PDT)

About this event

Join us for Module 2: DML and Schema - Tuesday, May 31

-Create, Insert, Update, Delete, Merge

-Schema Enforcement and Evolution


This 3-part workshop teaches you what Delta Lake is and how to use Apache Spark™ and Delta Lake in your data architectures to build reliable, large-scale distributed data pipelines. It covers the Delta Lake features that, alongside Spark SQL and Spark Structured Streaming, bring ACID transactions and time travel (data versioning) to your batch and streaming ETL workloads. Together, the slides, demos, exercises, and Q&A sessions should help you understand the concepts behind the modern data lakehouse architecture.


Requirements

-Sign up for Databricks Community Edition

-Experience with Apache Spark SQL and Python (PySpark) is recommended


Register and Attend the Full Series!

Module 3: Tuesday, June 14: SQL and the Transaction Log
