Module 1: Delta Lake 1.2 Tutorial with Jacek Laskowski

Start: 2022-05-19T09:00:00-07:00
End: 2022-05-19T10:00:00-07:00

Delta Lake

May 19, 2022, 4:00 – 5:00 PM (UTC)

206 RSVPs

About this event

Join us on Thursday, May 19, for Module 1: Introduction to Delta Lake:
-Bringing Reliability to Data Lakes (Concepts)
-Convert existing tables to Delta Lake [SQL]
-Unified Batch and Streaming [Python, SQL]

This three-part workshop teaches what Delta Lake is and how to use Apache Spark™ and Delta Lake in your data architecture to build reliable, large-scale distributed data pipelines. It covers the Delta Lake features that, alongside Spark SQL and Spark Structured Streaming, bring ACID transactions and time travel (data versioning) to your batch ETL and streaming workloads. Slides, demos, exercises, and Q&A sessions are designed to work together to help you understand the concepts behind the modern data lakehouse architecture.

Requirements
-Sign up for Databricks Community Edition
-Experience with Apache Spark SQL and Python (PySpark) is recommended

Register and Attend the Full Series!
-Module 2: Tuesday, May 31: DML and Schema
-Module 3: Tuesday, June 14: SQL and the Transaction Log

Speakers

Jacek Laskowski

IT Freelancer for Apache Spark, Delta Lake, Apache Kafka & Kafka Streams

Denny Lee

Databricks

Developer Advocate

Organizer

Carly Akerly

Marketing Manager
