youtube image
From YouTube: Tech Talk: Serverless CDC on GCP using Datastream and Delta Lake on Google Cloud

Description

Change data capture is a popular method for unobtrusively ingesting data from SQL sources. In this talk, we will show how to easily incorporate your SQL data sources in near-real-time into Databricks and Delta Lake on Google Cloud. We will provide a short introduction to change-data-capture, Google Datastream (serverless CDC on Google Cloud), Databricks, and Delta Lake. In addition, we will also give a walk-through of our new open source Spark Structured Streaming connector which provides an easy-to-use / configure method of linking Datastream to Delta Lake.

Quick links:
https://delta.io/
https://github.com/badal-io/datastream-deltalake-connector
https://databricks.com/blog/2022/02/03/google-datastream-integration-with-delta-lake-for-change-data-capture.html