From YouTube: Tech Talk | Addressing GDPR and CCPA Scenarios with Delta Lake and Apache Spark™

Description

Join us for an online tech talk on Delta Lake. Tech talks include a technical presentation with slides and a demo, with time for Q&A at the end.

Abstract:
The General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA) both aim to guarantee strong protection for individuals regarding their personal data. They apply to businesses that collect, use, or share consumer data, whether the information was obtained online or offline. Compliance remains a top priority for companies, which are spending considerable time and resources on meeting GDPR and CCPA requirements.

Your organization may manage hundreds of terabytes worth of personal information in your cloud. Bringing these datasets into GDPR and CCPA compliance is of paramount importance, but this can be a big challenge, especially for larger datasets stored in data lakes.

Learn how you can use Delta Lake, which was created by Databricks and is powered by Apache Spark™, to manage GDPR and CCPA compliance for your data lake. Because Delta Lake adds a transactional layer that provides structured data management on top of your data lake, it can dramatically simplify and accelerate your ability to locate and remove personal information (also known as “personal data”) in response to consumer GDPR or CCPA requests, without disrupting your data pipelines.

Join our Tech Talk to learn:
- The compliance challenges big data and data lakes create for organizations.
- How Delta Lake improves data lake management and makes it possible to quickly find and surgically remove or modify individual records.
- Best practices for GDPR and CCPA compliance using Delta Lake.
- How to use pseudonymization (https://en.wikipedia.org/wiki/Pseudonymization) and structure pipelines so that you can locate and remove an identifier, destroying the linkage between the pseudonyms and the identifiers.
- Demo on how to easily fulfill data requests with Delta Lake and Databricks.
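
The pseudonymization idea in the list above can be illustrated with a minimal sketch. It is not the pipeline shown in the talk: plain in-memory Python structures stand in for the lookup table and the downstream table, and every name here (`PseudonymRegistry`, `record_event`, the example email addresses) is hypothetical. The pseudonym is a random token rather than a hash of the identifier, on the assumption that it should not be recomputable once the mapping is deleted.

```python
import secrets


class PseudonymRegistry:
    """Hypothetical stand-in for a lookup table that maps identifiers to pseudonyms."""

    def __init__(self):
        self._by_identifier = {}

    def pseudonym_for(self, identifier):
        # Issue a random token the first time an identifier is seen.
        # The token is not derived from the identifier, so it cannot be
        # recomputed after the mapping row has been removed.
        if identifier not in self._by_identifier:
            self._by_identifier[identifier] = secrets.token_hex(16)
        return self._by_identifier[identifier]

    def lookup(self, identifier):
        # Return the existing pseudonym, or None if there is no mapping.
        return self._by_identifier.get(identifier)

    def forget(self, identifier):
        # Remove the mapping row; returns True if a row was actually removed.
        return self._by_identifier.pop(identifier, None) is not None


registry = PseudonymRegistry()
events = []  # downstream "table": stores pseudonyms only, never the identifier


def record_event(email, action):
    events.append({"user": registry.pseudonym_for(email), "action": action})


record_event("alice@example.com", "login")
record_event("bob@example.com", "login")
record_event("alice@example.com", "purchase")

# Access request: locate the person's rows through the lookup table.
alice_token = registry.lookup("alice@example.com")
alice_rows = [e for e in events if e["user"] == alice_token]

# Erasure request: deleting the single mapping row destroys the linkage.
# The event rows remain, but can no longer be tied back to the identifier.
registry.forget("alice@example.com")
```

In this sketch only the small lookup structure has to be edited to honor an erasure request. Whether the downstream rows also need to be deleted depends on what other personal data they contain.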

Agenda: 10AM PDT - 11AM PDT (GMT-7)

10:00AM - 10:50AM - Tech Talk
10:50AM - 11:00AM - Q&A

To join the live chat, check out the meetup page: https://www.meetup.com/data-ai-online/events/270370715/