youtube image
From YouTube: Tempo: Distributed Time Series Analysis with Apache Sparkā„¢ and Delta Lake

Description

In this talk, Tristan Nixon, a Solutions Architect at Databricks and Ricardo Portilla, Lead Solutions Architect at Databricks, will demonstrate how data teams can leverage an open-source package tempo (available in Python and Scala) to advance time series use cases with Delta Lake and Spark.

In particular, we will show you how resampling to AS OF joins, and descriptive analytics of up to millions of time series can be done in parallel using a simple interface.

Speakers:

Tristan Nixon is a Solution Architect at Databricks. Tristan has been working in Data-science and ML engineering for over 15 years, in industries from Education to Telecoms and Chemical Manufacturing. He joined Databricks about a year ago where he acts as an SME for time series and Natural Language Processing (NLP).

Ricardo Portilla is a Solutions Architect at Databricks. Ricardo works with data teams to put data engineering, data analytics, and data science use cases into production. He as been at Databricks for ~3 years helping customers with use cases in all verticals and previously worked in the financial industry for 7 years. Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. https://databricks.com/databricks-named-leader-by-gartner