DataHub / Tech Deep Dives


These are all the meetings we have in "Tech Deep Dives" (part of the organization "DataHub"). Click into individual meeting pages to watch the recording and search or read the transcript.

23 Mar 2023

Hyejin Yoon (Acryl Data) gives an overview of DataHub's various APIs and new use-case-oriented guides.

Learn more about DataHub: https://datahubproject.io
Join us on Slack: http://slack.datahubproject.io
Follow us on Twitter: https://twitter.com/datahubproject
  • 1 participant
  • 7 minutes
api
apis
data
graphql
hub
metadata
sdk
demo
creating
github

12 Aug 2022

Shirshanka Das (Acryl Data) shares recent speed and functionality improvements to ingesting metadata from Snowflake during the July 2022 Town Hall.

  • 1 participant
  • 7 minutes
connector
merge
snowflake
access
metadata
users
hub
layered
streams
bigquery

27 May 2022

John Joyce & Tamás Németh go in-depth about how you can use DataHub + Airflow + Great Expectations to scalably address data reliability.

  • 2 participants
  • 28 minutes
reliability
reliable
validations
data
datahub
quality
care
thinking
important
question

20 May 2022

Ryan Holstien (Acryl Data) shares details about how we're making it easier for developers to interface with DataHub via OpenAPI during the April 2022 Town Hall.

  • 1 participant
  • 8 minutes
apis
api
endpoint
metadata
client
json
schemas
sdk
rest
deletes

19 May 2022

Surya Lanka & Pedro Silva (Acryl Data) share the latest advancements in managing deletes within DataHub during the April Town Hall.

  • 2 participants
  • 11 minutes
deletion
deleting
deletes
rollback
rollbacks
delete
deleted
processing
registry
data

16 May 2022

Kartik Darapuneni (Included Health) shares his experience embedding Looker, Querybook, and Jupyter into DataHub.

  • 2 participants
  • 15 minutes
datahub
healthcare
included
managed
workflow
help
community
functionality
kartik
important

13 May 2022

John Joyce (Acryl Data) shares the new Actions Framework for developing & deploying real-time outbound integrations with DataHub during the April 2022 Town Hall.

  • 1 participant
  • 16 minutes
datahub
dataset
data
hub
workflow
implementing
handling
integrate
capabilities
activity

18 Apr 2022

David Leifker (Zendesk) gives a demo of the new Protobuf Ingestion Source during the March Town Hall.

  • 2 participants
  • 14 minutes
workflow
schemas
protobuf
hub
datahub
context
process
documentation
annotate
zendesk

8 Mar 2022

Edward Vaisman (Wavelo) gives a demo of how you can define Dataset-to-Dataset lineage via YAML during the February 2022 Community Town Hall.

  • 1 participant
  • 7 minutes
datahub
wavelo
datawi
hub
providers
streaming
microservices
docker
slack
kafka

9 Dec 2021

Tamás Németh (Acryl Data) talks about how DataHub can now automatically extract Table, View, and S3 lineage using Redshift system tables.

This functionality is available as of v0.8.18.
  • 2 participants
  • 14 minutes
lineager
sdscan
redshift
scan
ratchet
schema
registry
cluster
collectors
guide

9 Dec 2021

John Joyce (Acryl Data) gives a deep dive into DataHub Metadata Service Authentication during the November 2021 Community Town Hall.

Referenced Links:

https://datahubproject.io/docs/how/auth/jaas
https://datahubproject.io/docs/how/auth/sso/configure-oidc-react/
https://github.com/linkedin/datahub/blob/681ed91a0006a2d20535c0d5c30f0a68afcfab9f/docs/introducing-metadata-service-authentication.md
  • 1 participant
  • 11 minutes
authentication
authenticated
authenticator
proxy
authorization
metadata
hosted
datahub
hub
sso

24 Sep 2021

Shirshanka Das and Maggie Hays from Acryl Data review recent improvements to the DataHub Looker connector.
  • 2 participants
  • 10 minutes
looker
connects
project
collaborate
local
repository
views
complicated
hub
meta

24 Sep 2021

Surya Lanka and Shirshanka Das (Acryl Data) give a demo of how stateful ingestion works in DataHub, ensuring that you ingest only net-new metadata to minimize redundancy and optimize ingestion performance.
  • 2 participants
  • 13 minutes
connector
snowflake
current
configuration
demoing
users
information
start
injection
scheduler

27 Aug 2021

Dexter Lee (Acryl Data) describes how DataHub is being instrumented for supporting performance monitoring use-cases.

Note: This was a session that was scheduled to be presented live at the townhall, but we couldn't accommodate it due to time concerns. Dexter was kind enough to record it later to share with the community.
  • 1 participant
  • 9 minutes
monitoring
data
dashboards
metadata
endpoints
telemetry
testing
throughput
elasticsearch
jms

27 Aug 2021

John Joyce (Acryl Data) gives an update on recent improvements to fine-grained access control during the DataHub Community Town Hall on August 27, 2021.
  • 2 participants
  • 16 minutes
manage
policies
privileges
access
restrict
controls
datahub
overview
scope
hub

27 Aug 2021

John Joyce (Acryl Data) provides an update on user and group management during the DataHub Community Town Hall on August 27, 2021.

Recent developments focus on:
- New user ingestion sources! Okta & Azure AD Batch
- Just-in-Time User & Group Provisioning with OIDC - when users log in, we will automatically provision an account if they do not already have one
- Within the UI, Groups are now searchable and Group Members appear on the Groups Page
  • 2 participants
  • 6 minutes
provisioned
users
datahub
manage
authentication
onboarding
ui
oidc
group
integration

5 Jul 2021

Harshal Sheth (Acryl Data) describes how Dataset Popularity is implemented in DataHub and gives a demo.
  • 2 participants
  • 15 minutes
popularity
data
querying
information
users
varies
analysis
understand
rely
enterprises

5 Jul 2021

John Joyce and Gabe Lyons from Acryl Data present simplifications in deploying DataHub at the Community Townhall on June 25.
- Standalone mode can run on much less memory and requires fewer containers
- Neo4j is now optional; DataHub can use Elastic as the graph backend.
  • 7 participants
  • 1:01 hours
community
discussions
thoughtworks
initiative
2021
analytics
annotations
slack
saxo
backend

27 May 2021

John Joyce (Acryl Data) presents a tech deep dive on the new support for no-code metadata modeling in DataHub, to be released as part of release 0.8.0.
  • 2 participants
  • 26 minutes
metadata
configuration
backend
gcp
details
code
capabilities
problem
boilerplate
contributor