Ceph / Ceph Days NYC 2023

These are all the meetings we have in "Ceph Days NYC 2023" (part of the organization "Ceph"). Click into individual meeting pages to watch the recording and search or read the transcript.

19 May 2023

Presented by: Chris MacNaughton

Building up a Ceph cluster can be a bit tricky and time-consuming, especially if it’s just for testing or a small home lab. To make this much easier, we’ve started working on MicroCeph. It’s a snap package with a small management daemon that makes clustering multiple systems very easy and, combined with a simple bootstrap process, lets you set up a Ceph cluster in just a few minutes!
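
For context, a rough sketch of that bootstrap flow, driven from Python via subprocess; the `microceph` subcommands follow the MicroCeph snap documentation and may differ between releases, and the node name and device path are placeholders.

    import subprocess

    def run(cmd):
        """Run a command and return its stdout, raising if it fails."""
        return subprocess.run(cmd, check=True, capture_output=True, text=True).stdout

    # On the first node: install the snap and bootstrap a one-node cluster.
    run(["sudo", "snap", "install", "microceph"])
    run(["sudo", "microceph", "cluster", "bootstrap"])

    # Register a second node; MicroCeph prints a join token that is passed
    # to `microceph cluster join <token>` on that node.
    token = run(["sudo", "microceph", "cluster", "add", "node-2"]).strip()
    print(f"Run on node-2: sudo microceph cluster join {token}")

    # Add a raw disk as an OSD (device path is an example).
    run(["sudo", "microceph", "disk", "add", "/dev/sdb", "--wipe"])
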
  • 3 participants
  • 11 minutes
daemons
cluster
deployments
servers
linux
gateways
microceph
host
snap
lvm

19 May 2023

Presented by: Neha Ojha & Josh Durgin

In this talk, we'll provide an update on the state of the Ceph upstream project, recent development efforts, current priorities, and community initiatives. We will share details of features released across components in the latest Ceph release, Quincy, and explain how this release is different from previous Ceph releases. The talk will also provide a sneak peek into features being planned for the next Ceph release, Reef.
  • 1 participant
  • 23 minutes
stuff
monitoring
dashboards
hosts
governance
cluster
community
project
improvements
deployments

19 May 2023

Presented by: Joachim Kraftmayer

Over the last few years, we have been gaining experience with Rook in production. One of our challenges was to implement dynamic resource management across 50+ Ceph clusters: Kubernetes events distribute load and capacity between the clusters dynamically and fully automatically. This is done by removing single or multiple Ceph nodes from a cluster while ensuring data integrity at all times; the released nodes are then integrated into other Ceph clusters as needed.
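
As an illustration of the data-integrity check such automation depends on, a minimal sketch that marks an OSD out and asks Ceph whether it is safe to remove, using the standard `ceph osd out` and `ceph osd safe-to-destroy` commands; the OSD id and any surrounding Rook/Kubernetes orchestration are placeholders.

    import subprocess

    def ceph(*args):
        """Run a ceph CLI command and return the completed process."""
        return subprocess.run(["ceph", *args], capture_output=True, text=True)

    def can_release_osd(osd_id):
        """Mark the OSD out and report whether it is currently safe to destroy."""
        ceph("osd", "out", str(osd_id))                       # start migrating data off it
        result = ceph("osd", "safe-to-destroy", str(osd_id))  # non-zero while PGs still need it
        return result.returncode == 0

    if can_release_osd(12):   # hypothetical OSD id
        print("OSD 12 can be released without risking data integrity")
    else:
        print("OSD 12 still holds data that has not been fully migrated")
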
  • 2 participants
  • 29 minutes
kubernetes
cluster
server
premise
infrastructure
platform
capacity
virtualization
cloud
security

19 May 2023

Presented by: Danny Abukalam

Benchmarking Ceph has always been a complex task: there are lots of tools, but many have drawbacks and are written for more general-purpose use. For Ceph we need to benchmark librados, RBD, CephFS, and RGW, and each of these protocols has unique challenges and typical deployment scenarios. Moreover, Ceph works better at scale, so the benchmarking system itself must also scale and be able to generate adequate load against large clusters.
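
For a flavour of the simplest case, a minimal single-client write benchmark against librados using the Python `rados` bindings; the pool name and object sizes are placeholders, and a realistic benchmark would coordinate many clients in parallel to load a large cluster.

    import time
    import rados  # Python bindings shipped with Ceph (python3-rados)

    OBJECT_SIZE = 4 * 1024 * 1024   # 4 MiB per object
    NUM_OBJECTS = 256
    payload = b"\0" * OBJECT_SIZE

    cluster = rados.Rados(conffile="/etc/ceph/ceph.conf")
    cluster.connect()
    ioctx = cluster.open_ioctx("benchpool")   # assumes this pool already exists

    start = time.monotonic()
    for i in range(NUM_OBJECTS):
        ioctx.write_full(f"bench_obj_{i}", payload)
    elapsed = time.monotonic() - start

    mib = NUM_OBJECTS * OBJECT_SIZE / 2**20
    print(f"wrote {mib:.0f} MiB in {elapsed:.1f}s ({mib / elapsed:.1f} MiB/s)")

    ioctx.close()
    cluster.shutdown()
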
  • 6 participants
  • 32 minutes
benchmarking
benchmarked
benchmarks
benchmark
testing
benchmasters
performance
workloads
protocols
difficult

17 May 2023

Presented by: Frank Yang

Working together with a major American sports league, we built a multi-site 40 PB active archive housing over 100 years of game video and audio assets by using Ceph as the foundational storage technology. Along the way, we learned many lessons about architecting, deploying, and operationalizing Ceph from the vantage point of a large, modern, and rapidly growing media company. We would like to share our experience and learnings with the community to help others traveling a similar road.

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 6 participants
  • 35 minutes
stadium
infrastructure
capacity
league
data
accessible
important
documentation
archived
backups

17 May 2023

Presented by: Vincent Hsu | IBM

The new generation of hybrid cloud provides a common platform across all your cloud, on-premises, and edge environments. That means you can skill once, build once, and manage from a single pane of glass. That also implies the platform needs to support diverse workloads and different levels of maturity in management skills. In this presentation, we will cover the open source projects and proposals to enhance Ceph's consumability and manageability to enable Ceph in more environments.

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 1 participant
  • 25 minutes
ibm
enterprise
storage
sev
hey
customers
invited
morning
security
talking

17 May 2023

Presented by: Yaarit Hatuka | IBM

To increase product observability and robustness, Ceph’s telemetry module allows users to automatically report anonymized data about their clusters. Ceph’s telemetry backend runs tools that analyze this data to help developers understand how Ceph is used and what problems users may be experiencing. In this session, we will give an overview of the various aspects of Ceph’s upstream telemetry and its benefits for users, and explore how telemetry can be deployed independently as a tool for fleet observability.
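
As a quick illustration of what operators can inspect locally, a small sketch that shells out to `ceph telemetry show`, which prints the anonymized JSON report the module would send (reporting itself is enabled separately with `ceph telemetry on`); it assumes the ceph CLI and an admin keyring are available on the host.

    import json
    import subprocess

    # Fetch the report the telemetry module would submit, without enabling it.
    out = subprocess.run(["ceph", "telemetry", "show"],
                         check=True, capture_output=True, text=True).stdout
    report = json.loads(out)

    # Peek at what would be shared before opting in.
    print("Top-level sections in the telemetry report:", sorted(report))
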

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 7 participants
  • 39 minutes
cluster
telemetry
clusters
dashboards
data
insights
reports
deploying
triage
bot

17 May 2023

Presented by: Dan van der Ster | Clyso

In 2013, the data storage team at CERN began investigating Ceph to solve an emerging problem: how to provide reliable, flexible, future-proof storage for our growing on-premises OpenStack cloud. Beginning with a humble 3PB cluster, the infrastructure has grown to support the entire lab, with 50PB of storage across multiple data centres serving use-cases ranging from basic IT apps and databases to HPC and cloud storage, among others.

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 2 participants
  • 32 minutes
cern
physicists
lhc
stuff
hadron
ve
nobel
acceleration
2018
space

17 May 2023

Presented by: Federico Lucifredi & Sage McTaggart | IBM

We explore the security model exposed by Rook with Ceph, the leading software-defined storage platform of the Open Source world. Digging increasingly deeper in the stack, we examine options for hardening Ceph storage that are appropriate for a variety of threat profiles.

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 5 participants
  • 35 minutes
security
transitions
hosts
cruz
administrative
ago
career
backed
good
york

17 May 2023

Presented by: Jonas Pfefferle | IBM Research

NVMe-over-Fabrics (NVMeoF) is a widely adopted, de facto standard in remote block storage access. Ceph clients use the RADOS protocol to access RBD images, but there are good reasons to enable access via NVMeoF: to allow existing NVMeoF storage users to easily migrate to Ceph and to enable the use of NVMeoF offloading hardware. This talk presents our effort to provide native NVMeoF support for Ceph. We discuss some of the challenges, including multi-pathing for fault tolerance and performance.
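
To make the client side concrete, a sketch of how a Linux host would attach a volume exported over NVMe/TCP using nvme-cli; the gateway address, port, and subsystem NQN are placeholders, and the Ceph-side gateway configuration is out of scope here.

    import subprocess

    GATEWAY_ADDR = "192.0.2.10"                           # example address
    GATEWAY_PORT = "4420"                                 # conventional NVMe/TCP port
    SUBSYSTEM_NQN = "nqn.2016-06.io.example:rbd-image"    # hypothetical subsystem NQN

    # Discover subsystems exported by the gateway over NVMe/TCP.
    subprocess.run(["sudo", "nvme", "discover", "-t", "tcp",
                    "-a", GATEWAY_ADDR, "-s", GATEWAY_PORT], check=True)

    # Connect; the volume then appears as a regular /dev/nvmeXnY block device,
    # and additional gateways can be connected for multi-path fault tolerance.
    subprocess.run(["sudo", "nvme", "connect", "-t", "tcp", "-n", SUBSYSTEM_NQN,
                    "-a", GATEWAY_ADDR, "-s", GATEWAY_PORT], check=True)
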

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 7 participants
  • 31 minutes
envy
threads
ceph
enterprise
gateways
remote
backend
ram
storage
nvm

17 May 2023

Presented by: Patrick Donnelly | IBM

Ceph was originally designed to fill a need for a distributed file system within scientific computing environments but has since grown to become a dominant *unified* software-defined distributed storage system. This talk will cover the new development of an SQLite Virtual File System (VFS) on top of Ceph's distributed object store (RADOS). I will show how SQL can now be run on Ceph for both its internal use and for new application storage requirements.
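
A minimal sketch of what this looks like from an application, assuming the libcephsqlite extension is installed and a suitable RADOS pool exists; the pool, namespace, and database names are placeholders, and the exact URI format should be checked against the Ceph documentation for your release.

    import sqlite3

    # Load the extension once so the "ceph" VFS registers itself with SQLite.
    bootstrap = sqlite3.connect(":memory:")
    bootstrap.enable_load_extension(True)
    bootstrap.load_extension("libcephsqlite")   # or the full path to libcephsqlite.so
    bootstrap.close()

    # Open a database whose pages live as objects in a RADOS pool.
    db = sqlite3.connect("file:///mypool:mynamespace/app.db?vfs=ceph", uri=True)
    db.execute("CREATE TABLE IF NOT EXISTS kv (k TEXT PRIMARY KEY, v TEXT)")
    db.execute("INSERT OR REPLACE INTO kv VALUES ('greeting', 'hello rados')")
    db.commit()
    print(db.execute("SELECT v FROM kv WHERE k = 'greeting'").fetchone())
    db.close()
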

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 2 participants
  • 29 minutes
manages
interface
threads
fadm
config
modules
databases
debugging
libsep
ceph

17 May 2023

Welcoming remarks from Ceph board member Matthew Leonard.

Community update from Mike Perez.

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 2 participants
  • 12 minutes
bloomberg
staff
facilities
service
today
thank
saf
company
contributors
enterprise

17 May 2023

Presented by: Nathan Hoad | Bloomberg

Ceph’s Prometheus module provides performance counter metrics via the ceph-mgr component. While this works well for smaller installations, it can be problematic to put metric workloads into ceph-mgr at scale. Ceph is just one component of our internal S3 product. We also need to gather telemetry data about space, objects per bucket, buckets per tenancy, etc., as well as telemetry from a software-defined distributed quality of service (QoS) system which is not natively supported by Ceph.
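
For reference, the ceph-mgr exporter can be scraped directly; the sketch below fetches its /metrics endpoint (the module listens on port 9283 by default, and the host name here is a placeholder) and counts the exported metric families.

    import urllib.request

    MGR_METRICS_URL = "http://ceph-mgr.example.com:9283/metrics"   # placeholder host

    with urllib.request.urlopen(MGR_METRICS_URL, timeout=10) as resp:
        metrics_text = resp.read().decode()

    # Each "# TYPE <name> <type>" line introduces one metric family.
    families = {line.split()[2] for line in metrics_text.splitlines()
                if line.startswith("# TYPE ")}
    print(f"ceph-mgr currently exports {len(families)} metric families")
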

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 4 participants
  • 29 minutes
bloomberg
storage
processes
terminal
nfs
users
distributed
model
tenancies
manager

17 May 2023

Presented by: Anthony D'Atri

Ceph enables flexible and scalable object storage of unstructured data for a wide variety of workloads. RGW (RADOS GateWay) deployments experience a wide variety of object sizes and must balance workload, cost, and performance requirements. S3 storage classes are an established way to steer data onto underlying media that meet specific resilience, cost, and performance requirements. One might, for example, define RGW back end storage classes for SSD or HDD media, non-redundant vs replicated vs erasure-coded pools, etc. Diverting individual objects or entire buckets into a non-default storage class usually requires specific client action. Compliance, however, can be awkward to request and impossible to enforce, especially in multi-tenant deployments that may include paying customers as well as internal users. This work enables the RGW back end to enforce storage class on uploaded objects based on specific criteria without requiring client action. For example, one might define a default storage class on performance TLC or Optane media for resource-intensive small S3 objects while assigning larger objects to cost-effective QLC SSD media.
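
For contrast, a sketch of the client-side opt-in that stock RGW relies on today and that the proposed enforcement would make unnecessary: a boto3 upload that explicitly requests a non-default storage class. The endpoint, credentials, bucket, and the "COLD" class name are placeholders for a class defined in the zone's placement target.

    import boto3

    s3 = boto3.client(
        "s3",
        endpoint_url="https://rgw.example.com",   # placeholder RGW endpoint
        aws_access_key_id="ACCESS_KEY",           # placeholder credentials
        aws_secret_access_key="SECRET_KEY",
    )

    # Today the client must opt in per object (or via bucket lifecycle rules);
    # the proposal moves this decision server-side, e.g. keyed on object size.
    with open("video.mp4", "rb") as body:
        s3.put_object(
            Bucket="media",
            Key="large/video.mp4",
            Body=body,
            StorageClass="COLD",   # hypothetical QLC/HDD-backed storage class
        )
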

https://ceph.io/en/community/events/2023/ceph-days-nyc/
  • 1 participant
  • 19 minutes
capacity
gigabyte
4kb
workloads
nand
storages
deployments
servers
bottleneck
spinning

8 Mar 2023

Presented by: Frank Yang

Working together with a major American sports league, we built a multi-site 40 PB active archive housing over 100 years of game video and audio assets by using Ceph as the foundational storage technology. Along the way, we learned many lessons about architecting, deploying, and operationalizing Ceph from the vantage point of a large, modern, and rapidly growing media company. We would like to share our experience and learnings with the community to help others traveling a similar road.
  • 6 participants
  • 35 minutes
stadium
infrastructure
league
capacity
data
important
accessible
preserving
motivation
backups

8 Mar 2023

Presented by: Chris MacNaughton | Canonical

Building up a Ceph cluster can be a bit tricky and time-consuming, especially if it’s just for testing or a small home lab. To make this much easier, we’ve started working on MicroCeph. It’s a snap package with a small management daemon that makes clustering multiple systems very easy and, combined with a simple bootstrap process, lets you set up a Ceph cluster in just a few minutes!
  • 3 participants
  • 11 minutes
daemons
cluster
clustering
deployments
linux
servers
gateways
microceph
lvm
snap

8 Mar 2023

Presented by: Yaarit Hatuka | IBM

To increase product observability and robustness, Ceph’s telemetry module allows users to automatically report anonymized data about their clusters. Ceph’s telemetry backend runs tools that analyze this data to help developers understand how Ceph is used and what problems users may be experiencing. In this session, we will give an overview of the various aspects of Ceph’s upstream telemetry and its benefits for users, and explore how telemetry can be deployed independently as a tool for fleet observability.
  • 7 participants
  • 39 minutes
telemetry
cluster
telepublicstaff
dashboards
data
deploying
insights
reports
triage
bot

8 Mar 2023

Presented by: Dan van der Ster | Clyso

In 2013, the data storage team at CERN began investigating Ceph to solve an emerging problem: how to provide reliable, flexible, future-proof storage for our growing on-premises OpenStack cloud. Beginning with a humble 3PB cluster, the infrastructure has grown to support the entire lab, with 50PB of storage across multiple data centres serving use-cases ranging from basic IT apps and databases to HPC and cloud storage, among others.
  • 2 participants
  • 32 minutes
cern
physicists
lhc
stuff
hadron
ve
nobel
acceleration
2018
space

8 Mar 2023

Presented by: Vincent Hsu | IBM

The new generation of hybrid cloud provides a common platform across all your cloud, on-premises, and edge environments. That means you can skill once, build once, and manage from a single pane of glass. That also implies the platform needs to support diverse workloads and different levels of maturity in management skills. In this presentation, we will cover the open source projects and proposals to enhance Ceph's consumability and manageability to enable Ceph in more environments.
  • 1 participant
  • 25 minutes
ibm
enterprise
storage
sev
hey
customers
invited
morning
security
talking

8 Mar 2023

We explore the security model exposed by Rook with Ceph, the leading software-defined storage platform of the Open Source world. Digging increasingly deeper in the stack, we examine options for hardening Ceph storage that are appropriate for a variety of threat profiles.
  • 5 participants
  • 35 minutes
security
transitions
hosts
cruz
administrative
ago
career
backed
good
york

8 Mar 2023

Presented by: Jonas Pfefferle | IBM Research

NVMe-over-Fabrics (NVMeoF) is a widely adopted, de facto standard in remote block storage access. Ceph clients use the RADOS protocol to access RBD images, but there are good reasons to enable access via NVMeoF: to allow existing NVMeoF storage users to easily migrate to Ceph and to enable the use of NVMeoF offloading hardware. This talk presents our effort to provide native NVMeoF support for Ceph. We discuss some of the challenges, including multi-pathing for fault tolerance and performance.
  • 7 participants
  • 31 minutes
envy
threads
ceph
backend
gateways
enterprise
remote
fabric
storage
nvm

8 Mar 2023

Presented by: Josh Durgin | IBM

In this talk, we'll provide an update on the state of the Ceph upstream project, recent development efforts, current priorities, and community initiatives. We will share details of features released across components in the latest Ceph release, Quincy, and explain how this release is different from previous Ceph releases. The talk will also provide a sneak peek into features being planned for the next Ceph release, Reef.
  • 1 participant
  • 23 minutes
stuff
monitoring
governance
hosts
community
improvements
project
dashboards
cluster
deployments

8 Mar 2023

Welcoming remarks from Ceph board member Matthew Leonard.

Community update from Mike Perez.
  • 2 participants
  • 12 minutes
bloomberg
staff
facilities
service
today
thank
saf
company
contributors
enterprise

8 Mar 2023

Presented by: Anthony D'Atri

Ceph enables flexible and scalable object storage of unstructured data for a wide variety of workloads. RGW (RADOS GateWay) deployments experience a wide variety of object sizes and must balance workload, cost, and performance requirements. S3 storage classes are an established way to steer data onto underlying media that meet specific resilience, cost, and performance requirements. One might, for example, define RGW back end storage classes for SSD or HDD media, non-redundant vs replicated vs erasure-coded pools, etc. Diverting individual objects or entire buckets into a non-default storage class usually requires specific client action. Compliance, however, can be awkward to request and impossible to enforce, especially in multi-tenant deployments that may include paying customers as well as internal users. This work enables the RGW back end to enforce storage class on uploaded objects based on specific criteria without requiring client action. For example, one might define a default storage class on performance TLC or Optane media for resource-intensive small S3 objects while assigning larger objects to cost-effective QLC SSD media.
  • 1 participant
  • 19 minutes
capacity
gigabyte
4kb
workloads
nand
storages
deployments
servers
bottleneck
spinning

8 Mar 2023

Presented by: Patrick Donnelly | IBM

Ceph was originally designed to fill a need for a distributed file system within scientific computing environments but has since grown to become a dominant **unified** software-defined distributed storage system. This talk will cover the new development of an SQLite Virtual File System (VFS) on top of Ceph's distributed object store (RADOS). I will show how SQL can now be run on Ceph for both its internal use and for new application storage requirements.
  • 2 participants
  • 29 minutes
manages
interface
threads
fadm
config
modules
databases
debugging
libsep
ceph

8 Mar 2023

Presented by: Joachim Kraftmayer | Clyso

Over the last few years, we have been gaining experience with Rook in production. One of our challenges was to implement dynamic resource management across 50+ Ceph clusters: Kubernetes events distribute load and capacity between the clusters dynamically and fully automatically. This is done by removing single or multiple Ceph nodes from a cluster while ensuring data integrity at all times; the released nodes are then integrated into other Ceph clusters as needed.
  • 1 participant
  • 29 minutes
kubernetes
cluster
server
capacity
platform
virtualization
foundation
cloud
iop
security

8 Mar 2023

Presented by: Danny Abukalam | SoftIron

Benchmarking Ceph has always been a complex task: there are lots of tools, but many have drawbacks and are written for more general-purpose use. For Ceph we need to benchmark librados, RBD, CephFS, and RGW, and each of these protocols has unique challenges and typical deployment scenarios. Moreover, Ceph works better at scale, so the benchmarking system itself must also scale and be able to generate adequate load against large clusters.
  • 6 participants
  • 32 minutes
benchmarking
benchmarked
benchmarks
benchmark
testing
benchmasters
performance
workloads
protocols
difficult

8 Mar 2023

Presented by: Nathan Hoad | Bloomberg

Ceph’s Prometheus module provides performance counter metrics via the ceph-mgr component. While this works well for smaller installations, it can be problematic to put metric workloads into ceph-mgr at scale. Ceph is just one component of our internal S3 product. We also need to gather telemetry data about space, objects per bucket, buckets per tenancy, etc., as well as telemetry from a software-defined distributed quality of service (QoS) system which is not natively supported by Ceph.
  • 4 participants
  • 29 minutes
bloomberg
storage
processes
terminal
nfs
users
distributed
model
tenancies
manager