19 May 2023
Presented by: Chris MacNaughton
Building up a Ceph cluster can be a bit tricky and time-consuming, especially if it’s just for testing or a small home lab. To make this much easier, we’ve started working on microceph: a snap package with a small management daemon that makes clustering multiple systems very easy and, combined with a simple bootstrap process, lets you set up a Ceph cluster in just a few minutes!
- 3 participants
- 11 minutes
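As a rough sketch of the bootstrap flow the talk describes, using microceph's documented snap commands (hostnames and device paths below are placeholders; this assumes snapd and a running cluster node):

```shell
# Install the snap on every node
sudo snap install microceph

# On the first node, bootstrap the cluster
sudo microceph cluster bootstrap

# Still on the first node, register each additional node (prints a join token)
sudo microceph cluster add node2

# On each additional node, join using the token from the previous step
sudo microceph cluster join <token-from-previous-step>

# Add a disk (or a loop file, for a lab) as an OSD on each node
sudo microceph disk add /dev/sdb --wipe

# Check cluster health with the bundled Ceph client
sudo microceph.ceph status
```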
19 May 2023
Presented by: Neha Ojha & Josh Durgin
In this talk, we'll provide an update on the state of the Ceph upstream project, recent development efforts, current priorities, and community initiatives. We will share details of features released across components in the latest Ceph release, Quincy, and explain how this release is different from previous Ceph releases. The talk will also provide a sneak peek into features being planned for the next Ceph release, Reef.
- 1 participant
- 23 minutes
19 May 2023
Presented by: Joachim Kraftmayer
Over the last few years, we have been gaining experience with Rook in production. One of our challenges was to implement dynamic resource management between 50+ Ceph clusters. Kubernetes events dynamically and fully automatically distribute loads and capacity between Ceph clusters. This is done by removing single or multiple Ceph nodes from Ceph clusters while ensuring data integrity at all times. In the next step, the released Ceph nodes are integrated into other Ceph clusters as needed.
- 2 participants
- 29 minutes
19 May 2023
Presented by: Danny Abukalam
Benchmarking Ceph has always been a complex task - there are lots of tools but many have drawbacks and are written for more general-purpose use. For Ceph we need to benchmark Librados, RBD, CephFS, and RGW and each of these protocols has unique challenges and typical deployment scenarios. Not only that, Ceph works better at scale and so we need to ensure that we can build a benchmarking system that will also scale and be able to generate an adequate load at large scale.
- 6 participants
- 32 minutes
17 May 2023
Presented by: Frank Yang
Working together with a major American sports league, we built a multi-site 40 PB active archive housing over 100 years of game video and audio assets by using Ceph as the foundational storage technology. Along the way, we learned many lessons about architecting, deploying, and operationalizing Ceph from the vantage point of a large, modern, and rapidly growing media company. We would like to share our experience and learnings with the community to help others traveling a similar road.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 6 participants
- 35 minutes
17 May 2023
Presented by: Vincent Hsu | IBM
The new generation of hybrid cloud provides a common platform across all your cloud, on-premises, and edge environments. That means you can skill up once, build once, and manage from a single pane of glass. It also implies the platform needs to support diverse workloads and different levels of maturity in management skills. In this presentation, we will cover the open source projects and proposals to enhance Ceph's consumability and manageability and enable Ceph in more environments.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 1 participant
- 25 minutes
17 May 2023
Presented by: Yaarit Hatuka | IBM
To increase product observability and robustness, Ceph’s telemetry module allows users to automatically report anonymized data about their clusters. Ceph’s telemetry backend runs tools that analyze this data to help developers understand how Ceph is used and what problems users may be experiencing. In this session, we will give an overview of the various aspects of Ceph’s upstream telemetry and its benefits for users, and explore how telemetry can be deployed independently as a tool for fleet observability.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 7 participants
- 39 minutes
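The opt-in flow for the upstream telemetry module described above looks roughly like this (a sketch using the documented `ceph telemetry` commands, run against a cluster with an admin keyring):

```shell
# Preview exactly what would be reported before opting in
ceph telemetry preview-all

# Opt in to the basic anonymized report
ceph telemetry on

# Optionally enable additional channels, e.g. performance counters
ceph telemetry enable channel perf

# Inspect the report the cluster will send
ceph telemetry show
```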
17 May 2023
Presented by: Dan van der Ster | Clyso
In 2013, the data storage team at CERN began investigating Ceph to solve an emerging problem: how to provide reliable, flexible, future-proof storage for our growing on-premises OpenStack cloud. Beginning with a humble 3PB cluster, the infrastructure has grown to support the entire lab, with 50PB of storage across multiple data centres used across a variety of use-cases ranging from basic IT apps, databases, HPC, cloud storage, and others.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 2 participants
- 32 minutes
17 May 2023
Presented by: Federico Lucifredi & Sage McTaggart | IBM
We explore the security model exposed by Rook with Ceph, the leading software-defined storage platform of the Open Source world. Digging increasingly deeper in the stack, we examine options for hardening Ceph storage that are appropriate for a variety of threat profiles.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 5 participants
- 35 minutes
17 May 2023
Presented by: Jonas Pfefferle | IBM Research
NVMe-over-Fabrics (NVMeoF) is a widely adopted, de facto standard in remote block storage access. Ceph clients use the RADOS protocol to access RBD images, but there are good reasons to enable access via NVMeoF: to allow existing NVMeoF storage users to easily migrate to Ceph and to enable the use of NVMeoF offloading hardware. This talk presents our effort to provide native NVMeoF support for Ceph. We discuss some of the challenges, including multi-pathing for fault tolerance and performance.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 7 participants
- 31 minutes
17 May 2023
Presented by: Patrick Donnelly | IBM
Ceph was originally designed to fill a need for a distributed file system within scientific computing environments but has since grown to become a dominant *unified* software-defined distributed storage system. This talk will cover the new development of an SQLite Virtual File System (VFS) on top of Ceph's distributed object store (RADOS). I will show how SQL can now be run on Ceph for both its internal use and for new application storage requirements.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 2 participants
- 29 minutes
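As a sketch of what the talk describes, the Ceph VFS ships as a loadable SQLite extension (`libcephsqlite`) and can be exercised from the standard `sqlite3` client; the pool and database names below are placeholders, and this assumes a reachable cluster with the extension installed:

```shell
# Load the Ceph VFS and open a database stored directly in a RADOS pool.
# URI format (per the Ceph docs): file:///<pool>:<namespace>/<db-name>?vfs=ceph
sqlite3 <<'EOF'
.load libcephsqlite.so
.open file:///mypool:/demo.db?vfs=ceph
CREATE TABLE IF NOT EXISTS kv (k TEXT PRIMARY KEY, v TEXT);
INSERT OR REPLACE INTO kv VALUES ('greeting', 'hello from RADOS');
SELECT * FROM kv;
EOF
```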
17 May 2023
Welcoming remarks from Ceph board member Matthew Leonard.
Community update from Mike Perez.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 2 participants
- 12 minutes
17 May 2023
Presented by: Nathan Hoad | Bloomberg
Ceph’s Prometheus module provides performance counter metrics via the ceph-mgr component. While this works well for smaller installations, it can be problematic to put metric workloads into ceph-mgr at scale. Ceph is just one component of our internal S3 product. We also need to gather telemetry data about space, objects per bucket, buckets per tenancy, etc., as well as telemetry from a software-defined distributed quality of service (QoS) system which is not natively supported by Ceph.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 4 participants
- 29 minutes
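For context on the built-in path the talk contrasts against, the ceph-mgr Prometheus exporter is a one-line enable plus an ordinary scrape target (the hostname below is a placeholder; 9283 is the module's default port):

```shell
# Enable the built-in exporter on the active ceph-mgr
ceph mgr module enable prometheus

# Verify that metrics are being served (scrape this endpoint from Prometheus)
curl -s http://ceph-mgr-host.example.com:9283/metrics | head
```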
17 May 2023
Presented by: Anthony D'Atri
Ceph enables flexible and scalable object storage of unstructured data for a wide variety of workloads. RGW (RADOS Gateway) deployments experience a wide variety of object sizes and must balance workload, cost, and performance requirements. S3 storage classes are an established way to steer data onto underlying media that meet specific resilience, cost, and performance requirements: one might, for example, define RGW back-end storage classes for SSD or HDD media, or for non-redundant vs. replicated vs. erasure-coded pools. Diverting individual objects or entire buckets into a non-default storage class usually requires specific client action. Compliance, however, can be awkward to request and impossible to enforce, especially in multi-tenant deployments that may include paying customers as well as internal users. This work enables the RGW back end to enforce a storage class on uploaded objects based on specific criteria, without requiring client action. For example, one might define a default storage class on performant TLC or Optane media for resource-intensive small S3 objects while assigning larger objects to cost-effective QLC SSD media.
https://ceph.io/en/community/events/2023/ceph-days-nyc/
- 1 participant
- 19 minutes
8 Mar 2023
Presented by: Frank Yang
Working together with a major American sports league, we built a multi-site 40 PB active archive housing over 100 years of game video and audio assets by using Ceph as the foundational storage technology. Along the way, we learned many lessons about architecting, deploying, and operationalizing Ceph from the vantage point of a large, modern, and rapidly growing media company. We would like to share our experience and learnings with the community to help others traveling a similar road.
- 6 participants
- 35 minutes
8 Mar 2023
Presented by: Chris MacNaughton | Canonical
Building up a Ceph cluster can be a bit tricky and time-consuming, especially if it’s just for testing or a small home lab. To make this much easier, we’ve started working on microceph: a snap package with a small management daemon that makes clustering multiple systems very easy and, combined with a simple bootstrap process, lets you set up a Ceph cluster in just a few minutes!
- 3 participants
- 11 minutes
8 Mar 2023
Presented by: Yaarit Hatuka | IBM
To increase product observability and robustness, Ceph’s telemetry module allows users to automatically report anonymized data about their clusters. Ceph’s telemetry backend runs tools that analyze this data to help developers understand how Ceph is used and what problems users may be experiencing. In this session, we will give an overview of the various aspects of Ceph’s upstream telemetry and its benefits for users, and explore how telemetry can be deployed independently as a tool for fleet observability.
- 7 participants
- 39 minutes
8 Mar 2023
Presented by: Dan van der Ster | Clyso
In 2013, the data storage team at CERN began investigating Ceph to solve an emerging problem: how to provide reliable, flexible, future-proof storage for our growing on-premises OpenStack cloud. Beginning with a humble 3PB cluster, the infrastructure has grown to support the entire lab, with 50PB of storage across multiple data centres used across a variety of use-cases ranging from basic IT apps, databases, HPC, cloud storage, and others.
- 2 participants
- 32 minutes
8 Mar 2023
Presented by: Vincent Hsu | IBM
The new generation of hybrid cloud provides a common platform across all your cloud, on-premises, and edge environments. That means you can skill up once, build once, and manage from a single pane of glass. It also implies the platform needs to support diverse workloads and different levels of maturity in management skills. In this presentation, we will cover the open source projects and proposals to enhance Ceph's consumability and manageability and enable Ceph in more environments.
- 1 participant
- 25 minutes
8 Mar 2023
We explore the security model exposed by Rook with Ceph, the leading software-defined storage platform of the Open Source world. Digging increasingly deeper in the stack, we examine options for hardening Ceph storage that are appropriate for a variety of threat profiles.
- 5 participants
- 35 minutes
8 Mar 2023
Presented by: Jonas Pfefferle | IBM Research
NVMe-over-Fabrics (NVMeoF) is a widely adopted, de facto standard in remote block storage access. Ceph clients use the RADOS protocol to access RBD images, but there are good reasons to enable access via NVMeoF: to allow existing NVMeoF storage users to easily migrate to Ceph and to enable the use of NVMeoF offloading hardware. This talk presents our effort to provide native NVMeoF support for Ceph. We discuss some of the challenges, including multi-pathing for fault tolerance and performance.
- 7 participants
- 31 minutes
8 Mar 2023
Presented by: Josh Durgin | IBM
In this talk, we'll provide an update on the state of the Ceph upstream project, recent development efforts, current priorities, and community initiatives. We will share details of features released across components in the latest Ceph release, Quincy, and explain how this release is different from previous Ceph releases. The talk will also provide a sneak peek into features being planned for the next Ceph release, Reef.
- 1 participant
- 23 minutes
8 Mar 2023
Welcoming remarks from Ceph board member Matthew Leonard.
Community update from Mike Perez.
- 2 participants
- 12 minutes
8 Mar 2023
Presented by: Anthony D'Atri
Ceph enables flexible and scalable object storage of unstructured data for a wide variety of workloads. RGW (RADOS Gateway) deployments experience a wide variety of object sizes and must balance workload, cost, and performance requirements. S3 storage classes are an established way to steer data onto underlying media that meet specific resilience, cost, and performance requirements: one might, for example, define RGW back-end storage classes for SSD or HDD media, or for non-redundant vs. replicated vs. erasure-coded pools. Diverting individual objects or entire buckets into a non-default storage class usually requires specific client action. Compliance, however, can be awkward to request and impossible to enforce, especially in multi-tenant deployments that may include paying customers as well as internal users. This work enables the RGW back end to enforce a storage class on uploaded objects based on specific criteria, without requiring client action. For example, one might define a default storage class on performant TLC or Optane media for resource-intensive small S3 objects while assigning larger objects to cost-effective QLC SSD media.
- 1 participant
- 19 minutes
8 Mar 2023
Presented by: Patrick Donnelly | IBM
Ceph was originally designed to fill a need for a distributed file system within scientific computing environments but has since grown to become a dominant **unified** software-defined distributed storage system. This talk will cover the new development of an SQLite Virtual File System (VFS) on top of Ceph's distributed object store (RADOS). I will show how SQL can now be run on Ceph for both its internal use and for new application storage requirements.
- 2 participants
- 29 minutes
8 Mar 2023
Presented by: Joachim Kraftmayer | Clyso
Over the last few years, we have been gaining experience with Rook in production. One of our challenges was to implement dynamic resource management between 50+ Ceph clusters. Kubernetes events dynamically and fully automatically distribute loads and capacity between Ceph clusters. This is done by removing single or multiple Ceph nodes from Ceph clusters while ensuring data integrity at all times. In the next step, the released Ceph nodes are integrated into other Ceph clusters as needed.
- 1 participant
- 29 minutes
8 Mar 2023
Presented by: Danny Abukalam | SoftIron
Benchmarking Ceph has always been a complex task - there are lots of tools but many have drawbacks and are written for more general-purpose use. For Ceph we need to benchmark Librados, RBD, CephFS, and RGW and each of these protocols has unique challenges and typical deployment scenarios. Not only that, Ceph works better at scale and so we need to ensure that we can build a benchmarking system that will also scale and be able to generate an adequate load at large scale.
- 6 participants
- 32 minutes
8 Mar 2023
Presented by: Nathan Hoad | Bloomberg
Ceph’s Prometheus module provides performance counter metrics via the ceph-mgr component. While this works well for smaller installations, it can be problematic to put metric workloads into ceph-mgr at scale. Ceph is just one component of our internal S3 product. We also need to gather telemetry data about space, objects per bucket, buckets per tenancy, etc., as well as telemetry from a software-defined distributed quality of service (QoS) system which is not natively supported by Ceph.
- 4 participants
- 29 minutes