►
From YouTube: Ceph Month 2021: RGW Update
Description
Presented By Casey Bodley
Ceph Month 2021 schedule: https://pad.ceph.com/p/ceph-month-june-2021
A
Well,
welcome
everybody
to
another
ceph
month
week,
two
and
this
time
we're
going
to
have
a
radio
skateway
update
with
casey
and
then
followed
by.
We
have
two
birds
of
a
feather
session,
one
on
the
cefe
research
and
scientific
computing
group,
as
well
as
the
gosef
get
together,
which
is
the
go
apis
for
seth,
though
casey
will
you
please
take
it
away
for
the
first
presentation
for
week,
two.
B
B
B
So
a
background
process
just
builds
an
index
of
these
large
buckets
in
a
convenient
and
cheap
format
to
read
so
it
kind
of
trades.
Some
of
the
consistency
from
our
bucket
index
for
performance.
B
So
we
recently
started
a
collaboration
with
some
engineers
from
flipkart
and
we're
excited
to
see
progress
here
also
request
rate
limiting
per
user
and
bucket.
We've
got
a
proof
of
concept
for
this
working
so
expect
to
see
that
in
in
quincy,
we've
also
been
working
on
some
data.
Caching
projects
with
mass
open
cloud,
with
the
goal
of
accelerating
workloads
with
some
local
data.
B
Caching,
and
we're
also
planning
to
build
on
jaeger,
tracing
support
in
rgw
and
use
it
to
do
some
performance
evaluation
and
optimization
work
next,
introducing
project
zipper,
which
is
project
led
by
dan
grinowicz,
where
we've
built
an
abstraction
in
rgw
for
the
liberators
back
end.
B
One
of
the
cool
ideas
is
to
have
a
policy
layer
that
enables
some
lua
scripting
to
control
how
requests
are
processed
before
they
get
written
out
to
rados,
and
this
interface
also
can
be
really
useful
for
some
benchmarking
and
performance
work
where
we
drive
a
workload
just
against
specific
parts
of
the
interface
or.
If
we
have
like
a
memory
back
end,
then
we
can
do
benchmarking
on
the
on
the
front
end
code
without
the
overhead
of
rights
to
rados.
B
B
B
You
know
it
is
working
on
support
for
identifying
changes
in
s3
so
that
they
can
be
replicated
back
into
stuff
via
multi-site.
C
Hey
ckcr,
this
is
prasad
from
flipkart,
so
that's
an
impressive
set
of
features.
So
I
was
just
keen
about
s3,
select,
support
and
bucket
inventory
and
rate
limit.
We
we.
We
have
a.
C
Rate
limit
infrastructure,
which
is
a
patch
on
top
of
rgw,
but
with
a
lot
of
limitations.
So
I
was
wondering
how
what
would
be
what's
the
mind
about
this
rate?
Limiting
I
mean,
is
it
based
on
the
dm
clock?
Is
it
a
moving
window
and
what
are
the
contours
of
you
know
and.
B
Yeah
good
question,
so
we
had
been
looking
at
distributed
rate
limiting
so
that
the
cluster
as
a
whole
could
give
a
consistent
rate.
But
I
think
we're
going
to
just
go
with
a
per
rgw
one,
which
is
a
lot
easier
to
implement.
C
Because
you
know,
like
I
said
you
know,
we
have
a
internal
implementation
which
is
again
on
rgw
itself,
but
we
to
be
honest.
We
are
not
very
happy
with
it
because
of
a
it's,
not
a
distributed
rate
limited
it's
on
a
per
radar's
gateway
and
it's
a
fixed
window
protocol.
So
the
windows,
like
you
know,
if
you
set
a
rate
limit
on
a
per
minute
basis,
then
it's
possible
to
front
load
all
the
requests
at
the
start
of
the
minute
and
then
see
you
know,
get
a
sawtooth
sort
of
a
pattern.
C
Yeah.
Those
are
some
of
the
problems
that
we
have
and
with
respect
to
s3
select,
we
would
be
having
support
for
only
those
features
which
aws
s3
already
supports,
right,
like
the
sql
query
or
the
json
or
the
csv
parser.
B
That's
my
understanding
that
it's
it's
antennas
to
be
compatible.
Okay,.
C
Would
we
be
open
to,
I
know,
I'm
having
more
functions
which
are
not
aws
s3?
I
know
which
are
not
provided
by
aws
s3,
but
which
is
something
that
the
rgw
can
handle.
C
Okay:
okay,
one
of
the
things
that
we
were
contemplating,
although
we
are
not
going
to
work,
is
to
be
able
to
select
a
single
page
out
of
a
multi-page
pda.
C
E
B
F
Us
I
had
a
couple
quick
questions
about
some.
Some
gaps
that
I
ran
into
one
question
is
around
the
ganesha
export
support.
F
F
Do
you
know
is
that
like?
Is
that
a
known
gap,
or
is
that
a
documentation
issue
or
me
being
confused
issue,
and
or
do
you
know
who
I
should
where
she
go.
B
F
Okay,
the
demo
is
probably
tied
to
it
so,
okay,
but
in
principle
it
should
work
with
even
in
a
multi-site
situation.
F
F
A
Okay,
another
question
that
came
into
chat:
does
data
caching
still
foresee
to
use
nginx
caching,
or
would
it
be
internal
to
the
rate
of
gateway.
G
Yeah,
can
you
hear
me
first,
yes,
yeah,
okay,
excellent
yeah!
I
was
just
curious
about
this
because
actually
I'm
from
cern,
we
are
still
running
now
to
lose
for
our
s3
clusters.
So
at
the
moment
we
have
no
caching
in
between,
but
we
are
planning
to
to
upgrade
our
staff
version.
So
I
know
that
in
next
one
the
nginx
caching
layer
would
be
supported,
but
then,
from
my
understanding
this
this
new
data
caching
would
be
something
implemented
on
the
router's
gateway
itself.
G
So
can
you
can
you
maybe
expand
a
bit
more
on
that?
Would
we
be
able
to
leverage
and
take
advantage
of
this
caching
just
addressing
dorado's
gateway
without
any
other.
B
Yeah,
that's
definitely
the
goal,
so
the
the
first
step
is
just
having
rgw
cache
data
locally,
but
there's
a
another
layer
to
this
research
project
level
2,
which
is
basically
localized
caching,
so
some
objects
would
be
placed
or
pinned
to
a
specific
rgw
for
the
cache
and
there
would
be
redirects
to
get
to
the
right
cache.
G
Well
at
the
moment,
I
cannot
really
report
our
experience
for
for
what
concerns
the
nginx
cache
because,
as
I
said,
unfortunately,
we
cannot
use
it
at
the
moment.
Just
another
quick
question:
I
might
have
missed
it
at
the
very
beginning.
Are
this
new
feature
coming
to
to
pacific
or
you
for
cities
to
be
also
back
ported
or
are
for
future
releases.
A
F
Yeah
I
had
a
a
question
about
some
of
the
zipper
stuff
with
the
non-radius
back-ends.
F
It
feels
like
we
have
a
whole
sort
of
slew
of
different
scenarios
that
we're
looking
at
here
and
with
this,
the
local
database
for
the
metadata
being
one
of
the
first
ones,
just
to
make
sure
I'm
understanding
sort
of
what
this
initial
target
is.
This
is
basically
the
the
local
zone
metadata
being
stored.
Is
it
like
sqlite
or
something
like
that?
F
It's
equal
okay.
So
this
would
the
the
constraints
there
would
be
that
you'd
have
a
single
demon
instance
for
that
zone,
and
it
would
those
would
be
makes
sense
for
like
an
edge
deployment
or
something
like
that.
B
Yeah
and
I'm
I'm
not
even
sure
that
the
the
database
backend
would
be
something
that
we
prioritize
and
support.
I
think
it's
more
of
a
proof
of
concept
to
get
the
interfaces
right.
F
Right:
okay,
okay,
yeah,
I
mean
that
makes
sense
as
a
development
milestone.
I
think
there
are
edge
scenarios
where
it
might
make
sense
too,
where
you
have,
you
know
a
larger
deployment
in
the
cloud
or
in
your
data
center
or
whatever,
but
then
you
have
a
bunch
of
edge
sites
that
are
producing
data
and
just
want
to
like
write
things
locally
initially
and
then
use
multi-site
to
sync
them
out.
F
I
wouldn't
discard
that
completely.
I
think
the
thing
that
I
wonder
about
that
scenario
that
once
that
you
start
thinking
about
what
it
would
look
like
in
production
is
there
it
seems
like
there
would
be
a
bunch
of
other
of
additional
other
steps
that
would
be
necessary
to
make
the
radio
skater
run
without
sort
of
a
sub
cluster.
F
That's
there
with
the
way
that
the
stuff
configs
are
managed
or
all
the
random
things
that
we
do
where
we
talk
to
the
monitor
rather
sort
of
runtime
state.
H
It
can
be
a
very
minimal
subclass
direct
signal
monitor
on
the
edge
and
if,
if
you're
already
there
and
put
just
a
single
osd
and
and
do
that,
instead
of
the
backend
of
the
database,
back-end
yeah
one
gap
that
the
database
back-end
has
at
the
moment
it's
got
a
problem
with
the
the
idea
of
edge
deployment.
Is
it
doesn't
support
multi-site
right
now?
H
F
F
B
D
One
question:
it's
faster
to
say
it
than
type
it
it's
dan
from
cern.
I
don't
know
if
you
mentioned,
because
I
joined
about
one
minute
late,
but
in
the
past
there
were
discussions
of
the
the
index
format.
The
buffet
the
binary
like
using
omap
for
indexes
might
change
in
future
release.
Is
that
planned
in
the
foreseeable
future
or
are
the
index
formats
stable?
B
We've
definitely
put
a
lot
of
thought
into
alternate
indexing
schemes
and
they
all
tend
to
be
very
complicated.
So
I
don't
expect
a
new
index
format
in
quincy
past
that
I'm
optimistic.
D
B
Well,
we
have
an
abstraction
for
resharding
that
can
track
different
types
of
indexes,
so
you
could
restart
from
a
bucket
index
in
omap
to
some
other
format.
F
B
H
F
Just
from
a
logistical
perspective,
that's
that's
sort
of
the
big
chunk
that
that
a
bunch
of
other
pull
requests
are
sort
of
waiting
on
right.
F
Is
there
a
is
there
sort
of
a
expected
timeline
there.
B
It's
still
hard
to
say:
we
just
recently
got
some
reshard
tests
to
pass
so
we're
making
really
good
progress,
but
it's
hard
to
know
all
of
the
bugs
that
we've
yet
to
uncover.
So
it
needs
a
lot
more
testing.
A
All
right:
well,
are
there
any
other
questions
before
we
move
along
to
our
birds
of
a
feather.
A
Okay,
let's
see
yes,
thank
you,
casey,
for
the
update
and
for
answering
all
of
our
questions.