Description
Presented by: Chunsong Feng
Full schedule: https://pad.ceph.com/p/ceph-month-june-2021
A
We have also done some I/O performance optimization, which we will introduce today. The topics are: memory access across NUMA nodes and deployment with multiple NICs; DDR multi-channel deployment; the messenger; the long queue waiting time in the OSD; use of the 64K page size; and optimized CRC32C in RocksDB.
A
Before the optimization, take multiple NICs as an example: they might all be installed on NUMA node 0, while the system has multiple NUMA nodes, for example four. Now we can assign the NICs and the OSDs across the NUMA nodes.
A
This makes the system much more balanced. We have tested this deployment before and after the optimization: with the optimization, 4K random write IOPS is 7 percent higher, and the latency is as shown here.
A
Next, DDR channels. One CPU supports eight channels, and with two CPUs the channels are separate for each CPU.
A
Then, in the BlueStore kv sync thread, it swaps the kv queue into the kv committing queue and then flushes it to the WAL and DB partition or disk. After the flush, it moves the entries on to the kv committing-to-finalize queue. Then, in the kv finalize thread, it puts the completion contexts into the context queue, and the OSD shard thread swaps the context queue into its oncommits queue.
A
We have collected the latency of each stage. Take the op queue: it occupied 80 and 16 percent of the overall latency. Then there is the kv queue stage, from here until just before the kv sync thread picks it up: it occupies about 36 percent, and kv committing is about the same.
A
Since the latency of these two stages is much higher, I have also collected the queue lengths. The average length of the op queue is about 10, or less than ten, so its queue time is short. The length of the kv queue and the kv committing queue is greater than that.
A
The kv queue latency is long because the op queue is processed by multiple threads, while the kv queue and the kv committing-to-finalize queue are each processed by a single thread.
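The point about one consumer thread versus many can be illustrated with a rough back-of-the-envelope model. This is only a sketch: the function name, the service time, and the thread counts are assumptions chosen for illustration and are not measurements from the talk.

```python
def estimated_wait_us(queue_length: int, service_time_us: float, consumers: int) -> float:
    """Rough time for a newly queued item to reach the head of the queue.

    Assumes every item takes the same service time and that the consumers
    drain the queue evenly. Real queues batch and block, so this is only
    an order-of-magnitude estimate.
    """
    if consumers < 1:
        raise ValueError("consumers must be at least 1")
    return queue_length * service_time_us / consumers


# Hypothetical numbers: the same backlog drained by 8 threads versus 1 thread.
op_queue_wait = estimated_wait_us(queue_length=10, service_time_us=50.0, consumers=8)
kv_queue_wait = estimated_wait_us(queue_length=10, service_time_us=50.0, consumers=1)
print(op_queue_wait, kv_queue_wait)
```

With the same backlog and service time, the single-consumer queue waits eight times longer in this model, which matches the direction of the observation above.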
A
Before, we used the default page size for alignment; afterwards, we use a smaller size for alignment. This is with 4K pages, and this is with 64K pages. It can reduce memory waste. Secondly, we found that the mempool code uses a fixed value, which is the page shift for 4K pages. We modified it to use the system page shift, so that it is compatible with various page sizes.
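To make the page shift remark concrete, here is a minimal sketch of deriving the shift from the running system's page size instead of assuming 4K. The helper names are hypothetical; this is not the Ceph code itself, which is C++.

```python
import os


def page_shift(page_size: int) -> int:
    """Return log2(page_size); page sizes are always powers of two."""
    if page_size <= 0 or page_size & (page_size - 1):
        raise ValueError("page size must be a positive power of two")
    return page_size.bit_length() - 1


def system_page_shift() -> int:
    """Page shift of the kernel this process is running on (Unix only)."""
    return page_shift(os.sysconf("SC_PAGE_SIZE"))


print(page_shift(4096), page_shift(65536), system_page_shift())
```

A shift of 12 corresponds to 4K pages and a shift of 16 to 64K pages, so a value fixed at 12 is only correct on 4K-page kernels.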
A
Okay, now the write amplification factor: it is 2.46 with 4K pages and 45.46 with 64K pages. We have tested setting bluefs_buffered_io to false: the metadata is then written using direct I/O, and sync_file_range is not called.
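One likely contributor to the difference between page sizes is that buffered writes reach the device in whole pages. The sketch below shows only that rounding effect under simplifying assumptions (a single aligned write, no journaling or compaction); it does not reproduce the factors quoted in the talk, and the function names are made up for illustration.

```python
def bytes_flushed(write_len: int, page_size: int) -> int:
    """Bytes written back if a buffered write is flushed in whole pages."""
    pages = -(-write_len // page_size)  # ceiling division
    return pages * page_size


def write_amplification(device_bytes: int, client_bytes: int) -> float:
    """Ratio of bytes that hit the device to bytes the client asked to write."""
    if client_bytes <= 0:
        raise ValueError("client_bytes must be positive")
    return device_bytes / client_bytes


# A 4 KiB write on a 4 KiB-page kernel versus a 64 KiB-page kernel.
waf_4k = write_amplification(bytes_flushed(4096, 4096), 4096)
waf_64k = write_amplification(bytes_flushed(4096, 65536), 4096)
print(waf_4k, waf_64k)
```

Direct I/O avoids this page-granular write-back, which is consistent with testing the non-buffered path described above.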
A
The memory keeps increasing until it approaches the memory target, and the performance degrades significantly. The fix is to ensure that the page size of tcmalloc is greater than the kernel page size.
C
It wasn't clear to me why that's such a noisy line. I mean, whether there's some architectural reason why, say, 200 is better than 400, or whether it's just that you've got a lot of noise in your benchmarking. Just the shape of that graph was quite confusing to me, and I was wondering if you had an explanation for it.
A
Hi, Kevin.
D
We took the data from the experimental environment, yeah.
E
I have a related question about this graph. Was there a big queue depth when this test was done? Did you have to have a very deep queue before you would see this improvement, or did you also see an IOPS improvement with a shallow queue?
D
So, hi, yeah. You mean the testing environment? Yeah, I mean the testing cluster, yeah.
D
So the question is about your test setup. Sure. Actually, what is the scale of the cluster that we used to get this data?
D
Okay, so, sorry. We actually have three OSD nodes. Each OSD node has 20 OSDs, so 60 in total, yeah. Sorry, 60 in total.
E
Okay, that makes sense. First of all, thank you for this talk. It is full of really good information, and I really enjoyed it. I have some other questions too, if you have a second. On the first section, where you talked about NUMA affinity: there was some code we added in Octopus, maybe, that tries to automatically pin OSDs to a NUMA node when the NIC, the network adapter, and the NVMe are on the same NUMA node.
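The automatic pinning rule described here can be sketched as follows. This is an illustration of the decision only, not Ceph's implementation: the function names are hypothetical, and the sysfs path in the comment is the usual Linux location for a network device's NUMA node, where -1 means the node is unknown.

```python
from typing import Optional


def read_numa_node(sysfs_path: str) -> int:
    """Read a sysfs numa_node file, e.g. /sys/class/net/eth0/device/numa_node.

    Returns -1 when the file is missing or does not hold an integer.
    """
    try:
        with open(sysfs_path) as f:
            return int(f.read().strip())
    except (OSError, ValueError):
        return -1


def pick_numa_node(nic_node: int, storage_node: int) -> Optional[int]:
    """Pin only when the NIC and the storage device share a known NUMA node."""
    if nic_node < 0 or storage_node < 0:
        return None
    return nic_node if nic_node == storage_node else None


print(pick_numa_node(1, 1), pick_numa_node(0, 1))
```

When the two devices sit on different nodes, the sketch returns `None` and leaves the OSD unpinned, since no single node is local to both.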
D
We are using a version beyond Ceph 14, so it supports automatic NUMA affinity, yeah. So we don't actually need the manual setting.
E
That's excellent. Okay, that's good news, glad to hear that. The CPU partitioning that you mentioned is a new concept to me. I didn't realize you could do that. I would be curious to see details of how you set that up. I don't know if you could send us a follow-up email, or maybe this is all common knowledge that I just didn't realize.
E
Do you have any sense of whether that's something that we can do automatically? Or is this something that you think has to be set up manually in order to get this type of benefit?
E
Next one. This one, yes, this one here. I'm wondering if this is something that you think we can make the OSD do automatically on its own, so that it separates those two thread pools, or is it something that has to be set up manually?
E
Ideally, we would want the OSD to just do this on its own, so it can maximize the performance. Do you think that's feasible, and what do you think would be necessary to make that happen?
F
I think the question is: how can we partition these threads and assign them to different cores? Is this feasible? Can we do this programmatically, by writing a script and letting it do this on our behalf, so that you can improve your performance without human intervention?
A
This is in ceph.conf. This one is another setting, the misc cores. This one is the thread cores, and this one is the misc cores. So we can configure everything in ceph.conf.
A
I added two configuration options. Once the cores are set, it modifies the thread affinity of the messenger workers (msgr-worker) and tp_osd_tp, because these threads control the data flow. The BlueStore kv sync and kv finalize threads, and all the other threads, go to the misc cores. So we use this to separate the two kinds of threads into two areas.
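The split into two core areas can be sketched as a small planning function. Everything here is an assumption for illustration: the thread-name prefixes follow the names mentioned above, the core numbers are invented, and the function is not the patch discussed in the talk.

```python
from typing import Dict, Iterable, Set

# Assumed prefixes of the threads that carry the data path.
IO_THREAD_PREFIXES = ("msgr-worker", "tp_osd_tp")


def plan_affinity(
    thread_names: Iterable[str],
    io_cores: Set[int],
    misc_cores: Set[int],
) -> Dict[str, Set[int]]:
    """Map each thread name to the core set it should run on."""
    plan = {}
    for name in thread_names:
        is_io = name.startswith(IO_THREAD_PREFIXES)
        plan[name] = set(io_cores if is_io else misc_cores)
    return plan


threads = ["msgr-worker-0", "tp_osd_tp", "bstore_kv_sync", "bstore_kv_final"]
plan = plan_affinity(threads, io_cores={0, 1, 2, 3}, misc_cores={4, 5})
print(plan)
```

On Linux, a plan like this could then be applied per thread with `os.sched_setaffinity(tid, cores)`, given the thread IDs from `/proc/<pid>/task`.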
E
Let's see. Okay, yeah. I wonder if this is something that we could teach cephadm to do, since it has sort of the whole-node view. If we know that a certain number of cores are dedicated to OSDs, then it could do that.
E
Anyway, yeah, that's awesome. A couple of other quick questions. On the page size: there were two code changes you had for the page sizes. Have you submitted patches or pull requests for those upstream?
E
Oh, they're already in master? Excellent, okay, awesome, great news. And then there was the point you mentioned about write amplification. If I remember correctly, I think the most recent change there was that our BlueStore metadata is written using direct I/O, which means it shouldn't have any write amplification in this case, but the reads are still using buffered I/O. I don't know.
E
Yeah, I think, okay. And then the last thing was the tcmalloc and kernel page size. What is the tcmalloc option that you were tuning to control the tcmalloc page size?
D
Actually, he is also asking about which tuning optimizations we have done.
E
Yeah, I mean, it's not the distro version, right? Because if you reboot, it's like a kernel option, isn't it? So the startup script would have to look at the currently running page size and then LD_PRELOAD the right compiled variant of tcmalloc based on that when starting the OSD, yeah.
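The startup-script idea raised here could look roughly like the sketch below. It assumes tcmalloc has been built in several page-size variants; the library paths and function names are hypothetical, and "at least as large as the kernel page size" is this sketch's reading of the requirement.

```python
import os
from typing import Dict, Optional


def pick_tcmalloc_variant(kernel_page_size: int, variants: Dict[int, str]) -> Optional[str]:
    """Pick the smallest tcmalloc build whose page size covers the kernel's."""
    candidates = [size for size in variants if size >= kernel_page_size]
    return variants[min(candidates)] if candidates else None


# Hypothetical install locations, keyed by the tcmalloc page size in bytes.
VARIANTS = {
    8 * 1024: "/opt/tcmalloc/8k/libtcmalloc.so",
    64 * 1024: "/opt/tcmalloc/64k/libtcmalloc.so",
}


def preload_env(kernel_page_size: int) -> Dict[str, str]:
    """Environment for launching the OSD with the matching variant preloaded."""
    env = dict(os.environ)
    lib = pick_tcmalloc_variant(kernel_page_size, VARIANTS)
    if lib is not None:
        env["LD_PRELOAD"] = lib
    return env


print(pick_tcmalloc_variant(os.sysconf("SC_PAGE_SIZE"), VARIANTS))
```

A launcher would pass `preload_env(os.sysconf("SC_PAGE_SIZE"))` as the environment when starting the OSD process, so the choice follows whatever kernel is currently booted.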
E
At the time. Okay, okay.
E
Okay, and I guess the last thing I had was just a quick question about the RocksDB CRC backport that you did. Maybe we should include that patch in the version of RocksDB that we're building with. Or possibly, for Quincy, we should probably just fast-forward to whatever the latest RocksDB release is, which hopefully includes that patch.
B
All right. Well, we've got a little bit of a gap in our schedule, so we have about 19 minutes before the next presentation, with Anthony on Intel flash. But I wanted to thank Chunsong for taking the time to present to us, as well as Kevin for helping with the translations and with the questions. So thank you, everyone, for your time. Appreciate it.