From YouTube: Cassandra Day Atlanta 2015: Recording the Web: High-fidelity Storage and Playback with Cassandra
Description
Speaker: Ian Rose, Software Engineer at FullStory
Company: FullStory
FullStory is like a DVR for the web. An in-page script captures everything that happens during a user's online session, with the ability to play it back later with high-fidelity. In this talk I'll describe how we use Cassandra, starting with the initial requirements and our resulting schema design (including both good and bad choices there), and following up with some highlights of our experience as well as future plans.
A
I guess we can get started now; nobody else is coming. This is quite an audience. I gave this talk once to our local meetup group, so this is an order of magnitude step up. My name is Ian Rose, from FullStory, and I'm going to tell you about how we use Cassandra at our startup, what we do, and how we do all that.
A
So what is FullStory? This will briefly words. Lana may start off west side by George tactical. Just because of you were fearful time. We were much before I metamours used to working the town, bus collision of engineering there, because I've ever do around a. We launched just about nine months ago, last summer, and it's a service for deep understanding of users. I'll say what that means. So we take what we call a capture-everything approach to understanding the customer experience. Basically, in a nutshell, you get a little JavaScript snippet.
A
Think Google Analytics, if you're familiar with that: you throw that on your page and that's all for installation. There's no individual wiring of events like comments or on a stronger tortoise. Just put one script on, you're done, and we capture everything. So the initial DOM, all the changes that happen, all user events (clicks, mouse movements, etc.), all the CSS, images. We capture all of it to be able to recreate the user experience with very, very high fidelity. So I'm going to try again with this.
A
Alright, so let me talk a little about how this works. I think mid terms of relief headline. So when there are users on your site, and your site is a FullStory customer, meaning you've got our script on it, then you have clients out on their computers who, using a web browser, go to your site. Once they do, our code starts running, and what that's doing is tracking all the DOM events in the browser, and it's going to stream them, like every couple of seconds, into sending info.
A
But here all the events of half of the past classic controller by intervals. It streams them to our front ends, which is App Engine: convenient, scalable, etc. So those are receiving all these DOM events from all the users across all of our customers. Those bundles we write into Cassandra. Cassandra is our primary, first-tier, front-line storage. We use Compute Engine, which, for those of you who aren't familiar, is basically Google's version of EC2. We chose it for lots of benefits, but I'll leave it at that.
A
So it's just VMs in the cloud, and we run our Cassandra there. So we need you, everything had been in the compute engine s reels in this great quarter. There's this little dance we have to do because App Engine is a platform as a service, and so it has limited ability to do not be seeking days. So we can set us a little abaca students into else. It's the desk relative, so there the inscription the clients does. This line is open dishonor.
A
On the other end, we then need to do something with it once it's in Cassandra. We leave it there, so for a given page, as the user does stuff, we're streaming bundles, some here, some there, and they're all filling up, etc., but that page isn't done yet. Done is defined as either an unload, meaning that the user clicked a link or is going to a new page, or an idle timeout. This is kind of a Google Analytics concept. They felt long time we go.
A
Basically, if the user hasn't done anything for 30 minutes, they're probably gone. Maybe they closed their browser or just wandered away, where coal smells is all pitched up. Then we say this page is finished, it's time to process it. So we've got this stream of bundles, and we have some servers that we call cookers, because we call our processing cooking. They're going to read all the events out of Cassandra.
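As a rough sketch of the "done" rule described above (an unload, or 30 minutes without activity), the check could look something like the following. The `PageState` class and its field names are hypothetical, made up for illustration; the talk does not say how FullStory actually tracks this.

```python
from dataclasses import dataclass

IDLE_TIMEOUT_SECONDS = 30 * 60  # the 30-minute idle timeout mentioned in the talk


@dataclass
class PageState:
    """Hypothetical bookkeeping for one recorded page (all names are assumptions)."""
    last_bundle_at: float   # Unix time at which the most recent bundle arrived
    unloaded: bool = False  # True once the browser has reported an unload


def page_is_done(page: PageState, now: float) -> bool:
    """A page is done after an unload, or after 30 minutes with no new bundle."""
    if page.unloaded:
        return True
    return now - page.last_bundle_at >= IDLE_TIMEOUT_SECONDS
```

Only once `page_is_done` returns true would a page be handed to the processing ("cooking") stage.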
A
Let me ask you to leave it's only a temporary storage pools, and then once they're done with that processing they're going to make, essentially, the processing artifacts, which are just a couple of JSON files. What I call the interesting stuff having an aggregate of all is about those immigrants of the muscle rigid, which is sort of Google's version of S3, called Google Cloud Storage, and that's our end storage for all points. So Cassandra plays a little bit of an interesting role for us, because it's just like a rolling storage.
A
It wasn't our first choice, though; we actually went through a few different things for that portion of the diagram that is now Cassandra. We tried Cloud SQL for a while, just because we like Google managing things. We tried Datastore very early on, but that's very expensive. So we tried a few things. We finally ended up on Cassandra. The requirements that everything else didn't fulfill, and that Cassandra does fulfill here: first, it's got to be reliable.
A
This is our kind of front-end storage. Other things can go down, and as long as the data lands in Cassandra we can deal with it later. If Cassandra is down, we're not recording, and that's our business, so that's pretty important. And users, you know, whenever they leave or they close their browser, that's it; there's no replaying that. So we kind of got into halftime. Life is gone and I miss early on. We really take the capture-everything approach. Can we apply double to single sessions?
A
We try to capture everything a user does, but we also apply that across the whole customer base. We don't sample. Some companies will record, like, one in every 50 sessions; we want everything. You want every session, every user that you do just feel like that's the way we can. You know, if you're doing bug tracking, if you're doing support or something, it's the long tail that you care about.
A
You're not watching all of those sessions, that's too many. You're going to use search or something to find the most interesting ones, you're going to watch those, a small fraction, and you might need some summary statistics for the rest, but it means that we don't need to read these over and over again. So that's just another nice benefit. I'll talk really quickly about our schema, which is pretty simple. We have seven columns, which is what we're inserting into. These first four values are all just navigation to find a tease, sound urgent care.
A
Music, see this number just what more we talking about here, and then there's a big blob of data, and these can vary greatly in size. You'd be amazed at the horrible things people put into their pages, so we get some pretty awful ones sometimes as well. The one edge case we run into with this approach is that, because we're partitioning on page, partitions can be really long, and as you scale you kind of see the long tail appear, things if it did go.
A
Where, you know, a single-page app spit some his mishap. A user will spend hours on one page, which means a huge number of bundles. For the moment we kind of architecturally don't break that up, but now it means that if we network Costas, the page twice which he comes up, I can get kind of cramming bubbles with it. Then we've already worked out all the bundles for that page, and so we read this huge amount of tombstones. I've
A
seen worse than this online, but this kind of so-so throws only activated working about something the integration sister. Our current cluster is totally small, but you know, we're growing fast. We have a three-node cluster with persistent disks, which is network-attached storage. Everyone will tell you not to do that, and they told us too, but you know, we're only like four engineers.
A
We try and do whatever we can to make things quick and easy, and this has worked for us. We're probably a little bit overprovisioned because of that; if we used local disk we could get away with fewer nodes, but people tell. And I guess the only other thing is that we're running with TTLs on everything, which is just another nice advantage of using Cassandra. Because this is a rolling event store, nothing should live in there long. You know, you're filling up the page and you're swerving housings.
A
The page is done; pages should be done pretty quick, you know, minutes, at the most hours maybe. So if something is sitting around in there for a couple of weeks, it means that, you know, we lost a task or something through an error or whatever. So just to keep cruft from building up, it's sort of just a nice one-liner: I just throw a TTL on everything and that junk just goes away. And I did some quick math yesterday about how many terabytes of data we write each day.
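To make the "one-liner" concrete, here is a minimal sketch of an insert that carries a TTL, so that anything left behind by a lost task expires on its own. The table name, column names, and the 30-day value are all assumptions for illustration; the talk does not give the real schema or the TTL actually used. The helper only builds the CQL text and does not talk to a cluster.

```python
from typing import Sequence

# Assumed value (30 days, in seconds); the talk does not state the TTL clearly.
ASSUMED_TTL_SECONDS = 30 * 24 * 60 * 60


def insert_with_ttl_cql(table: str, columns: Sequence[str],
                        ttl_seconds: int = ASSUMED_TTL_SECONDS) -> str:
    """Build a parameterized CQL INSERT whose cells expire after ttl_seconds."""
    if ttl_seconds <= 0:
        raise ValueError("ttl_seconds must be positive")
    column_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    return (f"INSERT INTO {table} ({column_list}) "
            f"VALUES ({placeholders}) USING TTL {ttl_seconds}")


# Hypothetical table and column names, for illustration only.
print(insert_with_ttl_cql("page_bundles", ["page_id", "bundle_seq", "events"]))
```

The statement would then be prepared and executed with whatever driver is in use. Choosing a TTL well above the normal page lifetime (minutes to hours) means expiry only ever affects data that processing has already missed.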
A
The first one is just managing repairs. I honestly just didn't give this enough attention in the beginning. Now it's only if your notes are now a lot whatever I'm a great house person doesn't ago, and they were mostly up, but still normal small. You need repairs. So I did it manually for a while, and because I wasn't consistent enough, they would be really long, and it was bad what their date of Jubilee insensitive, because we she was numbers so rocket out.
A
Gator was erecting, you know upstream, will attend duplicates, but nonetheless. So now I've kind of cobbled together a cron job that does all repairs by calling JMX. It just sort of iterates, and that's gotten us by for a while. I'd really like to try incremental repairs; I hear people have good experiences with that, you know, that it's working for them, but we're not on 2.1. So that's our blocker, so it's kind of it's all back, don't read one see you again.
A
We try and pride ourselves on being scrappy and, you know, not wasting money, so I let these get too close to the limit for a while. What I was experiencing here is that CPU spikes can happen for a lot of different reasons. You know, a compaction starts up. You know, maybe elsevier people here that runs up when you started over air 15 / early days when I was not being as consistent with my repairs, or you run into a full GC. You know.
A
Maybe a motivated has to go brought overall, much SSTables to get your main status body good. It was as well things and I was willing are new to sit near that percent. You know, 59 you put too much, then we start prepare something in the kind and cicero this cascading effect, so that wasn't penalized about a bullish. My questions, you know, go ahead and invest get reserves of thinking.
A
Ditto on the disk; same thing, especially once we changed to SSDs. Oh my god. You know because again we're not growing our makeup. Only is our source grow but remember to purchase rolling over our data. So how do we make this actually work, something as a result right? In the early days, I would notice the disk usage on one of our nodes. With something happen. We were disability, others, one would have several hundred gigabytes more than the other two. That's a big deal. You know, I'm using replication factor three on three nodes.
A
They should all be about the same, for the most part. And I'd find that there were snapshots in the data directory, and I wasn't taking a snapshot. I think in retrospect what was actually happening was written some specimen of deus ex engineers, how the when compactions happen, they'll hard-link to the SSTables as they're being compacted, and so you can in the end of instead confuse me.
A
So I think I was seeing compactions as they were running, and they were linking to SSTables that were going to be removed later. So anyway, I found this quote on SlideShare about how, depending on the compaction strategy you're using, in particular if you're using size-tiered, you need a lot of temporary space while it's doing compaction, so don't run your disks near full: running out of disk space is super bad.
A
So that's where we are right now. What are we going to be doing in the future? First off, in the near term, date-tiered compaction. We were on 2.0.9, I believe, which does not have this, and just this week I'm in the process of upgrading to 2.0.13 so we can try this. You know, our data is very, very time-ordered.
A
Also, a new Cassandra cluster. This Kavya illnesses currently in development, so my name's Rose will share this one, but you know, we're optimistic. I'll briefly outline the use case here. I mentioned early on that we have that processing stage. The output of the cookers, which is the processing of the stream, is a set of JSON documents that we upload to Google Cloud Storage, and that works well for us.
A
We like that because Cloud Storage is reliable, and someone else gets paged if it goes down, and all those nice things about managed services. It also works for us because the per-gigabyte-month cost of actually using Cloud Storage is cheaper than the regular disks, what Google calls persistent disk, or Elastic Block Storage. It's actually cheaper over time. So that's why we don't just throw these files on disk or gonna kiss our customers will no doubt be like to see us.
A
The problem is that we not only pay for the storage costs, but we also pay for operations, for each request, and the number of uploads is kind of killing us financially, because for every page we record we're doing two uploads. It is kind of do deny that that's actually a lot of uploads, so we want to reduce that just to kind of get our costs down. So here is the current plan to reduce the number of GCS puts. The idea is to mux blobs; nothing groundbreaking here.
A
So exactly although math so this was there was a lot when I can see. If not can set an issue. This, though, is it what happens now we want. We have five so I, given my good silly little diagram of what one of these mux files would look like: there's a little header with things like checksums and versions, and then there's just a bunch of blobs.
A
GCS supports the Range header on HTTP calls, so I can just request back a slice of bytes, which is very convenient. But I have to know what that slice is: where's the start, where's the end. So we need some new place to store these, and that will be our new Cassandra cluster. It will basically just be keyed on the original file that you wanted to upload to Cloud Storage, so that name would be the key, and the columns you get back would be: what is the name of the mux file?
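A minimal sketch of the mux idea described above: several blobs are concatenated behind a small header, an index records where each blob starts and how long it is, and a single blob is later fetched with an HTTP Range header. The header layout, the `MUX1` marker, and the function names are assumptions for illustration; the talk mentions checksums and versions in the header but gives no actual format, and checksums are left out here for brevity. In the plan described, the index entries would live in the new Cassandra cluster, keyed by the original file name.

```python
import struct
from typing import Dict, Tuple

MAGIC = b"MUX1"                  # hypothetical 4-byte marker
HEADER = struct.Struct(">4sHI")  # assumed layout: magic, version, blob count


def pack_mux(blobs: Dict[str, bytes],
             version: int = 1) -> Tuple[bytes, Dict[str, Tuple[int, int]]]:
    """Concatenate blobs after a header; return the file and a name -> (offset, length) index."""
    body = bytearray(HEADER.pack(MAGIC, version, len(blobs)))
    index: Dict[str, Tuple[int, int]] = {}
    for name, data in blobs.items():
        index[name] = (len(body), len(data))  # offset is measured from the start of the file
        body += data
    return bytes(body), index


def range_header(offset: int, length: int) -> Dict[str, str]:
    """Build an HTTP Range header for one slice; byte ranges are inclusive on both ends."""
    if offset < 0 or length <= 0:
        raise ValueError("offset must be >= 0 and length must be > 0")
    return {"Range": f"bytes={offset}-{offset + length - 1}"}


# Two small JSON documents stand in for the per-page processing artifacts.
mux_file, index = pack_mux({"page1/events.json": b"{}", "page1/dom.json": b"[1]"})
offset, length = index["page1/dom.json"]
print(range_header(offset, length))
```

One upload of `mux_file` replaces one upload per blob, at the cost of an extra index lookup on every read.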