►
From YouTube: The Apache Cassandra® Corner podcast w/ Mary Grygleski
Description
For this episode, I sat down with Mary Grygleski of DataStax. Mary is a Developer Advocate on the Astra Streaming team, evangelizing for and helping folks with Apache Pulsar®. She has a long history as both an advocate and an engineer in tech, organizes the Chicago Java Users' Group (CJUG), and has been recognized as a Java Champion. Listen in and hear how we discussed Java, Apache Cassandra®, Pulsar, and Mary's talk for the upcoming Cassandra Summit.
A
B
A
And
I
I
know
that
I've
ended
up
on
on
your
Twitch
stream,
at
least
once
yeah
yeah.
B
Yes,
yeah:
we
we
have
to
do
more,
yeah
yeah
for
sure
for
sure
yeah
these
days,
though
I'm
hoping
that
our
audience
could
be,
you
know
kind
of.
Have
the
passion
should
return.
You
know,
after
all,
the
pandemic
thing,
but.
B
It
a
little
slow
folks,
a
little
slow,
but
please
come
back
so
yeah,
so
yeah
yeah,
yeah
I
will
be
doing
more
yeah
of
my
stream
as
well.
So.
B
So
yes,
I
am
Mary
quickleski!
Thank
you!
Aaron
and
I'm,
a
developer
Advocate
with
the
streaming
team
at
data
stacks
and
so
I
the
area
that
I
primarily
advocating
for
it
would
be
the
event
streaming
platform
Astra
streaming
data
stacks
and
then
it's
powered
by
Apache
Pulsar.
So
a
lot
of
you
know
all
of
our
things
that
data
Stacks
are
really
open,
source
powered
so
so
essentially
yeah
Apache,
Pulsar,
so
event,
streaming
kind
of
more
catered,
For,
The,
Cloud
native
platform.
B
So
that's
what
I'm
advocating
for
and
then
in
my
spare
time
or
you
know
off
hours,
I
would
be
running
community,
so
I'm
also
the
Chicago
Java
users
group
president
yeah
and
then
I'm
also
Java,
Champion
too
so
yeah
and
I'm
interested
yeah
and
I'm
interested,
not
just
in
Java,
but
everything
else
and
now
nowadays,
even
like
getting
into
AI
like
our
company,
is
doing
and
things
like
that,
and
not
just
that.
Just
a
lot
of
variety
of
different
things.
B
I
think
I'm,
just
very
interested
in
So
yeah,
so
and-
and
it's
just
some
background
2-
is
that
I
was
an
engineer
before
becoming
an
advocate
at
IBM.
That's
where
I
started
being
you
know
started
off
my
efficacy
career
in
2018,
so
it's
been
five
years,
I've
been
doing
efficacy
back
then
I
was
doing
more
Java,
focused
reactive
technology
and
now
here
doing
event
streaming
yeah
and
so
I
was
an
engineer.
B
Doing
you
know
different
products
too,
and
in
the
90s
I
was
actually
doing
Unix
and
c
and
then
in
2000,
switching
doing
Java,
Java,
open
source
and
and
then
you
know,
from
products
to
engineering
applications
and
then
now
back
to
doing
products
and
efficacy
so
yeah.
So
that's
kind
of
a
nutshell
about
me
so
yeah,
oh.
A
That's
that's
awesome,
yeah,
and
the
thing
is
too
is
that
you
know,
of
course,
Cassandra
is
written
in
Java,
so
there's
there's
actually
of
you
know.
If
you
look
at
the
Venn
diagram
of
like
Java
people
and
Cassandra
people,
I
mean
it's.
A
A
Certainly
are
a
Lacross
over
there.
I
guess
is,
is
where
I'm
going
with
that
yeah
and
actually
that
was
kind
of
where
you
know
I
was
a
Java
developer.
You
know
it
really
wasn't
my
strong
suit
15
years
ago,
when
I,
when
I
first
started
doing
it,
but
it's
like
that's.
That
was
the
same
time
that
I
got
into
Cassandra
and
you
know
just
again
learning
so
much
about
the
the
Java
ecosystem.
A
Just
because
you
know
I
I
had
to
support
Cassandra
in
that
though
it
was,
it
was
it
was,
it
was
tricky
and
you
know
just
you
know,
yeah
kind
of
like
you,
you
know
I
I,
really
kind
of
kind
of
fell
in
love
with
Java.
You
know
if
you
want
to
think
about
it.
That
way.
That's.
B
A
B
About
oh,
go
away,
but
no
it's
it's
a
very
you
know
a
long
proven
system.
It's
it's
you
know
is
all
these
things.
That
is
so
much
like
open
source
options
to
it
too,
and
the
community
too,
is
amazing
right,
so
big
yeah
yeah,
even
the
whole
Apache
software
Foundation,
you
know
most
of
the
software
is
written
in
Java
as
well,
and
the
world
is
pretty
much
all
the
back.
End
systems
are
still
powered
by
Java,
there's
just
very
robust
yeah.
It's
proven
so
yeah
yeah,
yeah,
yeah,
yeah,
I.
A
B
Yeah
good
question
I've
been
asked
to
a
few
times
and
yeah,
so
it
is
a
bit
of
an
interesting
process
in
the
sense
that
you
know
it
was
first
started
back
when
Java
software
is
still
there
and
a
few
people
who
started
the
program
back
with
sun
Microsystems,
and
then
they
were
just
you
know
always
like
by
nomination,
and
it's
like
internal
nomination
too,
but
then
over
time
you
know
then
Oracle
took
over
the
program
and
everything,
and
so
so
these
days
too
I
look
into
the
program.
B
Is
that
with
this
program,
unlike
you
know,
other
programs,
like
maybe
Google
I,
think
there's
developer,
expert,
GTE,
right
and
also
MVP
Microsoft
and.
B
Life
yeah,
but
I
think
I
feel.
The
key
is
that
your
involvement
with
the
community,
especially
the
Java
community
and
through
Java
users
group,
will
be
very
helpful
because
that's
that
seems
to
be.
You
know
the
Java
Champions
do
most
of
the
time.
These
are
folks
that
are
from
running
the
the
drugs.
B
To
kind
of
know
you
and
then
you
are
active
and
not
just
like
okay
I'm,
just
you
know
by
name
I'm,
a
drug
leader.
No
that
won't
work.
You
actually
have
to
be
involved
right
and
then
also,
if
you
speak
at
conferences
and
things
like
that
on
meetups
right,
not
just
conferences.
A
B
B
A
B
A
community
is
nice,
but
I
think
the
key
again.
You
know,
of
course
you
know.
Ideally
if
you're
programming
you're
a
technical
person
would
be
even
better,
but
the
thing
is
I'm
finding.
Also,
interestingly,
there
are
some
folks
that,
if
you're
more
like
in
other
aspects
of
you
know
working
in
the
computer
industry,
but
you
kind
of
take
an
interest
or
you
know
very
active
in
doing
promotion
of
java
you'll
be
recognized,
you're,
helping
to
run
Java
users,
group
meetups
and
things
like
that,
and
maybe
you
are
less
of
a
Hands-On
technical.
B
There's
a
variety
of
different
ways,
but
it
is
true.
You
definitely
have
to
make
yourself
known
to
the
community
and
and
be
active
too
I.
Think
that's
that's.
What
I'm
finding
yeah
is
it's
kind
of
very,
very,
very
crucial
in
this
case.
Yeah
I
found
it
so
and
also
everything
is
internally
nominated.
So
if
you
know
someone
you've
done
enough
work,
the
person
will
be
confident
they
can
nominate
you
to
kind
of
like
that.
So
yeah,
okay,
okay,
oh,
it
works,
yeah,
yeah,
good.
A
Yeah-
and
you
know,
I
completely
agree
with
what
she
said
there
about
the
Java
Community,
where
you
know
I
hadn't
actually
done
a
whole
lot
with
the
Java
Community
before
I
met.
You
and
you
know,
I
came
down
for
that
that
conference
and
in
Chicago
last
year
and
and
everyone
was
just
so
welcoming.
A
B
So
yeah
and
yeah
welcome
to
the
community
yeah
Aaron
we'd
love
to
see
you.
Maybe
you
can
restart
I'm
hinting
now
restart
the
Minnesota.
B
B
B
A
A
Actually,
you
know
I
I,
think
I
have
something
that
that
would
be
good
to
talk
about.
That
is
both
Cassandra
and
Java,
that
yeah
yeah.
No,
it's
it's
an
example.
I
built
out
I.
Think
I
sent
this
to
you
at
the
get
repo
I
did
on
building
out
the
new
Vector
search
feature
and
like,
like
a
spring
boot
service
that
that
runs
and
kind
of
serves
that
out.
You
know
and
and
helps,
helps
you
do
like
an
a
n.
A
You
know
an
approximate
nearest
neighbor
on
you
know
something
that's
been
like
tokenized
into
a
vector,
but
great
yeah
I
was
thinking
about
something
like
that.
You
know
Vector
surge
AI,
that's
kind
of
the
hot
thing
right
now.
You
know
yeah.
B
A
Mary,
let
me
ask
you
a
question:
write
your
wheelhouse
here.
You
know
since
you're
a
an
advocate
for
the
for
the
streaming
team.
You
know
so
what
what
goes
into
choosing
an
event
streaming
platform.
B
B
You
know
in
terms
of
data
in
motion
right
and
you
have
always
have
a
point
of
ingestion
so
and
then,
if
you
want
data
to
be
ingested
at
a
high
speed
and
also
you
know
again,
you
know
the
messages
won't
be
lost
and
kind
of
guaranteed
delivery
of
these
things,
and
also,
in
addition
to
are
you
going
to
operate
in
the
cloud,
especially
in
different
clusters
right
so.
A
B
Kind
of
like
the
new
things
we
need
to
also
worry
about,
so
so
I
think
you
know
if
you
know
that
you
are
ingesting
data
and
you
need
it
at
high
speed
and
they
especially
too,
if
they,
you
are
ingesting
data
from
multiple
sources
to
I,
think
that
would
be
it
a
good
time,
good
cases
that
you
want
to
like
ingest
data
and
using
an
event
streaming
platform
form,
and
actually
you
know
talking
about
which
two
you
reminded
me,
you
know
if
I,
you
know
a
lot
of
times.
B
Do
people
also
ask
you
know
what
type
of
usage
scenario
and
at
and
I'm
I
think
I
mentioned
to
you
before
that
we
recorded
started.
This
talk,
I
I'm
talking
with
a
CO
who's
based
in
Netherlands.
B
A
B
B
B
Around
the
hillside,
so
you
can
kind
of
think
of
you
know
kind
of
using
a
data
event
streaming
ingestion
platform
such
as
that,
because
your
iot
devices
are
collecting
all
the
data
they
keep
sending
to
the
server
and
then,
basically,
you
have
some
point
of
sending
it
back
into
your
app
on
the
server
and
processing
this
data,
maybe
that
that
would
be
a
good
good
use
case
scenario.
Thinking.
A
That's
a
cool
idea,
though
yeah
yeah
and
I
never
would
have,
would
have
thought
about
yeah
using
it,
for
you
know,
for
you
using
an
iot
device.
You
know
spread
out
across
a
pasture
like
that,
but
yeah,
that's
that's
cool
yeah,
so
yeah
and
then,
of
course
you
have
all
that
data
from
the
iot
streams
coming
in
and
you're.
You
know
that's
where
you
want
the
yeah,
the
guaranteed
delivery
and
the
the
exact
order
that.
B
Can
kind
of
see
that
yeah
yeah
exactly
exactly
so
yeah
yeah.
So
something
is
it's
like
an
example
where
you
can
think
of
using
Advanced
streaming
and
yeah
yeah
so
and
I'm
also
aware
too
right
fraud,
detection
system
too.
You.
B
Monitoring
any
kind
of
suspicions
activities
you
know,
based
on
the
data
you
need
to
analyze,
so
the
ingestion
part
too,
is
very
important
because
there
are
any
given
point
in
time.
You
have
so
many
different
activities
going
on
so
yeah.
You
collect
all
those
inputs,
yeah
and
through
an
event
streaming
platform
that
that
yeah.
That
would
be
also
good
use
cases
for,
for
that.
B
Yeah
and
of
course,
also
recommendation
system
right:
there,
people
doing
shopping
and
you're
collecting
all
their
clicks.
You
know
keystrokes
and,
and
then
you're
analyzing
and
sending
them
down
and
machine
learning.
You
know,
algorithms
to
analyze
their,
you
know
their
patterns
and
then
figure
figuring
out.
You
know
giving
out
recommendations
as
they
are
shopping
too
so
yeah,
so
those
you
can
think
of
event
streaming
to
me.
B
I
feel
almost
seems,
like
you
know,
if
you
kind
of
explain
this,
you
know
to
my
grandma
who
doesn't
know
you
know,
does
isn't
technology,
you
know
focus
and
you
can
say
well,
it's
sort
of
like
infrastructure
in
your
house.
You
walk
into
a
house,
you
don't
really
see
it.
These
are
pipes
that
are
running
underneath.
You
know
that
Behind,
the
Walls
and
your.
B
They
are
so
essential
too.
They
are
like
you
know,
that's
where
the
pipes
are
kind
of.
You
know
when
you
turn
on
faucet.
It
sends
out
the
water
to
you
and
it
kind
of
drains
back
to
the
output.
You
know
whatever
exhaust,
not
exhausted
waste,
you
know
pipes
and
things
like
that.
So
I
think
in
some
ways
event
streaming.
It's
kind
of
it's
like
that.
You
know
in
the
if
I
have
to
put
in
analogy
that's
what
event
streaming
the
type
of
platform.
B
A
So
hey
Mary
at
the
the
end
of
the
year
at
the
Cassandra
Summit
you're,
going
to
be
giving
a
talk
there
yeah.
Why
don't
you
tell
us
a
little
bit
about
you
know
what
your
talk
is
and
and
what
you're
gonna
cover
sure.
B
Sure
so,
yes,
so
again,
right
I'm
doing
streaming
for
data
Tech,
so
we
have
Apache
Pulsar,
so
Apache
Pulsar
too
again,
you
know
event
streaming
for
ingestion,
so
here
it
is
I'm
thinking
right.
Data
Stacks
is
not
only
about
event
streaming
I
mean,
of
course,
Cassandra
is
the
main
thing
and-
and
you
know
so,
I'm
kind
of
been
thinking
of
you
know.
There's
got
to
be
ways
of
integrating
the
two
together
and
for
sure.
Yeah
again,
you
know
so
with
event
streaming.
B
My
talk
actually
is
really
to
explore
right
about
building
a
data
Pipeline
and
then
using
Apache
pulsar
and
of
course
too.
You
know
just
like
water
too,
when
you
flow
through
you
kind
of
water
goes
through
different
treatment
plans.
So
it's
like
data
coming
through.
You
want
to
do
some
transformation.
For
example,
you
you
can
have
data
that
is
flowing
through.
That
only
sends
the
message.
Each
one
of
them
carries
some
kind
of
IDs
right
along,
but.
B
Only
ID,
so
you
need
to
do
some
sort
of
lookup
too
to
your
tables.
So
that's
where
I
think,
let's
say
Cassandra
can
come
into
play
because
that's
where
your
your
Cassandra
is
storing.
Let's
say
it's
coming
in
user
ID.
You
need
to
use
look
up
more
information
about
the
user.
So
you
then
you
know
your
data
comes
in.
You
want
to
do.
Look
up
to
your
Cassandra
database,
based
on
the
user.
B
Id,
look
up
the
information
you
know
in
your
in
a
data
store,
so
that's
actually
what
I'm
working
on
is
is
actually
building
this
pipeline,
using
both
Apache,
pulsar
and
Cassandra,
and
basically
and
from
Poster's
perspective
I'm
using
Pulsar
functions,
and
these
are
lightweight
functions
that
you
can
plug
into
your
pipeline
and
that
will
do
message,
transformation
or
selection
filtering
whatever
you
need
to
do
so.
Insight
is
just
bite.
B
Size
function
that
you
can
I
can
do
query
against
a
Cassandra
and
and
search
for
the
information
I
need,
and
then
you
transfer
form
your
data
essentially,
and
then
you
send
them
off
to
your
Downstream
to
to
your
destination.
To
a
sync
like
that.
So
essentially,
my
talk
is
about
that
and
just
coming
up
with
this
demo
scenario,
to
show
to
demonstrate
how
we
can
integrate
the
technology
together
and
essentially
provide
solution
to
solve
some
business
problems
so
yeah.
So
that's
what
it
is
in
a
nutshell:
that's
what
my
talk
is
about
so
yeah.
Oh.
A
B
Oh
yes,
yes,
yes,
you
can
you.
Can
you
can
do
that
too?
So?
Yes,
so
essentially
data
gets
sent
to
the
topic.
So
from
the
topic
too,
that's
where
you
know
you're,
you,
you
get
your.
You
know
your
subscriber
you're,
getting
the
messages
so
you're
consuming
it.
So
from
the
consuming
side
to
you
can
basically
have
you
know
your
your
functions
can
be
called
to
in
in
there
right
and
oh.
A
B
Right
so
so
you
should
be
able
to
then
yeah
do
whatever
you
need
and
then
basically
from
there,
then
it
gets
sent
Downstream
to
whoever
destination.
You
need
to
go
to,
let's
say
you're,
building
a
set
of
pipelines
that
are
all
connected
together
and
that
that's
possible
too
right
and
or
your
output
can
be
sent
to
another
topic
as
well.
So
so
that
will
be
another
topic
for
somebody
else
to
pick
up,
so
you
can
think
of
chaining
all
your
data
pipelines
together
so
I
think
in
in
this
case.
B
You're,
like
you,
know,
we're
kind
of
treating
data
like
they
can
kind
of
flow
through
the
system
quite
easily.
In
this
sense
you
know,
rather
than
you
know,
the
more
traditional
way
is.
Let's
say
you
make
restful
calls
and
things
like
that
and
all
that
stuff,
which
is
different
kind
of
approach
to
doing
things,
but
in
this
case
you're
right.
A
B
A
B
A
So
that's
a
that's
a
good
way
to
put
it
yeah.
So
within
a
pulsar
function
you
can
actually
like
connect
to
Cassandra
Oh.
B
B
That
that's
why,
within
the
function
you
should
be
able
to,
then
you
know
essentially
make
calls
to
your
Cassandra
to
any
kind
of
you
know,
external.
You
know,
storage
or.
B
To
be
able
to
do
some
lookups,
for
example,
things
like
that
so
yeah
yeah,
you
should
be
able
to
to
do
that
as
well.
So
yeah
yeah
yeah.
It's
because
the
function
is
think
of
it,
yeah
they're
bite-sized
they
they
can
be
essentially
another
kind
of
interface
or
a
place
where
you
can.
You
know
invoke
another
procedure
right.