►
Description
CHAOSScon North America 2018, Vancouver, Canada
28 August 2018
Speakers:
- John Hawley | Open Source Engineer, VMware
- Alex Courouble | Open Source Engineer, VMware
Talk: Pains and tribulations of finding data
A
All
so
we
have
12,000
different
repositories
in
there
are
be
immersive.
We're
then
also
tracking
repositories
from
everyone
else
that
we
can
do
comparative
them.
So,
to
put
this
into
perspective,
we
have
two
terabytes
of
walk,
get
down,
how
many
of
your
tools
that
are
currently
out
there
will
process
this
in.
A
A
Simplification
of
the
API
put
your
uterus
will
be
practice
that
asking
the
API
this
is
pointlessly
stupid
if
for
no
other
reason
that
takes
16
in
college
to
fetch
a
single
Linux
kernel
tree,
even
though
hypothetically
until
only
take
eight
but
the
API
slows
down.
As
you
return
to
the
tree
because,
as
we
said,
github
holds
the
entire
git
repository
into
memory
source
we're
playing
for
commits.
You
want
and
then
tear
the
whole
thing
down
to
the
waiting
asking
the
next
hundred
queries.
A
They
found
that
the
api's
distant
is
that
once
in
a
while
they'll,
just
not
tell
you
something:
if
you're,
if
you're
asking
order
stuff
out
of,
might
be
the
streaming
API
they
just
magically,
it
never
hits.
So
your
work
also
assuming
that
there's
some
sort
of
eventually
consistent
data
bases
background,
but
then
it
just
never
tells
you
want
something
saved,
but
we
have
noticed
that
of
it
has
literally
never
made
it
through
that.
You
then
have
to
go
back
weary
to
one
company,
which
is.
A
Exactly
so
yeah
and
it's
just
obviously
painfully
slow,
where
it
is
sometimes
contain
upwards
of
three
to
get
anything
useful
so
hip
hop,
is
a
pain
in
the
butt.
Let's
see
what
else
is
out
there,
there
are
things
like
gh
archive.
This
is
beautiful
and
wonderful.
It
has
X
out
that
it
doesn't.
It
basically
get
to
be
sleep
which
we,
which
we
already
know.
Sometimes
it
is
missing
the
event
data
out
of
the
item,
string
they're,
also
really
specific
about
the
ones
to
only
track
open
source.
A
A
A
A
A
B
B
Security
projects
and
we
wanted
to
modify
their
and
medium
run
into
some
issues.
There
too,
so
there's
two
types
of
product
or
medium
in
country
to
github,
hosted
projects
really
gonna
go
scale,
especially
when
you
have
use
so
the
missions
everyone's
using
into
that
truck
there
for
us
really
using
issue
the
comments,
and
we
get
a
pretty
good
idea
of
how
we're
doing.