►
From YouTube: GitLab MLFlow Integration - What Why and How
Description
As we gear up to start exploring an integration between MLFlow and GitLab, we want to talk about why we are doing this in the first place, and where do we want to go
Epic for feedback: https://gitlab.com/groups/gitlab-org/incubation-engineering/mlops/-/epics/9
A
Hello,
everyone,
my
name,
is
eduardo
and
today
we're
going
to
be
talking
about
kit
lab
and
in
outflow,
and
if
you
follow
my
updates,
I've
been
talking
about
mfl
for
a
while
and
I'm
going
to
start
working
on
it
on
integration,
and
I
think
it's
important
to
take
a
little
bit
of
time
to
talk
about
what
why
and
how
we
are
doing
this
integration.
A
First
of
all,
what
is
mflo
mflow
is
a
modern
registry.
Motor
registry
is
a
component
on
the
ammo
flow
mlab
stack
responsible
for
managing
the
model,
the
machine
learning
models
that
we
create.
So
what
is
available
in
production,
or
not
the
list
search
bureau
upon
all
the
things
that
usually
do
with
a
registry
same
that
you
do
with
the
docker
registry.
A
It's
a
very
common,
it's
a
very
popular
open
source
project
and
yeah,
but
more
important
that
what
it
is
is
what
does
it
enable
teams
to
do?
First
of
all
track
experiments
when
we
create
machine
learning
models,
a
big
important
part
of
this
is
hyper
parameter
tuning.
We
create
a
code,
but
the
parameters
that
you
pass
to
the
code
can
make
a
lot
of
difference
on
on
the
output.
A
So
what
we
do
is
we
create
an
experiment
where
I
run
this
training
multiple
times
and
with
different
parameters,
whatever
float
does
whatever
flow
makes
it
easy
to
do
is
giving
all
of
these
runs
that
you
just
had
makes
it
easy
to
compare
and
explore
your
hyper
parameter
space.
So,
for
example,
here
I
can
compare
these
two
hyper
parameters
on
one
or
more
different
metrics
that
I'm
interested
on,
and
that
makes
it
a
lot
easier.
A
A
You
can
roll
back
all
those
classic
necessary
things
to
do
with
with
packages
and
having
that
model,
then
it
becomes
really,
and
since
this
model
is
a
flow
packages,
is
kind
of
like
a
pattern
or
a
cask,
so
it
makes
it
really
easy
to
create
a
server
that
will
create
predictions
for
your
model
and
it
makes
it
really
easy
to
deploy.
A
So
my
flow
already
has
a
lot
of
functionality
for
this
both
to
serve
this,
but
also
to
deploy
machine
learning
models
in
to
sagemaker
and
to
azure
not
to
gcp
yet,
but
it
makes
it
really
easy
to
go
through
the
entire
flow
of
creating
testing
and
deploying
your
managing
and
deploying
your
machine
learning
models,
that's
really
cool,
but
why?
A
A
A
And
how
we
will
work
on
this,
so
the
three
underlying
guidelines
that
we
have
in
mind
is
first
self-managed.
It
should
work
for
some
managed
it's
not
only
for
gitlab.com.
This
is
also
for
users
that
deploy
behind
their
own
servers.
So
it's
self-managed.
First
second
point:
it
should
just
work.
This
shouldn't
be.
A
A
We
want
to
one
make
emma
flow
better
by
being
part
of
gitlab.
We
want
to
make
ammo
flow
easier
to
install
easier
to
use
easier
to
manage,
but
also
we
want
to
make
gitlab
better
by
having
ammo
flow
integrated.
A
So
when
to
surface
the
right
information,
for
example,
how
can
I
use
ammo
flow
at
code
review?
How
can
I
use
ml
flow?
What
information
can
I
show
from
memo
flow
on
the
commit
page
of
a
model
or
so
on
so
forth?
So
there
are
many
places
where
we
can,
where
we
can
help
with
this.
It's
a
multi-game
thing
and
the
plan.
So
this
is
how
it
start.
The
first
milestone
that
I'm
looking
at
here
is
storm
of
flow
as
a
component
on
gitlab
installation.
A
So
when
you
install
gitlab
or
your
jdk
instance,
you
also
the
same
will
have
a
component
forget
a
gitaly
or
whatnot.
You
have
a
component
for
mlflow.
It
is
installed
together.
A
Second,
milestone
authentication
through
piggybacking
on
projects.
So
when
you
open
a
project,
it
will
have
your
own
mlflow.
Each
project
has
its
own
ammo
flow.
Installation
and
authentication
is
based
on
the
project.
So
you
get
this
for
free
and
second,
you
also
have
to
configure
where,
where
stor,
where
artifacts
are
stored
for
mflow,
we
can
use
gitlab
storage
for
this,
and
this
is
the
first
part
in
the
installation.
The
second
part
is
usage,
how
the
users
actually
use.
So
you
connect
the
first
milestone
for
usage.
A
You
can
open
emma
flow
ui
through
gitlab.com,
slash
my
project,
slash
mflow,
so
you're
going
to
have
the
ui
that
I
showed
before
second
milestone.
Well,
the
same
milestone
actually
is:
each
project
has
its
own
tracking
server,
so
it's
gitlab.com
myproject.mlflow
that
would
be
best
when
you're
creating
the
model
so
that
you
can
track
the
runs
it's
central,
gitlab
and
over
time
at
the
beginning.
The
first
milestone
is
to
use
emma
flow
ui,
but
we
want
over
time
to
rely
less
and
less
and
less
on
the
malfoy
ui
and
migrate.
A
All
the
information
into
the
gitlab
ui
so
use
mflow
as
a
backend
for
this,
but
the
front
end
will
be
gitlab
and
that's
what
I
had
so
if
you
have
any
feedback,
this
is
the
link
I
can
for
the
epic
of
4ml
flow,
I'm
accepting
any
ideas,
any
opinions,
any
I
don't
know
whatever
you
want
to
say
it's
very
early
and
of
course
this
is
incubation
engineering.
This
is
an
exploration
that
we
are
doing.
We
want
to
explore
if
this
is
viable
or
not.