►
From YouTube: Case Study: OpenShift at Public Health England Francesco Giannoccaro (Public Health England)
Description
Francesco Giannoccaro (Public Health England)
The OpenShift Commons Gathering was held in London, UK, and features guest speakers from local customers and users. The OpenShift Commons Gatherings brought together 300+ experts from all over the world to discuss container technologies, best practices for cloud native application developers and the open source software projects that underpin the OpenShift/Kubernetes ecosystem.
https://commons.openshift.org/gatherings/London_2020.html
A
Good
afternoon
everyone,
my
name,
is
Francesco
Gennaro
I
work
in
public
health,
England
has
heard
of
high
performance,
computing
and
infrastructure.
Today,
I
will
take
about
15
minutes
of
your
time
to
talk
about
how
we
are
using
open
source
technology
to
deliver
modern
public
health
services.
I
want
to
talk
very
quickly
about
what
public
health
England
us
is.
A
An
executive
agency
of
Department
of
Health
in
UK,
which
provide
government
local
government
and
the
public
with
evidence-based,
professional
scientific
expertise
and
support
is
an
organization
that
has
been
established
in
2013,
merging
public
health
scientists,
bringing
together
public
health
scientists
from
about
70
organisation,
one
single
body
so
currently
PG
6,000
employees.
Most
of
them
are
scientists
and
public
health
professionals.
The
mission
of
public
health
England
is
ambitious
and
inspiring.
A
The
organization
came
together
in
2013
and
since
the
beginning,
the
effort
from
a
technology
point
of
view
was
focus
supporting
the
scientific
community
within
the
organization.
So
PA
G
deliver
a
wide
range
of
public
health
services
that
span
from
research
and
scientific
publication
based
on
statistic.
A
Mathematical
model
such
as
special
metal
population
model
for
test,
miss
transmissible
disease
like
the
normal
flu,
but
also
more
aggressive
pathogen
like
Ebola
or
the
corona
virus.
That,
unfortunately,
during
these
days,
is
on
the
news
for
the
outbreak
in
China,
as
well
as
research
and
scientific
publication
through
predictive
models
applied
to
anthrax,
as
well
as
inference
problem
able
to
infer.
You
know
the
decides,
the
likely
size
of
how
it
works
and
the
location
of
sirs
and
special
extent,
X,
etcetera.
A
Another
R
of
service
that
PG
deliver
are
around
pathogen
genomic
service
that
we
deliver
to
hospital
based
on
the
whole
genome,
sequencing
for
essentially
pathogen
identification,
pathogen
typing
surveillance
and
outbreak
investigation,
essentially
PG
receive
biological
samples
from
hospital
and
use
those
technology.
The
whole
genome
sequencing
technology
to
analyze
the
biological
sample
and
identified
the
pathogen
that
possibly
is
affecting
patients
can
be,
of
course,
aggressive
pathogen
in
three
years
between
2014
and
2018,
PGs
analyzed
more
than
a
hundred
thousand
bacteria
and
virus
genome.
A
In
addition,
we
deliver
service
directly
to
the
public
through
campaign
to
increase
awareness
around
cancer,
obesity,
smoking
and
other
world
being
behavior.
The
technology
ecosystem
that
was
used
in
PG
initially
was
very
much
focused
on
a
restricted
number
of
proprietary
technology,
primarily
to
support
the
business-as-usual
type
of
IT.
A
We
in
this
effort
of
supporting
the
scientific
community
started
looking
into
a
new
set
of
technology,
and
we
wanted
to
stay
focused
on
open
source
more
than
proprietary,
because
we
see
open
source
in
being
very
much
in
line
with
the
mission
of
the
organization
making
as
open
as
possible.
The
science
that
we
work
with
and
keeping
open
standards
so
that
the
results
can
be
easily
shared
with
the
scientific
community
around
the
world
and
I
will
talk
brought
very
lightly
about
those
technology.
A
This
type
of
workload
is
very
I,
oh
intensive,
so
the
workload
manage
large
amount
of
data,
so
the
environment
has
to
be
capable
of
facing
high
throat
boots.
Another
area
that
is
making
use
of
high
performance
computing
is
statistic:
modeling
and
economic,
where
we
run
real
time
models
and
simulations
to
predict,
expected
pandemic
disease
dynamics,
supporting
national
vaccination
policy
and
control
of
antimicrobial
resistance.
So
the
understanding,
essentially
how
the
mutation
of
bacteria
and
Mars
are
developing
resistance
towards
antibiotics.
A
So
the
the
open
source
technology
that
we
have
identified
to
support
this
specific
niche
and
the
high
performance
computing
has
been
OpenStack.
Openstack
is
a
cloud
technology,
specifically
infrastructures,
a
service
type
of
cloud
technology,
and
it's
been
the
first
new
technology
that
we
have
been
introduced.
But
we
also
spoke
out
larger
program
covering
a
number
of
different
area.
As
I
say,
pg
produce
very
large
amount
of
data
is
data
drive
an
organization
so
for
us
was
very
important.
A
So,
in
addition
to
the
traditional
virtualization
platform
that
we
were
using
since
the
beginning,
the
hammer,
overt
rev
and
an
open
stack,
we
started
also
using
a
little
bit
public
cloud
environment
like
AWS
as
your
in
Google
compute,
and
the
idea
was
to
provide
a
single
pane
of
glass
for
user
to
be
able
to
deploy
system
in
each
of
those
underpinning
environment
from
a
single
set
of
API
and
a
single
web
front-end.
And
we
have
chose
for
that
project.
An
open
source
technology
called
manage
IQ
measure.
A
Iq
is
the
upstream
name
and
cloud
form
is
the
version
that
we
use,
which
is
supported
by
Red
Hat,
and
the
last
area
and
project
was
related
to
deploying
platform
as
a
service
to
support
contour,
eyes
application,
and
for
that
project
we
chose
to
use
okd
openshift,
and
that
is
one
of
the
topic
of
this
presentation.
I
think
it's
important
to
again
emphasize
the
amount
of
data
that
the
organization
manage,
because
clearly
the
decision
in
terms
of
technology
that
we
chose
have
been
dictated
by
the
amount
of
storage.
A
So
when
we
talk
about
using
private
cloud
technologies
versus
public
cloud,
techno,
a
hybrid
cloud
multi-cloud,
you
are
driving
in
choosing
one
to
the
other
on
the
basis
of
the
requirements.
So
clearly,
if
you
have
to
move
the
petabyte
to
the
storage
in
a
cloud
in
a
public
cloud
environment,
there
are
constrained
and
the
cost
model
are
slightly
different.
Then
in
case
where
your
workload
is
CPU
intensive,
the
amount
of
data
in
the
space
of
Life
Science
is
growing.
It's
constantly
growing.
A
We
produce
about
25
petabytes
of
data
worldwide
every
year
and
the
amount
of
data
related
to
sequencing.
Dna
is
doubling
every
7
month
and
is
taking
over
other
scientific
topics
like
astronomy.
The
migration
introduction
of
open
source
cloud
technology,
both
on
infrastructure
as
a
service
level
or
platform
as
a
service
level,
can
be
challenging,
especially
if.
A
You
don't
have
since
the
beginning
the
rhein
skill
set,
so
we
have
approached
the
introduction
of
these
technology,
specifically
using
this
user
case
related
to
high
performance
computing.
High
performance
computing
is
itself
a
technology
that,
since
the
beginning,
is
designed
to
allocate
reserves
in
an
elastic
way.
So
the
the
software
stack
that
you
have
in
an
HPC
environment
already
has
a
job
scheduler
capable
of
looking
at
the
available
resource
in
your
cluster
and
allocating
job
to
the
nodes
that
are
available.
A
So
the
introduction
of
OpenStack
has
been
very
easily
to
deploy
and
to
support
this
type
of
requirements,
because
essentially
the
three
bare
metal
cluster
that
we
had
in
public
health
England
since
the
beginning.
Instead
of
relying
only
on
the
bare
metal
compute
note,
when
those
compute
nodes
were
fully
saturated,
they
were
able
of
burst
additional
compute
capacity
on
these
sherrod
on-premise
cloud
environment,
running
on
OpenStack,
and
once
the
job
were
executed
and
completed.
A
The
cloud
instances
that
were
deployed
to
be
part
of
the
cluster
was
released
and
made
available
for
other
workloads,
but
that
type
of
requirements
solve
with
the
use
of
infrastructure
as
a
service
type
of
cloud.
Wasn't
the
only
requirement
we
had
in
public
health
England
and
after
that,
we
start
looking
into
how
to
make
more
cost
efficient.
A
The
set
of
hardware
and
research
that
were
used
by
the
legacy
application,
essentially
the
the
web
application
that
we
use
to
present
to
the
public
and
to
share
with
other
organization
that
the
results
of
the
research
and
analysis
that
we
do
so.
The
results
of
course
is
through
internet
and
we
use
web
application
to
to
share
those
results.
A
A
Also
behind
because
if
the
machine
was
updated,
that
updating
process
was
going
to
update
library
and
the
for
breaking
the
application
that
was
using
the
legacy
library.
So
the
current
use
of
container
has
been
identified
to
solve
this
issue.
There
are
similarity
between
the
requirements
that
we
have
in
the
web
application
with
application
that
we
use
in
HPC,
meaning
that
in
the
HPC
environment,
we
also
use
a
different
version
of
pipeline
different
version
of
workload
which
rely
on
different
version
of
library
in
in
the
HPC.
A
The
building
of
container
it's
a
process
that
can
be
automated
or
can
be
you
know,
requires
several
step.
The
automation
in
the
container
lifecycle,
it's
one
of
the
most
relevant
aspect
in
managing
the
lifecycle
of
the
application
that
the
NT
containers,
the
use
of
the
container
engine
itself,
doesn't
provide
the
mechanism
to
automate
the
container
lifecycle.
So
there
are
a
number
of
other
technology
around
the
container
engine,
anode
oka,
that
need
to
be
leveraged
in
order
to
automate
the
process
and
the
lifecycle
of
the
containers.
A
We
have
been
seeing
the
the
technology
that
we
were
using
in
other
space.
Very
much
useful
in
that
process
and
openshift
is
integrating
many
of
this
technology
in
one
single
platform
very
well
integrated.
So
we
were
using
already,
for
instance,
geek
lab
not
only
to
do
version
of
the
code,
but
also
to
trigger
operation.
When
you
know
called
code
is
moved
into
stage,
production
branch
and
so
on.
A
So
the
use
of
that
the
learning
curve
that
we
were
facing
was
having
benefits
in
using
OpenShift
because,
in
addition
to
the
container
engine
you
have
since
the
beginning
a
number
of
other
tools
that
are
already
integrated,
the
that
that
set
of
tools
made
possible
to
automate
the
application
lifecycle.
So
the
the
building
process
of
the
container,
and
essentially
when
a
new
version
of
the
code
is
pushed
into
gate
lab
in
our
gate,
lab
that
trigger
through
her
web
book
operation
on
OpenShift.
A
That
then
take
care
of
rebuilding
the
image,
publishing
the
image
in
the
register
and
then
having
the
running
application
being
updated
with
a
very
latest
version
of
the
image
that
has
been
built.
The
security
aspect
are
also
very
well
managed
by
the
the
platform
in
situation
like
public
health
England,
where
sensitive
data
are
managed
is
impossible
to
think
of
using
a
public
registry
like
docker
hub,
because
the
risk
of
having
vulnerability
in
the
container
is
very
high.
A
So
the
having
register
that
this
contest
constantly
scan
for
security
vulnerability
is
another
functionality
implemented
in
this
platform
that
made
possible
the
use
of
this
technology
within
PA
G.
So
you
have
in
one
single
environment,
they
are
the
automation
process
of
the
application
that
we
started
with.
You
know
focusing
on
legacy
application
wherever
possible
migrating
into
subsets
of
container.
But
of
course
also,
you
know
supporting
completely
new
web
application
that
are
designed
to
have
a
cloud
native
architecture
since
the
beginning
using
this
approach
of
micro
services
since
the
beginning.
A
So
this
is
the
the
developer
are
empowered
and
facilitating
in
their
in
maintaining
the
the
application
lifecycle,
but
also
the
infrastructure
team.
The
operators
have
significantly
simplified
the
work
through
the
level
of
the
integration
and
automation
that
is
provided
in
this
platform,
because
when
you
think
about
deploying
an
application,
you
have
to
think
about
networking
aspect.
A
All
the
integration
are
deployed
and
orchestrated
in
a
very
integrated
way
within
the
platform,
and
this
give
us
the
ability
to
deploy
on
premise
as
well
as
off
premise.
So
we
at
the
moment
we
are
using
the
platform
essentially
for
two
project.
One
is
in
the
space
of
surveillance,
meaning
you
know,
surveillance
of
heartbreak
and
this
type
of
environment
may
need
to
scale
very
quickly
across
large
set
of
machines,
so
the
ability
of
deploying
essentially
having
a
level
of
portability
that
containers
offer
clearly
may
make
the
difference
in
this
situation.
A
So
we
currently
run
those
type
of
workloads
on-premise,
but
the
freedom
of
redeploying
openshift
off
premise
in
a
public
cloud
give
us
that
level
of
freedom
that
is
difficult
to
have
another
solution,
and
that's
essentially,
is
I.
I.
Think
the
the
powerful
aspect
of
those
technology
is
is
made
possible
by
the
effort
of
the
open
source,
community
and
I.
A
Think
this
is
the
main
message
that
you
know.
I
would
like
to
give
the
ability
of
delivering
this.
This
is
made
possible
by
the
effort
of
you
know
many
people
around
the
role
that
working
on
open-source
technology
and
and
really
we
take
for
run
this
many
times.
But
if
you
think
about
what
our
everyday
life
would
have
been,
if
the
World
Wide
Web,
that
was
patented
or
if
the
human
genome
was
the
intellectual
property
of
a
single
company
which
almost
happen
clearly,
the
entire
society
as
we
live
today
was
going
to
be
different.