►
From YouTube: Education & Workforce WG
Description
S.E.E.D.S Outcome: Evaluation and Demonstration of Open Data Portal Technology for Smart Cities and Data Science for Social Good
Presenter: Seema Iyer
University of Baltimore
A
Thanks
so
much
for
the
invitation,
it's
really
nice
to
meet
all
of
you.
We
are
pretty
new
to
the
south
big
data
hub
family.
I
guess
we
got
this
seed
grant
for
the
first
time
a
year
ago,
a
year
and
a
half
ago,
but
so
it
is,
I
think,
a
good
I'm
glad
I
went
after
dr
dennis
because
I
think
we're
coming
at
this
issue
as
a
pedagogical
tool.
A
Having
done
the
work,
we
actually
I'm
not
a
teacher
of
data
science,
but
I
use
data
science
in
the
project
that
I'm
going
to
talk
about,
and
then
we
realized
that
if
we
were
going
to
need
students
and
future,
you
know
staff
for
the
project.
We
need
to
teach
what
we've
learned
over
the
last.
You
know
15
years
and
create
a
curriculum
to
do
the
work,
and
I
will
say
you
know
every
the
discussion
data
people
think
of
as
just
a
bunch
of
numbers,
but
these
are
people's
lives
right.
A
These
are
real
human
stories,
literally
behind
every
single
data
point,
and
this
is
kind
of
the
approach
that
we
take
with
all
of
our
data,
because
it
is
about
action.
It
is
about
empowerment,
it
is
about
taking
that
knowledge
that
you
gain
from
the
data
and
using
data
science
skills
in
order
to
do
something.
A
It's
not,
it
is
about
analysis,
but
it's
also
about
action
and
that's,
I
think,
what
most
of
our
students
are
concerned
about
and
most
of
the
people
that
come
into
our
data.
For
so
I
want
to
talk
a
little
bit
before
I
get
into
the
actual
seed
grant
as
to
why
we
even
came
into
this
work.
I
oversee
a
project
called
the
baltimore
neighborhood
indicators
alliance,
and
this
was
a
project
that
was
created
in
1999,
as
in
different
cities
around
the
country
through
grant
foundation's
foundation.
A
Grant
sorry
so
the
annie
e
casey
foundation
was
the
biggest
one.
There
was
other
gran
foundations
around
the
country
that
were
trying
to
get
data.
You
know
newly
electronic
data
coming
out
of
government
e-government
was
relatively
new
in
the
1990s
and
gis
computing
power
was
also
relatively
new
in
in
the
ubiquitousness
of
it.
By
the
time
the
1990s
comes
around,
and
so
looking
at
spatial
data,
that's
being
generated
by
all
these
e-government
platforms
and
then
the
computing
power
with
gis.
A
So,
just
by
way
of
background,
I'm
an
urban
planner
I
happen
to
have
a
quantitative
background.
My
undergraduate
degree
is
in
mathematics,
but
as
an
urban
planner,
it's
always
about
taking
data
to
understand
the
current
situation
and
using
it
to
plan
for
the
future,
whatever
scale
we're
talking
about-
and
I
normally
work
in
urban
areas,
but
this
could
apply
to
you
know
any
local
jurisdiction
and
then,
of
course,
within
an
urban
area.
A
I'm
going
to
be
talking
about
neighborhoods
and
neighborhoods
are
essentially
a
a
system,
an
orga
or
organism
and
system
that
has
everything
happening
to
it.
Right,
education,
crime,
housing,
health.
Everything
is
happening
in
that
unit
of
analysis,
which
is
essentially
a
neighborhood
and
it
operates
like
a
system,
and
so
what
we're
after
is
creating
what
are
called
community-based
indicators
to
help
people
understand.
A
You
know
analogously
kind
of
the
the
physiology
of
a
body
we're
trying
to
get
at
the
ecometrics
of
a
place,
and
so
that's
the
goal
of
of
all
of
our
projects.
There
are
now
30
cities
around
the
country
that
are
part
of
the
national
neighborhood
indicators
partnership.
A
We
were
the
sixth
member
at
the
time,
but
you
can
see
kind
of
the
gamut
of
cities
around
the
country
that
have
the
similar
approach
we're
pulling
out
what
are
called
administrative
data
sets
from
governments
at
all
scales,
local
state,
federal
augmenting
that
with
some
proprietary
data,
obviously
census
data
and
then
pushing
it
back
out
for
the
communities
to
use.
So
this
is
what
we
do.
A
Every
year
we
issue
the
vital
signs
report,
which
is
a
compendium
of
over
a
hundred
different
indicators
across
these
different
topics,
again
thinking
of
a
neighborhood
as
a
system,
so
every
neighborhood
gets
a
battery
of
indicators
over
110
indicators
every
year,
and
so
you
can
see
trends
what's
the
crime
rate
over
the
last
20
years.
What's
the
number
of
children
that
are
in
the
school
system
over
the
last
20
years?
A
What's
the
median
sales
price
median
income,
whatever
you
want
to
know
in
a
comprehensive
fashion
about
any
particular
neighborhood,
but
so
you
can
imagine,
we
have
20
years
110
different
indicators
and
we've
broken
up
the
city
into
what
are
called
55
communities.
A
A
A
A
So
we
have
all
of
this
data
and
we've
learned
a
lot
and
figured
out
a
lot
and
we've
wanted
to
figure
out.
How
do
we
actually
create?
Basically,
an
onboarding
process
right
if
you
want
to
bring
anybody
into
our
project,
it's
a
little
bit
of
a
nightmare
for
new
staff,
because
we
don't
really
have
standard
operating
procedures
because
things
change
all
the
time.
A
So
these
are
the
kinds
of
people
over
the
years
as
a
manager
that
I've
had
to
bring
on
board
to
the
project,
because
if
you
can
imagine
it's
data
acquisition,
data
processing
data
analysis
and
then
that
communication
piece,
which
sometimes
our
data
scientists,
don't
know
how
to
do
the
communication
piece.
But
you
need
some
way
to
kind
of
pass
the
baton
across.
So
this
is
literally
all
of
the
kind
of
different
skills
sets
that
we
have
on
staff
throughout
the
years.
A
And
you
can
see
data
scientists
is
not
necessarily
a
one
of
the
things
that
we
do,
but
it's
kind
of
on
everything
that
we
do
and
so
in
2019
2020
we
were
awarded
a
hdr
grant
from
nsf,
which
some
of
you
might
have
really
thinking
about.
How
do
we
create
a
curriculum
to
help
people
understand
what
we
do
across
any
interdisciplinary
field?
And
so
we
have
a
hdr
grant
with
umbc
here
in
the
baltimore
region.
A
Towson
state
umc
is
university
of
maryland
baltimore
county,
I'm
at
the
university
of
baltimore,
towson
state
and
bowie
state,
and
we
are
all
part
of
the
university
of
maryland
system
of
schools.
So
we
have
that
kind
of
shared
ability
to
work
together
and
create
co-curriculum,
and
we
wanted
to
create
a
way
for
a
standardized
way
across.
All
of
our
skill
sets
to
help
students
understand
how
to
use
their
skills
for
social
good
again
that
actionability
part
of
the
equation,
and
for
me
it's
how
do
I?
A
How
do
I
create
a
set
of
people
that
can
come
in
and
actually
start
working
for
me?
So
it
was
a
little
bit
self-serving.
I
guess
in
that
way,
and
we
and
it
literally
worked
one
of
the
thousand
each
one
of
our
institutions.
Every
year
chooses
five
students
to
be
a
part
of
the
data
core
and
we
have
a
joint
curriculum
across
all
four
of
our
institutions
with
those
20
students
and
I
just
hired
one
from
towson
university
to
join,
to
join
our
group.
A
So
it's
working,
that's
a
good
thing
and
so
the
way
that
the
pr
project
works-
and
I
I
can't
put
this
in
the
chat
this
link
in
the
chat
when
I'm
done
with
the
screen
share.
But
the
way
it
works
is
that
we
get
non-profits
as
part
of
the
project.
We
get
non-profits
or
governmental
agencies
that
have
a
problem,
and
then
we
use
our
data
science
skills
and
we
assign
them
to
that
problem
and
actually
help
them
work
through
the
issue
and
so
in
data
science.
A
What
we've
told
them
that
data
science
is
is
a
set
of
tools.
The
problem
is
fixed
or
the
problem
might
change
over
time,
but
to
answer
that
problem,
you're
going
to
need
to
kind
of
dip
into
your
toolbox
to
figure
out
how
to
actually
answer
that
question
and
that's
what
we're
here
to
help
you,
because.
B
A
A
We
obviously
are
a
pretty
high
gis
shop,
because
all
of
our
data
has
to
have
some
spatial
aspect
to
it
and
we
do
not
have
a
gis
degree
at
the
university
of
baltimore,
there's
a
gis
degree
at
towson
and
a
gis
degree
at
umbc,
and
so
we
are
trying
to
teach
our
data
science
students
who
are
learning.
You
know
in
the
information
science
and
applied
information
technology
field,
we're
kind
of
augmenting.
A
What
they're
learning
in
the
classroom,
which
is
python
data,
cleaning,
that
kind
of
thing
and
and
adding
in
a
gis
kind
of
component
to
their
skill
set.
They
don't
generally
have
too
much
by
way
of
statistical
background
and
we're
not
really
an
r
shop.
Yet
we're
probably
going
to
eventually
need
to
move
there
and
the
university
is
an
spss
licensed
location,
so
we
do
teach
them
spss
for
now
and
then
we
walk
through.
You
know
the
full
data
science
process
with
with
our
students.
What
is
the?
A
What
is
the
question
that
you're
trying
to
ask
right,
and
sometimes
you
know
our
data
with
the
vital
science
data?
We
have
a
question
that
we
want
to
ask
sometimes
the
non-profits
that
we're
working
on
or
the
city
that
we're
working
on
the
city
partner
that
we're
working
with
has
a
project,
and
then
you
have
to
figure
out
okay.
A
If
this
is
a
question
that
I
want
to
try
to
answer,
what
is
the
data
that
I'm
going
to
use
to
try
and
answer
that
and,
like
I
said,
we
focus
in
on
that
administrative
data
set,
but
we
obviously
talk
about
other
kinds
of
data
that
might
be
out
there,
focusing
on
data
processing,
making
sure
that
you're
looking
at
the
data
making
sure
that
one
data
set
matches
another
data
set,
making
sure
that
you
can
actually
do
the
analysis
and
set
up
the
data
in
the
proper
way
and
then
think
about
what
kind
of
analysis
you
actually
want
to
do.
A
Is
it
a
gis
based
analysis?
Is
it
some
other
statistical
kind
of
analysis
and
then
we
kind
of
force
our
students
to
work
on
the
communication
the?
So
what
and
not
just
so
what
from
a
policy
point
of
view?
But
what
did
you
learn
through
the
process?
What
skill
did
you
have
to
pick
up
in
order
to
do
this?
A
Work
like
one
of
our
students,
had
never
accessed
an
api
before
so
she
literally
wrote
the
code
and
put
in
a
blog
here's
how
I
went
and
got
an
api
call
just
in
order
for
me
to
do
the
coding
that
I
needed
to
do
and
that
you
know
little
bit
of
information
is
really
important
for
the
next
student
that
comes
on,
because
they're
going
to
know
that
they
need
these
very
specific
things.
In
order
to
do
the
analysis,
so
the
communications
is
a
little
bit
about
you
know.
A
What
do
you
want
the
next
student
to
know,
based
on
what
you
learned
through
the
process,
and
so
now
we
have
two
years
worth
of
these
kind
of
blogs
that
the
students
have
put
together
and
that's
being
used
so,
like
I
mentioned
they're
working
and
in
some
cases
creating
data
they're,
creating
data
that
can
then
have
that's
been
cleaned
right.
It's
gone
through
that
process.
A
It's
been
processed
and
we
didn't
want
to
lose
that
data,
because
it's
kind
of
valuable
data
they've
only
analyzed
it
for
one
particular
question
right,
but
now
that
the
data
has
been
cleaned,
it
could
be
used
for
all
kinds
of
different
things,
maybe
either
with
that
organization
or
with
somebody
else,
and
in
most
cases
by
the
time
it's
been
processed,
it's
been
aggregated
or
it's
certainly
been
de-identified
if
there
was
any
identification
on
there
to
begin
with.
A
So
it's
actually
clean
good
data
for
other
students
to
use,
and
so
we
wanted
to
create
a
way
for
them
to
host
their
data
once
they're,
so
at
the
end
of
their
project,
not
only
communicate
but
also
host
their
data.
Some
of
our
students,
for
example,
have
created
scripts
that
you
can
pull
data
out
of
the
open
data
portal
and
you
can
kind
of
routinely
run
the
script
and
keep
adding
to
an
open
data
portal,
and
so
we
wanted
to
come
up
with
the
you
know.
Which
portal
are
we
going
to
use?
A
We've
got
four
different
institutions
with
20
different
students.
No
we're
not
going
to
have
a
manager
of
publishing
data,
so
we
have
to
distribute
how
that
publishing
occurs.
What
do
we
have
access
to
in
terms
of
platforms?
How
much
would
it
cost
and
and
really
think
deeply
about
how
we're
going
to
set
up
an
open
data
portal
for
the
baltimore
data
science
core,
so
that
other
students
in
the
region
can
use
what
they've
done
so
basically
kind
of
moving
forward
and
creating
a
resource?
A
Well
beyond
the
20
students
that
we
get
every
year,
and
so
that
was
what
we
use
the
seed
funding
for
to
create
a
series
of
webinars
that
all
of
you
now
have
access
to
the
webinars
are
amazing.
We
asked
kind
of
people
what
kind
of
platforms
people
wanted
to
use
or
think
about
or
learn
more
about
in
terms
of
open
data
platform
technology.
A
And
then
how
can
we
get
our
our
data
science
just
to
publish
onto
the
platform?
And
these
are
the
kinds
of
questions
that
people
had
if
they're
thinking
about
creating
an
open
data
portal,
which
I
think
you
all
probably
have
too
you
can
see,
you
know
how
do
you
create
visualizations?
How
do
you
stop
bots?
What's
the
security
privacy?
A
How
do
you
create
metadata?
How
do
you
upkeep-
and
you
know
how
do
you
engage
other
civic
tech
organizations,
so
I'll
put
the
link
to
the
to
the
webinars
in
the
chat
too,
but
we
did
three
different.
We
did
hubzero,
we
did
c
can
and
we
did
esri
online
for
the
edgery
data
portal
hub
and
for
each
one
of
our
sessions.
We
asked
each
one
of
our
respondents
or
or
speakers
to
try
to
answer
as
many
of
these
questions
as
possible.
How
do
you
handle
multiple
publishers?
A
What's
a
process
for
metadata
and
documentation?
How
do
you
archive
when
something
is
like
old?
How
do
you
customize
a
platform?
How
do
you
integrate
for
data
visualization
and
what
are
the
maintenance
costs?
So
if
you
are
thinking
about
doing
any
one
of
these
or
or
just
look
at
all
of
them,
they're
actually
just
all
amazing
webinars
to
review,
and
so
we
are
now
in
the
process
of
creating
the
open
data
portal
for
students
to
post
to
this
summer
and
we
create
a
an
event.
B
Thank
you
so
much
yeah.
I
think
that
you
know
we're
really
really
excited
about
your
pro
your
program
in
general
and
this
resource
as
part
of
the
reason
why
we
thought
we
could
be
very
useful
to
others
as
they
go
through
a
similar
process
of
trying
to
look
at
open
portals
and
other
things.
So
are
there
any
questions
from
the
group
and
then
I'll
have
a
question
too.
C
Sure,
thank
you
so
much
super
interesting.
So
are
you
so.
C
Oh,
I'm
sorry,
sorry,
I'm
kathryn
kramer
and
I
know
renata
from
when
I
was
working
on
data
science
for
all
with
the
northeast
big
data
hub.
I'm
not
I'm
still.
I
we
actually
got
a
seed
grant
from
them
on
to
work
on
expanding
a
data
jam
project
at
the
university
of
pittsburgh.
I
can
talk
about
that
later,
but
that's
been
super
exciting,
but
I'm
also
now
working
with
the
west
hub.
C
I
work
remotely
with
the
super
computer
center
at
ucsd
and
I
guess
my
question.
So
are
you
still
accepting
projects
for
non-profits
that
need
help.
A
C
Okay,
so
an
organization
in
massachusetts
called
amplified
poc,
which
is
about
supporting
black
owned
businesses.
They
are
really
I'm
actually
just
putting
out
a
call
to
everybody.
If
anybody
knows
anybody
in
massachusetts
doing
I've
been
trying
to
find
either
a
group
of
students
or
a
project
like
yours
that
could
help
them
sort
of
get
more
data
literate
so
and
they're
they're
based
in
massachusetts.
So
if
anybody
has
any
thoughts,
I
think
amherst.
A
Has
a
good-
and
I
would
maybe
look
at
all
of
the
recipients
of
the
hdr.
C
C
Okay,
great
great
thought
and
then
simon
my
other
question.
I
saw
library,
science
yeah.
So
what
could
you
say
just
a
little
bit
about
what
that
work
is
or
how
they're
involved.
A
Yeah,
so
you
know
we
have
to
create
the
110
indicators
every
year.
It
comes
from
about
50,
different
sources,
and
you
do
that
every
year.
So
you
just
imagine.
The
raw
data
has
got
to
go
somewhere
every
year
and
the
whole
point
of
the
thing
is
that
you
can
go
back
and
query
that
data
and
so
we're
effectively
a
library,
a
data
library
for
all
city
data.
A
They
need
we're,
not
the
data
science,
we're
not
the
subject
matter,
experts
for
whatever
they're
doing,
but
they
need
to
link
their
data
to
some
kind
of
spatial
data
set,
looking
at
context
looking
at
whatever,
and
so,
if
you
want
to
be,
if
that's
how
your
revenue
is
generated,
you
got
to
be
really
good
at
finding
your
data,
and
so
we
actually
had
a
civic
librarian
with
us
to
think
about.
Okay.
What?
What
does
what
does
our
process
need
to
look
at
version?
Control
is
so
critical
for
us.
A
A
So
all
that
you
can
just
imagine
our
repository
at
this
point
20
years
of
doing
this,
and
it's
not
big
data
right.
It's
not
like
millions
of
records,
but
it's
lots
of
little
data
sets
over
many
many
years
so
50
times
20,
you
know
and
and
then
multiple
versions
of
each
data
set.
So
we
keep
the
raw
data
set
as
it
was.
We
don't
touch
it.
If
we
do
any
modifications
to
it,
then
it
goes
to
the
next
version,
and
you
know
on
and
on
so
50
data
sets
four
different
versions.
A
B
Thanks
sima,
this
sounds
like
a
very
interesting
project:
I'm
particularly
interested
in
the
open
data
portal
for
data
science
students.
So
how
do
you
envision?
That
would
other
universities
be
able
to
submit
data
sets,
and
if
we
do
I'm
assuming
this
is
more
than
a
repository.
So
you
are
going
to
provide
some
visualizations
and
things
inbuilt.
B
A
So
even
that
is
a
nightmare
to
try
and
get
multiple
schools
to
submit
to
the
same
portal.
So
it's
a
con,
it's
a
federated
hub,
so
each
one
of
the
schools
still
maintains
their
own
hub,
but
you
can
see
it
all
in
one
shot.
I
have
no
idea
how
we
would
go
beyond
usn.
It
was
a
feat
just
to
even
figure
this
out
to
get
to
all
of
our
schools
here
in
the
maryland
to
to
work
together,
but
maybe
that's
a
model
that
can
work
in
other
states
right.
A
B
So
would
you
be
publicizing
it
in
this
forum
once
it's
up
or
should
contact.
A
B
Page
for
this
project
on
the
south
website,
we
will
have
a
we
have
a
sequence
page
for
every
sea
grant.
So
these
types
of
links-
and
things
can
also
be
put
there-
that
are,
you,
know,
connected
to
the
seed
grant
and
we'll
put
the
newsletter
out,
but
it's
also
just
a
way
to
send
people
to
you
from
our
from
our
portals
so
that
that's
the
question
I
had
was
uma's
question
of.
If
it's
open,
it's
open
for
any
academic
that
might
want
to
use
baltimore
data.
B
As
an
example,
do
you
have
case
studies
or
something
that
could,
or
you
know,
modules
almost
that
could
be
used
in
courses?
Is
that
going
to
be.
A
A
Well,
that
was
what
I
was
going
to
quickly
mention.
You
know
they
are
case
studies,
because
the
cases
come
from
real
people
right
or
real
organizations.
So,
just
to
give
you
an
example,
these
are
what
the
data
science
core
just
at
our
university
has
been
working
on.
There's
four
universities.
A
So
you
know
if
you
are
interested
in,
let
me
let
me
give
a
good
one.
A
We
we
were
looking
at
how
many
civic
tech
organizations
there
are
in
maryland,
and
so
this
was
that
one
about
the
api
and
and
trying
to
come
up
with
the
meetup
meetups
in
this
in
the
region
like
how
do
you
do
meetup
api
access?
How
do
you
look
for?
This?
Was
a
code
on
for
twitter
of
how
do
you
look
for
certain
types
of
behaviors
of
you
know
no
masking
this
is
before
you
know.
A
We
had
the
vaccine,
so
this
code
is
going
to
keep
scraping
twitter
data
for
that
kind
of
data
and
that's
going
to
go
up
there.
We
have
scooter
data
like
where
people
are
using
scooters
in
the
city
of
baltimore,
and
so
we
have
these
are.
These
are
where
the
use
cases
are
coming,
because
the
question
was
coming
from
some
non-profit
organization
or
the
city
itself,
so
some
of
our
students
went
to
the
health
department.
A
So
I
think
it'll
be
you
know,
questions
that
students,
even
if
they're,
not
from
baltimore,
they
might
want
to
know
like
where
are
people
using
scooters
in
a
city
things
like
that,
so
they're,
they're,
manageable
questions
and
the
data
sets
are
ready
for
different
kind
of
analysis.
Right.
A
Baltimore
county,
yes,
yeah,
it's
very
confusing.
We
have
university
of
maryland
baltimore
county,
which
is
an
undergraduate
kind
of
stem-based
university,
and
then
we
have
university
of
maryland
baltimore,
which
is
a
graduate
kind
of
professional
social
work
law
that
kind
of
thing,
and
then
I'm
at
the
university
of
baltimore,
which
is
a
non-traditional
both
graduate
and
undergraduate
institution.
That's
mostly
business
and
law.
B
Will
the
methods
be
a
part
of
you
know
how
you
might
have
gotten
that
type
of
data
like
what
departments
you
would
go
to
look
at?
So
if
another
city
and
say
if
you're
in
another
city,
atlanta
or
north
carolina-
and
you
wanted
to
look
at
people
underserved,
I
saw
one
of
your
projects
was
underserved
by
transit
yeah.
What
departments
do
you
go
to.
A
Yeah
yeah,
all
of
those
blogs
we
made
those
schools,
write
everything
down.
What
date
are
you
looking
at?
What
question
are
you
trying
to
answer?
Where
did
it
come
from,
so
that
anybody
can
go
back
and
look
at
that?
But
it's
a
good
question
of
on,
like
once,
we
put
up
the
data
and
we
have
the
use
cases
and
we
create
kind
of
a
pathway
for
our
students
to
publish,
as
well
as
write
their
blogs,
which
is
a
very
critical
component
of
the
final
project.