►
Description
February 2022
Use of educational data to predict students’ performance
Thereza Padilha; Georgia Tech
A
So,
first
of
all,
thanks
for
having
me
and
it's
a
little
echo,
no
you're,
okay,
okay
and
for
this
opportunity
to
talk
and
share
what
I'm
doing
here
in
georgia,
tech.
A
So
today,
I'll
present
a
little
bit
the
work
that
I
me
I'm
doing
here
and
I
will
talk
about
use
of
educational
data
to
predict
students
performs,
I
am
a
visiting
scholar
in
georgia,
tech
until
may
this
year
and
I
will
come
back
to
brazil
and
hopefully
I
will
continue
to
meet
everybody
virtually
so
here
my
advisor
is
professor
richard
cataprong.
A
The
agenda
for
this
this
talk,
I
divided
basically
an
introduction
just
to
see
what
I'm
saying
a
contest
contextualization
and
talk
a
little
bit
so
the
education
data
mining
in
that
case
of
study
following
five
steps:
data
collection,
data,
clean
and
transformation,
feature
selection,
deep
learning
models
and
evaluation,
and
I
will
finally
a
brief
conclusion.
A
In
educational
institutions,
you
know
we
have
many
academic
and
personal
data
that
are
stored
and
you
can
explore
to
find
hidden,
hidden
or
unknowing
and
knowledge
yeah.
I
saw
one
mistake
here,
but
okay,
annoy
or
hidden
knowledge
for
this
is
only
to
predict
students,
performance,
user
education.
Data
set
is
a
topic
that
has
been
researched
for
several
years,
but
there
are
some
limitations.
A
A
A
Now
that's
available
on
the
internet,
I
think
until
since
2014
yeah
there
is
this
kind
of
data
set,
so
the
education
data
mine,
it
is
based
on
knowledge,
discovery
in
database
process.
Very
known.
Very
many
people
know
this
process,
the
fayad
and
others
defining
in
1996,
but
until
now
we
can
use
to
follow
all
the
steps
that
basically,
we
have
five
steps.
A
A
We
choose
four
feature
selection
base.
The
idea
is
to
minimize
the
amount
of
features
that
we
have
in
the
data
pass
present
all
this
data
for
the
deep
learning
models
and
see
what
we
have
and
compare
these
models.
Now.
Let
me
explain
all
these
steps
that
you
we
did
the
first
one,
the
phase,
one
data
collection,
this
data
set
it's
possible
to
find
in
this
website
is,
as
I
said,
is
from
portugal.
A
A
To
to
everybody
knows
better
how
features
that
we
have
in
this
data
set,
that
is
school
age,
address
family
size,
mother's
education,
father's
education,
free
time?
How?
How
how
much
time
the
students
have
to
free
no,
no,
no
study,
and
if
we
I
have,
for
example,
if
they
drink
during
the
week,
drink
just
weekends.
A
So
there
is
33
fixtures
in
this
data
set
the
number
of
of
the
absences
and
for
the
records
academic
records.
Basically,
we
have
about
the
grades.
We
have
the
grade
first
period
grade
and
the
second
grade
for
the
phase
2.
Basically,
data
and
cleaning
transformation.
A
A
The
number
of
possible
values
like
g3
is
a
target
value
target
feature
that
means
pass
or
fail
in
the
in
the
class
0
to
9
and
10
to
20,
because
we
saw
in
understand
how
is
the
portuguese
system
to
pass
one
year
to
another,
zero
to
nine
means
that
failed
and
10
to
20
or
of
average
or
above
average,
but
anyway
he
can
pass
and
move
move
to
the
next
great,
the
phase
three
we
had.
A
This
is
the
features
that
it
selected
for
each
algorithm.
Some
I
put
in
bold
as
bold
because
they
are
exactly
the
same.
Like
the
two
grades
and
the
amount
of
failures.
A
A
A
A
In
the
next
slide
is
basically
to
see
how
the
accuracy
of
the
boruto
algorithm
and
over
the
epochs
as
soon
as
we
can
see
the
examples
he
will
improve
over
the
time.
A
And
for
the
conclusion,
for
this
talk
right
to
predict
students
performance,
you
have
investigate
the
use
of
feature
selection,
algorithms,
cho
beta
model
with
more
accuracy
than
others.
Would
you
like
to
see
if
it's
possible
to
use
legs
features
minimize
the
amount
of
data
that
you
need
to
show
for
for
the
neural
network,
to
try
to
classify
if
these
students
is
possible
to
fight
to
fail?
There
is
a
high
chance
to
fail
in
the
future
or
not,
and
in
this
case
the
study
de
boruto
helped
us
to
find
and
to
classify
these
students.
A
So
I'm
looking
for
for
other
projects
that
I
can
collaborate
in
the
future
and
I
my
idea
just
bring
and
share
with
you
what
I'm
doing.