►
From YouTube: HTM Agent Demo
Description
This video is the visual supplementary material for my thesis. It describes an NPC architecture which combines Hierarchical Temporal Memory and Temporal Difference Learning Lambda. Thesis links below.
Thesis link: https://www.dropbox.com/s/jguh4d0863y6x1r/10164132.pdf?dl=0
Discussion thread: https://discourse.numenta.org/t/htm-based-autonomous-agent/2701
A
Hi,
my
name
is
Ellie,
and
this
is
the
widget
supplementary
material.
For
my
thesis.
This
is
a
game
world
that
the
agent
navigates
and
the
purple
agent
is
the
player,
the
blue
line.
You
know
just
direction
and
I'm
controlling
it
right
now,
jumping
from
cell
centers
of
cell
center
and
if
I
go
out
of
bounds,
I
responded
around
themself
with
negative
reward.
If
I
go
to
the
portal,
I
respond
again
in
a
random
cell
with
plugs
a
reward.
So
this
is
the
problem.
A
A
A
A
So,
at
the
right
side,
you
can
see
a
pleasure
graph
of
the
agent,
the
FO
values.
You
know
positive
rewards
and
lower
values.
You
know
negative
rewards
and
the
one
below
that
you
know
the
average
award.
So
we
can
wedge
allies
the
synapses.
These
are
proximal
synapses
and
on
top
to
be
distal
synapses,
we
can
do
them
separately.
These
direct
people
will
be
open
them
all
either
the
synapses
that
lead
to
activity
in
the
architecture-
and
these
are
visualized
in
real
time.