1 result.

Reinforcement Learning with History Lists

Timmer, Stephan

A very general framework for modeling uncertainty in learning environments is given by partially observable Markov decision processes (POMDPs). In a POMDP setting, the learning agent infers a policy for acting optimally in all possible states of the environment, while receiving only observations of these states. The basic idea for coping with partial observability is to include memory into the ...

91,00 CHF
