Qualia
0.2
|
#include <BatchRLTrainer.h>
Public Member Functions | |
BatchRLTrainer (QFunction *qFunction, int maxExamples, float gamma, int maxTrainingIterationsPerEpisode=-1) | |
virtual | ~BatchRLTrainer () |
virtual void | init () |
virtual void | _doTrainEpisode (DataSet *data) |
This method should be overriden by subclasses to train a single episode over the dataset. More... | |
![]() | |
DataSetTrainer (Function *function) | |
Constructor. More... | |
virtual | ~DataSetTrainer () |
virtual void | train (DataSet *data, int maxEpisodes=10) |
virtual void | trainEpisode (DataSet *data) |
Trains one episode over the dataset. More... | |
virtual bool | stop () |
![]() | |
Trainer (Function *function) | |
Constructor. More... | |
virtual | ~Trainer () |
int | nEpisodes () const |
Public Attributes | |
real * | targets |
int | maxExamples |
float | gamma |
int | maxTrainingIterationsPerEpisode |
RLObservation | _lastObservation |
Action | _lastOrNextAction |
RLObservation | _observation |
This class implements the Batch reinforcement learning algorithm. It trains a QFunction on a DataSet containing (s, a, r, s') tuples.
Reference: S. Lange, T. Gabel and M. Riedmiller. Batch Reinforcement Learning. In Wiering, M. and van Otterlo, M., editor, Reinforcement Learning: State of the Art, Springer, in press, 2011
BatchRLTrainer::BatchRLTrainer | ( | QFunction * | qFunction, |
int | maxExamples, | ||
float | gamma, | ||
int | maxTrainingIterationsPerEpisode = -1 |
||
) |
|
virtual |
|
virtual |
This method should be overriden by subclasses to train a single episode over the dataset.
Implements DataSetTrainer.
|
virtual |
Reimplemented from Trainer.
RLObservation BatchRLTrainer::_lastObservation |
Action BatchRLTrainer::_lastOrNextAction |
RLObservation BatchRLTrainer::_observation |
float BatchRLTrainer::gamma |
int BatchRLTrainer::maxExamples |
int BatchRLTrainer::maxTrainingIterationsPerEpisode |
real* BatchRLTrainer::targets |