Idiap Research Institute
Estimates of Parameter Distributions for Optimal Action Selection
Type of publication: Idiap-RR
Citation: dimitrak-bengio_04-72
Number: Idiap-RR-72-2004
Year: 2004
Institution: IDIAP
Abstract: We present a general method for maintaining estimates of the distribution of parameters in arbitrary models. The method is then applied to estimating a probability distribution over actions in value-based reinforcement learning. While this approach is similar to other techniques that maintain a confidence measure for action-values, it nevertheless offers new insight into current techniques and reveals potential avenues for further research.
Projects: Idiap
Authors: Dimitrakakis, Christos; Bengio, Samy
Attachments
  • rr-04-72.pdf
  • rr-04-72.ps.gz