Estimates of Parameter Distributions for Optimal Action Selection
| Type of publication: | Idiap-RR |
| Citation: | dimitrak-bengio_04-72 |
| Number: | Idiap-RR-72-2004 |
| Year: | 2004 |
| Institution: | IDIAP |
| Abstract: | We present a general method for maintaining estimates of the distribution of parameters in arbitrary models. This is then applied to the estimation of probability distribution over actions in value-based reinforcement learning. While this approach is similar to other techniques that maintain a confidence measure for action-values, it nevertheless offers a new insight into current techniques and reveals potential avenues of further research. |
| Userfields: | ipdmembership={learning}, |
| Keywords: | |
| Projects: |
Idiap |
| Authors: | |
| Added by: | [UNK] |
| Total mark: | 0 |
|
Attachments
|
|
|
Notes
|
|
|
|
|