Please use this identifier to cite or link to this item: https://repository.iimb.ac.in/handle/2074/10953
DC FieldValueLanguage
dc.contributor.authorBasu, Arnab
dc.contributor.authorBhattacharyya, Tirthankar
dc.contributor.authorBorkar, Vivek S
dc.date.accessioned2020-03-23T09:25:11Z-
dc.date.available2020-03-23T09:25:11Z-
dc.date.issued2008
dc.identifier.issn0364-765X
dc.identifier.urihttps://repository.iimb.ac.in/handle/2074/10953-
dc.description.abstractA linear function approximation-based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk-sensitive cost. Its convergence is proved using the "o.d.e. method" for stochastic approximation. The scheme is also extended to continuous state space processes.
dc.description.sponsorshipInfosys Fellowship; Universities Grants Commission, Government of India; J.C. Bose Fellowship of the Department of Science and Technology, Government of India
dc.publisherInforms
dc.subjectLearning Algorithm
dc.subjectRisk-Sensitive Cost
dc.subjectFunction Approximation
dc.subjectStochastic Approximation
dc.titleA learning algorithm for risk-sensitive cost
dc.typeJournal Article
dc.identifier.doi10.1287/moor.1080.0324
dc.pages880-898p.
dc.vol.noVol.33-
dc.issue.noIss.4-
dc.journal.nameMathematics of Operations Research
Appears in Collections:2000-2009
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.