Please use this identifier to cite or link to this item:
https://repository.iimb.ac.in/handle/2074/11430
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Basu, Arnab | |
dc.contributor.author | Stettner, Lukasz | |
dc.date.accessioned | 2020-04-06T13:21:12Z | - |
dc.date.available | 2020-04-06T13:21:12Z | - |
dc.date.issued | 2015 | |
dc.identifier.issn | 0363-0129 | |
dc.identifier.uri | https://repository.iimb.ac.in/handle/2074/11430 | - |
dc.description.abstract | We consider asymmetric partially observed Shapley-type finite-horizon and infinite-horizon games where the state, a controlled Markov chain $\{X_t\}$, is not observable to one player (minimizer) who observes only a state-dependent signal $\{Y_t\}$. The maximizer observes both. The minimizer is informed of the maximizer's action after (before) choosing his control in the MINMAX (MAXMIN) game. A nontrivial open problem in such situations is how the minimizer can use this knowledge to update his belief about $\{X_t\}$. To address this, the maximizer uses off-line control functions which are known to the minimizer. Using these, novel control-parameterized nonlinear filters are constructed which are proved to characterize the conditional distribution of the full path of $\{X_t\}$. Using these filters, recursive algorithms are developed which show that saddle-points exist in both behavioral and Markov strategies for the finite-horizon case in both games. These algorithms are extended to prove saddle-points in Markov strategies for both games for the infinite-horizon case. A counterexample shows that the finite-horizon MINMAX value may be greater than the MAXMIN value. We show that the asymptotic limits of these values converge to the corresponding MINMAX and MAXMIN saddle-point values in the infinite-horizon setup. Another counterexample shows that the uniform value need not exist. Read More: https://epubs.siam.org/doi/10.1137/141000336 | |
dc.publisher | Society For Industrial and Applied Mathematics Publications | |
dc.subject | Dynamic Programming Algorithms | |
dc.subject | Finite-Horizon and Infinite-Horizon Discounted Cost | |
dc.subject | Nonsymmetric Partially Observed Game | |
dc.subject | Parameterized Filtering | |
dc.subject | Zero-Sum Stochastic Game | |
dc.title | Finite-and infinite-horizon shapley games with nonsymmetric partial observation | |
dc.type | Journal Article | |
dc.identifier.doi | 10.1137/141000336 | |
dc.pages | 3584-3619p. | |
dc.vol.no | Vol.53 | - |
dc.issue.no | Iss.6 | - |
dc.journal.name | SIAM Journal On Control and Optimization | |
Appears in Collections: | 2010-2019 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.