Finite-and infinite-horizon shapley games with nonsymmetric partial observation

Basu, Arnab; Stettner, Lukasz

Welcome to the Research @IIMB

The Digital Institutional Repository of IIM Bangalore

This repository provides metadata of IIMB Publications and aimed at creating and preserving an archive of Institution scholarship. IIMB Publications include Articles, Working Papers (FULL TEXT), Book Chapters published by Faculty, Doctoral Dissertations by FPM Scholars and Project reports of Students enrolled in various courses of IIMB.

Learn More

Please use this identifier to cite or link to this item: https://repository.iimb.ac.in/handle/2074/11430

DC Field	Value	Language
dc.contributor.author	Basu, Arnab	-
dc.contributor.author	Stettner, Lukasz	-
dc.date.accessioned	2020-04-06T13:21:12Z	-
dc.date.available	2020-04-06T13:21:12Z	-
dc.date.issued	2015	-
dc.identifier.issn	0363-0129	-
dc.identifier.uri	https://repository.iimb.ac.in/handle/2074/11430	-
dc.description.abstract	We consider asymmetric partially observed Shapley-type finite-horizon and infinite-horizon games where the state, a controlled Markov chain {Xt}, is not observable to one player (minimizer) who observes only a state-dependent signal {Yt}. The maximizer observes both. The minimizer is informed of the maximizer's action after (before) choosing his control in the MINMAX (MAXMIN) game. A nontrivial open problem in such situations is how the minimizer can use this knowledge to update his belief about {Xt}. To address this, the maximizer uses off-line control functions which are known to the minimizer. Using these, novel control-parameterized nonlinear filters are constructed which are proved to characterize the conditional distribution of the full path of {Xt} . Using these filters, recursive algorithms are developed which show that saddle-points exist in both behavioral and Markov strategies for the finite-horizon case in both games. These algorithms are extended to prove saddle-points in Markov strategies for both games for the infinite-horizon case. A counterexample shows that the finite-horizon MINMAX value may be greater than the MAXMIN value. We show that the asymptotic limits of these values converge to the corresponding MINMAX and MAXMIN saddle-point values in the infinite-horizon setup. Another counterexample shows that the uniform value need not exist.	-
dc.publisher	Society For Industrial and Applied Mathematics Publications	-
dc.subject	Dynamic Programming Algorithms	-
dc.subject	Finite-Horizon and Infinite-Horizon Discounted Cost	-
dc.subject	Nonsymmetric Partially Observed Game	-
dc.subject	Parameterized Filtering	-
dc.subject	Zero-Sum Stochastic Game	-
dc.title	Finite-and infinite-horizon shapley games with nonsymmetric partial observation	-
dc.type	Journal Article	-
dc.identifier.doi	10.1137/141000336	-
dc.pages	3584-3619p.	-
dc.vol.no	Vol.53	-
dc.issue.no	Iss.6	-
dc.journal.name	SIAM Journal On Control and Optimization	-
Appears in Collections:	2010-2019

Show simple item record

Google Scholar^TM

Check

Welcome to the Research @IIMB

Google ScholarTM

Altmetric

Google Scholar^TM