Loading…

Markov control models with unknown random state–action-dependent discount factors

The paper deals with a class of discounted discrete-time Markov control models with non-constant discount factors of the form α ~ ( x n , a n , ξ n + 1 ) , where x n , a n , and ξ n + 1 are the state, the action, and a random disturbance at time n , respectively, taking values in Borel spaces. Assum...

Full description

Saved in:

Bibliographic Details
Published in:	TOP 2015-10, Vol.23 (3), p.743-772
Main Author:	Minjárez-Sosa, J. Adolfo
Format:	Article
Language:	English
Subjects:	Business and Management Economic Theory/Quantitative Economics/Mathematical Methods Economics Finance Industrial and Production Engineering Insurance Management Operations Research/Decision Theory Optimization Original Paper Statistics for Business
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	The paper deals with a class of discounted discrete-time Markov control models with non-constant discount factors of the form α ~ ( x n , a n , ξ n + 1 ) , where x n , a n , and ξ n + 1 are the state, the action, and a random disturbance at time n , respectively, taking values in Borel spaces. Assuming that the one-stage cost is possibly unbounded and that the distributions of ξ n are unknown, we study the corresponding optimal control problem under two settings. Firstly we assume that the random disturbance process ξ n is formed by observable independent and identically distributed random variables, and then we introduce an estimation and control procedure to construct strategies. Instead, in the second one, ξ n is assumed to be non-observable whose distributions may change from stage to stage, and in this case the problem is studied as a minimax control problem in which the controller has an opponent selecting the distribution of the corresponding random disturbance at each stage.
ISSN:	1134-5764 1863-8279
DOI:	10.1007/s11750-015-0360-5