Abstract
This paper deals with the so-called limiting average criteria for nonstationary Markov decision processes with (possibly unbounded) rewards and Borel state space. A new set of conditions is provided, under which the existence of both a solution to the optimality equations and the limiting average e(= 0)-optimal Markov policies is derived. Also, a rolling horizon algorithm for computing limiting average e(andgt; 0)-optimal Markov policies is developed. Furthermore, the results in this paper are illustrated by several examples such as the water regulation problem.
| Original language | English |
|---|---|
| Pages (from-to) | 1037 - 1053 |
| Number of pages | 16 |
| Journal | SIAM Journal of Control and Optimization |
| Volume | 11 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - 30 Jun 2001 |
Keywords
- nonstationary markov decision processes
- limiting average criteria
- optimality equations
- rolling horizon algorithm