The theory of Markov Decision Processes - also known under several other names including sequential stochastic optimization, discrete-time stochastic control, and stochastic dynamic programming - studies sequential optimization of discrete time stochastic systems. Fundamentally, this is a methodology that examines and analyzes a discrete-time stochastic system whose transition mechanism can be controlled over time. Each control policy defines the stochastic process and values of objective functions associated with this process. Its objective is to select a "good" control policy. In real life, decisions that humans and computers make on all levels usually have two types of impacts: (i) they cost or save time, money, or other resources, or they bring revenues, as well as (ii) they have an impact on the future, by influencing the dynamics. In many situations, decisions with the largest immediate profit may not be good in view of future events. Markov Decision Processes (MDPs) model this paradigm and provide results on the structure and existence of good policies and on methods for their calculations. MDPs are attractive to many researchers because they are important both from the practical and the intellectual points of view. MDPs provide tools for the solution of important real-life problems. In particular, many business and engineering applications use MDP models. Analysis of various problems arising in MDPs leads to a large variety of interesting mathematical and computational problems. Accordingly, the Handbook of Markov Decision Processes is split into three parts: Part I deals with models with finite state and action spaces and Part II deals with infinite state problems, and Part III examines specific applications. Individual chapters are written by leading experts on the subject.
1. Introduction; E.A. Feinberg, A. Shwartz. Part I: Finite State and Action Models. 2. Finite State and Action MDPs; L. Kallenberg. 3. Bias Optimality; M.E. Lewis, M.L. Puterman. 4. Singular Perturbations of Markov Chains and Decision Processes; K.E. Avrachenkov, et al. Part II: Infinite State Models. 5. Average Reward Optimization Theory for Denumerable State Spaces; L.I. Sennott. 6. Total Reward Criteria; E.A. Feinberg. 7. Mixed Criteria; E.A. Feinberg, A. Shwartz. 8. Blackwell Optimality; A. Hordijk, A.A. Yushkevich. 9. The Poisson Equation for Countable Markov Chains: Probabilistic Methods and Interpretations; A.M. Makowski, A. Shwartz. 10. Stability, Performance Evaluation, and Optimization; S.P. Meyn. 11. Convex Analytic Methods in Markov Decision Processes; V.S. Borkar. 12. The Linear Programming Approach; O. Hernández-Lerma, J.B. Lasserre. 13. Invariant Gambling Problems and Markov Decision Processes; L.E. Dubins, et al. Part III: Applications. 14. Neuro-Dynamic Programming: Overview and Recent Trends; B. Van Roy. 15. Markov Decision Processes in Finance and Dynamic Options; M. Schäl. 16. Applications of Markov Decision Processes in Communication Networks; E. Altman. 17. Water Reservoir Applications of Markov Decision Processes; B.F. Lamond, A. Boukhtouta. Index.
Series: International Series in Operations Research & Management Science
Number Of Pages: 565
Published: 30th September 2001
Country of Publication: NL
Dimensions (cm): 23.5 x 15.5
Weight (kg): 2.17