Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs

Della Vecchia, Eugenio Martín; Di Marco, Silvia Cristina; Jean-marie, Alain

doi:10.1007/s10479-012-1070-0

Mostrar el registro sencillo del ítem

dc.contributor.author

Della Vecchia, Eugenio Martín Se ha confirmado la validez de este valor de autoridad por un usuario

dc.contributor.author

Di Marco, Silvia Cristina Se ha confirmado la validez de este valor de autoridad por un usuario

dc.contributor.author

Jean-marie, Alain Se ha confirmado la validez de este valor de autoridad por un usuario

dc.date.available

2023-03-07T12:15:15Z

dc.date.issued

2012-02

dc.identifier.citation

Della Vecchia, Eugenio Martín; Di Marco, Silvia Cristina; Jean-marie, Alain; Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs; Springer; Annals Of Operations Research; 199; 1; 2-2012; 193-214

dc.identifier.issn

0254-5330

dc.identifier.uri

http://hdl.handle.net/11336/189791

dc.description.abstract

This paper is concerned with the links between the Value Iteration algorithm and the Rolling Horizon procedure, for solving problems of stochastic optimal control under the long-run average criterion, in Markov Decision Processes with finite state and action spaces. We review conditions of the literature which imply the geometric convergence of Value Iteration to the optimal value. Aperiodicity is an essential prerequisite for convergence. We prove that the convergence of Value Iteration generally implies that of Rolling Horizon. We also present a modified Rolling Horizon procedure that can be applied to models without analyzing periodicity, and discuss the impact of this transformation on convergence. We illustrate with numerous examples the different results. Finally, we discuss rules for stopping Value Iteration or finding the length of a Rolling Horizon. We provide an example which demonstrates the difficulty of the question, disproving in particular a conjectured rule proposed by Puterman.

dc.format

application/pdf

dc.language.iso

eng

dc.publisher

Springer

dc.rights

info:eu-repo/semantics/restrictedAccess

dc.rights.uri

https://creativecommons.org/licenses/by-nc-sa/2.5/ar/

dc.subject

HEURISTIC METHODS

dc.subject

MARKOV DECISION PROBLEMS

dc.subject

ROLLING HORIZON

dc.subject

VALUE ITERATION

dc.subject.classification

Matemática Aplicada Se ha confirmado la validez de este valor de autoridad por un usuario

dc.subject.classification

Matemáticas Se ha confirmado la validez de este valor de autoridad por un usuario

dc.subject.classification

CIENCIAS NATURALES Y EXACTAS Se ha confirmado la validez de este valor de autoridad por un usuario

dc.title

Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs

dc.type

info:eu-repo/semantics/article

dc.type

info:ar-repo/semantics/artículo

dc.type

info:eu-repo/semantics/publishedVersion

dc.date.updated

2023-03-07T11:25:48Z

dc.journal.volume

199

dc.journal.number

1

dc.journal.pagination

193-214

dc.journal.pais

Alemania

dc.journal.ciudad

Berlin

dc.description.fil

Fil: Della Vecchia, Eugenio Martín. Universidad Nacional de Rosario; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario; Argentina

dc.description.fil

Fil: Di Marco, Silvia Cristina. Universidad Nacional de Rosario; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Rosario; Argentina

dc.description.fil

Fil: Jean-marie, Alain. No especifíca;

dc.journal.title

Annals Of Operations Research Se ha confirmado la validez de este valor de autoridad por un usuario

dc.relation.alternativeid

info:eu-repo/semantics/altIdentifier/url/https://link.springer.com/article/10.1007/s10479-012-1070-0

dc.relation.alternativeid

info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.1007/s10479-012-1070-0

Archivos asociados

Tamaño: 731.2Kb

Formato: PDF

Solicitar