Partially Observable Markov Decision Process with Lagged Information over Infinite Horizon

Jeong, Byong-Ho;Kim, Soung-Hie;

Journal of the Korean Operations Research and Management Science Society (한국경영과학회지)

Volume 16 Issue 1
/
Pages.135-146
/
1991
/
1225-1119(pISSN)
/
2733-4759(eISSN)

The Korean Operations Research and Management Science Society (한국경영과학회)

Partially Observable Markov Decision Process with Lagged Information over Infinite Horizon

Jeong, Byong-Ho (Chonbuk National University) ;
Kim, Soung-Hie (Korea Advanced Institute of Science and Technolopgy)

Published : 1991.06.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This paper shows the infinite horizon model of Partially Observable Markov Decision Process with lagged information. The lagged information is uncertain delayed observation of the process under control. Even though the optimal policy of the model exists, finding the optimal policy is very time consuming. Thus, the aim of this study is to find an .eplison.-optimal stationary policy minimizing the expected discounted total cost of the model. .EPSILON.- optimal policy is found by using a modified version of the well known policy iteration algorithm. The modification focuses to the value determination routine of the algorithm. Some properties of the approximation functions for the expected discounted cost of a stationary policy are presented. The expected discounted cost of a stationary policy is approximated based on these properties. A numerical example is also shown.

Keywords

References

Oper. Res. v.27 Structural Results for Partially Observable Markov Decision Processes Albright,S.C.
Ann. Math. Stat. v.36 Discounted Dynamic Programming Blackwell,D.
SIAM Rev. v.9 Contraction Mappings in the Theory Underlying Dynamic Programming Denardo,E.V.
J. Opl. Res. Soc. v.38 A Partially Obsevable Markov Decision Process with Lagged Information Kim,S.H.;Jeong,B.H.
Unpublished Dissertation, Korea Advanced Institute of Science and Technology Use of Lagged Information in partially Observable Markov Decision Process Jeong,B.H.
SIAM J. Con. & Opt. v.18 Optimal Infinite-Horizon Undiscounted Control of Finite Probabilistic Systems Platzman,L.K.
Recent Developments in Markov Decision Prcoesses Piecewise Linear Markov Decision Process with an aplication into Patially Observable Models Sawaki,K.;R.Hartley(et al.)(ed.)
J. Math. Anal. & Appl. v.91 Transformation of Partially Observable Markov Decision Processes into Piecewise Linear Ones Sawaki,K.
Oper. Res. v.26 The Optimal Control of Partially Observable Markov Process over the Infinite Horizon; Discounted Costs Sondik,E.J.
Euro. J. Oper. Res. v.21 A Markov Decision Algorithm for Optmal Inspections and Revisions in a Maintenance System with Partial Information Tijms,H.C.;F.A.van der Duyn Schouten
J. Opl. Res. Sov. v.39 Note on 'A Partially Observable Markov Decision Process with Lagged Information' White,C.C.
J. Math. Anal. & Appl. v.72 Finite State Approximations for Denumerable State Infinite Horizon Contractd Markov Decision Processes: The Policy Space Method White,D.J.

Journal of the Korean Operations Research and Management Science Society (한국경영과학회지)

Partially Observable Markov Decision Process with Lagged Information over Infinite Horizon

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)