Pascal and Francis Bibliographic Databases

Search results

Your search

kw.*:("Markov decision")

Results 1 to 25 of 1801

A Modified Memory-Based Reinforcement Learning Method for Solving POMDP Problems. LEI ZHENG; CHO, Siu-Yeung. Neural processing letters. 2011, Vol 33, Num 2, pp 187-200, issn 1370-4621, 14 p. Article

Robustness of policies in constrained Markov decision processes. ZADOROJNIY, Alexander; SHWARTZ, Adam. IEEE transactions on automatic control. 2006, Vol 51, Num 4, pp 635-638, issn 0018-9286, 4 p. Article

Minimizing risk models in stochastic shortest path problems. OHTSUBO, Yoshio. Mathematical methods of operations research (Heidelberg). 2003, Vol 57, Num 1, pp 79-88, issn 1432-2994, 10 p. Article

Functional characterization for average cost Markov decision processes with Doeblin's conditions. KURANO, M. Computers & mathematics with applications (1987). 1991, Vol 21, Num 11-12, pp 57-63, issn 0898-1221. Conference Paper

Index policies for the maintenance of a collection of machines by a set of repairmen. GLAZEBROOK, K. D; MITCHELL, H. M; ANSELL, P. S et al. European journal of operational research. 2005, Vol 165, Num 1, pp 267-284, issn 0377-2217, 18 p. Article

Decentralized MDPs with sparse interactions. MELO, Francisco S; VELOSO, Manuela. Artificial intelligence (General ed.). 2011, Vol 175, Num 11, pp 1757-1789, issn 0004-3702, 33 p. Article

Maximizing a new quantity in sequential reserve selection. SCHAPAUGH, Adam W; TYRE, Andrew J. Environmental conservation. 2014, Vol 41, Num 2, pp 198-205, issn 0376-8929, 8 p. Article

On Bellman's principle with inequality constraints. CHONG, Edwin K. P; MILLER, Scott A; ADASKA, Jason et al. Operations research letters. 2012, Vol 40, Num 2, pp 108-113, issn 0167-6377, 6 p. Article

Partially observable Markov decision processes with reward information: Basic ideas and models. CAO, Xi-Ren; XIANPING GUO. IEEE transactions on automatic control. 2007, Vol 52, Num 4, pp 677-681, issn 0018-9286, 5 p. Article

Linear dependence of stationary distributions in ergodic Markov decision processes. ORTNER, Ronald. Operations research letters. 2007, Vol 35, Num 5, pp 619-626, issn 0167-6377, 8 p. Article

Repair strategies in an uncertain environment: Markov decision process approach. KIM, Y.-H; THOMAS, L. C. The Journal of the Operational Research Society. 2006, Vol 57, Num 8, pp 957-964, issn 0160-5682, 8 p. Article

An Alternative LP Formulation of the Admission Control Problem in Multiclass Networks. PIETRABISSA, Antonio. IEEE transactions on automatic control. 2008, Vol 53, Num 3, pp 839-845, issn 0018-9286, 7 p. Article

Incremental value iteration for time-aggregated Markov-decision processes. TAO SUN; QIANCHUAN ZHAO; LUH, Peter B et al. IEEE transactions on automatic control. 2007, Vol 52, Num 11, pp 2177-2182, issn 0018-9286, 6 p. Article

A note on the hypercube model. KATEHAKIS, M. N. Operations research letters. 1985, Vol 3, Num 6, pp 319-322, issn 0167-6377. Article

Nonstationary value-iteration and adaptive control of discounted semi-Markov processes. HERNANDEZ-LERMA, O. Journal of mathematical analysis and applications. 1985, Vol 112, Num 2, pp 435-445, issn 0022-247X. Article

Maximal mean/standard deviation ratio in an undiscounted MDP. SOBEL, M. J. Operations research letters. 1985, Vol 4, Num 4, pp 157-159, issn 0167-6377. Article

Monotone value iteration for discounted finite Markov decision processes. WHITE, D. J. Journal of mathematical analysis and applications. 1985, Vol 109, Num 2, pp 311-324, issn 0022-247X. Article

On the existence of relative values for undiscounted Markovian decision processes with a scalar gain rate. SCHWEITZER, P. J. Journal of mathematical analysis and applications. 1984, Vol 104, Num 1, pp 67-78, issn 0022-247X. Article

On the existence of relative values for undiscounted multichain Markov decision processes. SCHWEITZER, P. J. Journal of mathematical analysis and applications. 1984, Vol 102, Num 2, pp 449-455, issn 0022-247X. Article

Truncated policy iteration methods. DEMBO, R. S; HAVIV, M. Operations research letters. 1984, Vol 3, Num 5, pp 243-246, issn 0167-6377. Article

Décision et planification dans l'incertain. CHARPILLET, F; GARCIA, F; PERNY, Patrice et al. Revue d'intelligence artificielle. 2006, Vol 20, Num 2-3, issn 0992-499X, 304 p. Conference Proceedings

Finite-state approximations for denumerable multidimensional state discounted Markov decision processes. HERNANDEZ-LERMA, O. Journal of mathematical analysis and applications. 1986, Vol 113, Num 2, pp 382-389, issn 0022-247X. Article

Markov decision processes with a Borel measurable cost function: the average case. KURANO, M. Mathematics of operations research. 1986, Vol 11, Num 2, pp 309-320, issn 0364-765X. Article

Block-successive approximation for a discounted Markov decision model. HAVIV, M. Stochastic processes and their applications. 1985, Vol 19, Num 1, pp 151-160, issn 0304-4149. Article

State information lag Markov decision process with control limit rule. KIM, S. H. Naval research logistics quarterly. 1985, Vol 32, Num 3, pp 491-496, issn 0028-1441. Article
