Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses

Alexander Nazin, Boris Miller

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

We consider the adaptative stochastic problem for a system described by a controlled Markov Chain (CMC) with a finite number of states. The novelty of the approach consists in the adaptation technique for optimization of the system with unknown distribution of the cost function.
Original languageEnglish
Title of host publicationProceedings of the 18th IFAC World Congress
EditorsSergio Bittani, Angelo Cedenese, Sandro Zampieri
Place of PublicationUnited Kingdom
PublisherElsevier
Pages12421 - 12426
Number of pages6
Volume18
ISBN (Print)9783902661937
DOIs
Publication statusPublished - 2011
EventInternational Federation of Automatic Control World Congress 2011 - Università Cattolica del Sacro Cuore, Milano, Italy
Duration: 28 Aug 20112 Sep 2011
Conference number: 18th
https://www.ifac2011.org/

Conference

ConferenceInternational Federation of Automatic Control World Congress 2011
Abbreviated titleIFAC 2011
CountryItaly
CityMilano
Period28/08/112/09/11
Internet address

Cite this

Nazin, A., & Miller, B. (2011). Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses. In S. Bittani, A. Cedenese, & S. Zampieri (Eds.), Proceedings of the 18th IFAC World Congress (Vol. 18, pp. 12421 - 12426). United Kingdom: Elsevier. https://doi.org/10.3182/20110828-6-IT-1002.03450
Nazin, Alexander ; Miller, Boris. / Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses. Proceedings of the 18th IFAC World Congress. editor / Sergio Bittani ; Angelo Cedenese ; Sandro Zampieri. Vol. 18 United Kingdom : Elsevier, 2011. pp. 12421 - 12426
@inproceedings{7e40fada120d4d8781ce1bf40c336f0c,
title = "Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses",
abstract = "We consider the adaptative stochastic problem for a system described by a controlled Markov Chain (CMC) with a finite number of states. The novelty of the approach consists in the adaptation technique for optimization of the system with unknown distribution of the cost function.",
author = "Alexander Nazin and Boris Miller",
year = "2011",
doi = "10.3182/20110828-6-IT-1002.03450",
language = "English",
isbn = "9783902661937",
volume = "18",
pages = "12421 -- 12426",
editor = "Sergio Bittani and Angelo Cedenese and Sandro Zampieri",
booktitle = "Proceedings of the 18th IFAC World Congress",
publisher = "Elsevier",
address = "Netherlands",

}

Nazin, A & Miller, B 2011, Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses. in S Bittani, A Cedenese & S Zampieri (eds), Proceedings of the 18th IFAC World Congress. vol. 18, Elsevier, United Kingdom, pp. 12421 - 12426, International Federation of Automatic Control World Congress 2011, Milano, Italy, 28/08/11. https://doi.org/10.3182/20110828-6-IT-1002.03450

Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses. / Nazin, Alexander; Miller, Boris.

Proceedings of the 18th IFAC World Congress. ed. / Sergio Bittani; Angelo Cedenese; Sandro Zampieri. Vol. 18 United Kingdom : Elsevier, 2011. p. 12421 - 12426.

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

TY - GEN

T1 - Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses

AU - Nazin, Alexander

AU - Miller, Boris

PY - 2011

Y1 - 2011

N2 - We consider the adaptative stochastic problem for a system described by a controlled Markov Chain (CMC) with a finite number of states. The novelty of the approach consists in the adaptation technique for optimization of the system with unknown distribution of the cost function.

AB - We consider the adaptative stochastic problem for a system described by a controlled Markov Chain (CMC) with a finite number of states. The novelty of the approach consists in the adaptation technique for optimization of the system with unknown distribution of the cost function.

UR - http://www.ifac-papersonline.net/Detailed/51551.html

U2 - 10.3182/20110828-6-IT-1002.03450

DO - 10.3182/20110828-6-IT-1002.03450

M3 - Conference Paper

SN - 9783902661937

VL - 18

SP - 12421

EP - 12426

BT - Proceedings of the 18th IFAC World Congress

A2 - Bittani, Sergio

A2 - Cedenese, Angelo

A2 - Zampieri, Sandro

PB - Elsevier

CY - United Kingdom

ER -

Nazin A, Miller B. Online mirror descent algorithm for controlled homogeneous finite Markov chains with unknown mean losses. In Bittani S, Cedenese A, Zampieri S, editors, Proceedings of the 18th IFAC World Congress. Vol. 18. United Kingdom: Elsevier. 2011. p. 12421 - 12426 https://doi.org/10.3182/20110828-6-IT-1002.03450