Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems
Informal Systems Seminar ISS, Centre for Intelligent Machines (CIM) and Groupe d'Etudes et de Recherche en Analyse des Decisions (GERAD)
Speaker: Borna Sayedana
²Ñ±ð±ð³Ù¾±²Ô²µÌý±õ¶Ù:Ìý845Ìý1388Ìý1004ÌýÌýÌýÌýÌýÌýÌý
±Ê²¹²õ²õ³¦´Ç»å±ð:Ìý³Õ±õ³§³§
** Note that this is a hybrid event.
** Note that his event does not take place at the usual time of ISS events
Abstract: In this talk, we consider the learning and control problem for unknown Markov jump linear systems (MJLS) with perfect state observations. We propose a certainty equivalence-based learning algorithm and show that this algorithm achieves a regret of O(\sqrt{T} log(T)) relative to a certain subset of the sample space. As part of our analysis, we propose a switched least squares method for the identification of MJLS, show that this method is strongly consistent, and derive data-dependent and data-independent rates of convergence. These results show that certainty equivalence control along with the switched least squares method for MJLS has the same rate of convergence as the certainty equivalence control method for linear systems.
Affiliation: Borna Sayedana is a PhD candidate in the Department of Electrical and Computer Engineering, Ï㽶ÊÓƵ.