在网络控制系统中学习

论文标题

在网络控制系统中学习

Learning in Networked Control Systems

论文作者

Singh, Rahul, Kumar, P. R.

论文摘要

我们为网络控制系统（NCS）设计自适应控制器（学习规则），其中包含控制信息的数据包在有损的无线通道上传输。 We propose Upper Confidence Bounds for Networked Control Systems (UCB-NCS), a learning rule that maintains confidence intervals for the estimates of plant parameters $(A_{(\star)},B_{(\star)})$, and channel reliability $p_{(\star)}$, and utilizes the principle of optimism in the face of uncertainty while making control decisions.我们通过分析$（a _ {（\ star）}，b _ {（\ star）}，p _ {（\ star）}）$时，为UCB-NC提供了非反应性绩效保证，即从场景中的性能差距。我们表明，遗憾的可能性很高，可以将遗憾作为$ \ tilde {o} \ left（c \ sqrt {t} \ right）$ \ footNote {在这里$ \ tilde {o} $ hides boogarithmic factor。}，其中$ t $ t $是$ c $ c $ co $ ristions and comptiment and Is comptiment and Is Is Is Is Is Is Is Is Is Is Is Is Is Is Issive。

We design adaptive controller (learning rule) for a networked control system (NCS) in which data packets containing control information are transmitted across a lossy wireless channel. We propose Upper Confidence Bounds for Networked Control Systems (UCB-NCS), a learning rule that maintains confidence intervals for the estimates of plant parameters $(A_{(\star)},B_{(\star)})$, and channel reliability $p_{(\star)}$, and utilizes the principle of optimism in the face of uncertainty while making control decisions. We provide non-asymptotic performance guarantees for UCB-NCS by analyzing its "regret", i.e., performance gap from the scenario when $(A_{(\star)},B_{(\star)},p_{(\star)})$ are known to the controller. We show that with a high probability the regret can be upper-bounded as $\tilde{O}\left(C\sqrt{T}\right)$\footnote{Here $\tilde{O}$ hides logarithmic factors.}, where $T$ is the operating time horizon of the system, and $C$ is a problem dependent constant.

下载PDF全文

下载文献需遵守相关版权规定

论文标题