论文标题
基于POMDP的运动计划的非线性度量
Non-Linearity Measure for POMDP-based Motion Planning
论文作者
论文摘要
不确定性下的运动计划对于可靠的机器人操作至关重要。尽管在过去十年中取得了长足的进步,但对于具有复杂动态的系统而言,问题仍然很困难。大多数最先进的方法都执行依赖大量远期模拟的搜索。对于具有复杂动态的系统,这通常需要昂贵的数值集成,从而大大减慢了计划过程。已经提出了基于线性化的方法,可以减轻上述问题。但是,目前尚不清楚线性化如何影响生成的运动策略的质量,并且当这种简化是可以接受的。我们提出了一种非线性度量,称为基于统计距离的非线性度量(SNM),可以识别线性化的位置是有益的,应避免它。我们表明,当问题被构成部分可观察到的马尔可夫决策过程时,原始模型的最佳策略与线性化模型之间的值差可以由SNM中的函数线性界定。在各种情况下与现有度量的比较表明,SNM更适合估计基于线性化的求解器的有效性。为了测试SNM在运动计划中的适用性,我们提出了一个简单的在线规划师,该计划者使用SNM作为启发式程序来切换一般和基于线性的求解器。在具有二阶动力学和4-DOF和7-DOFS扭矩控制的操作器的类似汽车的机器人上的结果表明,SNM可以适当地决定是否以及何时应使用基于线性化的求解器。
Motion planning under uncertainty is essential for reliable robot operation. Despite substantial advances over the past decade, the problem remains difficult for systems with complex dynamics. Most state-of-the-art methods perform search that relies on a large number of forward simulations. For systems with complex dynamics, this generally require costly numerical integrations which significantly slows down the planning process. Linearization-based methods have been proposed that can alleviate the above problem. However, it is not clear how linearization affects the quality of the generated motion strategy, and when such simplifications are admissible. We propose a non-linearity measure, called Statistical-distance-based Non-linearity Measure (SNM), that can identify where linearization is beneficial and where it should be avoided. We show that when the problem is framed as the Partially Observable Markov Decision Process, the value difference between the optimal strategy for the original model and the linearized model can be upper bounded by a function linear in SNM. Comparisons with an existing measure on various scenarios indicate that SNM is more suitable in estimating the effectiveness of linearization-based solvers. To test the applicability of SNM in motion planning, we propose a simple on-line planner that uses SNM as a heuristic to switch between a general and a linearization-based solver. Results on a car-like robot with second order dynamics and 4-DOFs and 7-DOFs torque-controlled manipulators indicate that SNM can appropriately decide if and when a linearization-based solver should be used.