论文标题
在随机环境中对力控制机器人搜索策略的无启发式优化
Heuristic-free Optimization of Force-Controlled Robot Search Strategies in Stochastic Environments
论文作者
论文摘要
在工业和服务领域中,使用机器人的核心优势是它们快速可靠地执行重复性任务的能力。但是,即使是相对简单的孔洞任务,通常也会受到随机变化的影响,需要搜索运动才能找到相关的功能,例如孔。尽管搜索提高了鲁棒性,但它以增加运行时的成本为代价:更详尽的搜索将最大化成功执行给定任务的可能性,但会大大延迟任何下游任务。根据简单的启发式方法,这种权衡通常由人类专家解决,这些启发式很少是最佳的。本文介绍了一种自动,数据驱动和无启发式方法,以优化机器人搜索策略。通过训练搜索策略的神经模型在一系列模拟随机环境上,在几个现实世界的示例上进行调节并颠倒模型,我们可以推断出适应基本概率分布的时间变化特征,同时需要很少的现实世界测量值。在螺旋和探测器搜索电子组件的背景下,我们评估了对两个不同工业机器人的方法。
In both industrial and service domains, a central benefit of the use of robots is their ability to quickly and reliably execute repetitive tasks. However, even relatively simple peg-in-hole tasks are typically subject to stochastic variations, requiring search motions to find relevant features such as holes. While search improves robustness, it comes at the cost of increased runtime: More exhaustive search will maximize the probability of successfully executing a given task, but will significantly delay any downstream tasks. This trade-off is typically resolved by human experts according to simple heuristics, which are rarely optimal. This paper introduces an automatic, data-driven and heuristic-free approach to optimize robot search strategies. By training a neural model of the search strategy on a large set of simulated stochastic environments, conditioning it on few real-world examples and inverting the model, we can infer search strategies which adapt to the time-variant characteristics of the underlying probability distributions, while requiring very few real-world measurements. We evaluate our approach on two different industrial robots in the context of spiral and probe search for THT electronics assembly.