使用改编版的蒙特卡洛树搜索搜索空间树搜索组合优化问题

论文标题

使用改编版的蒙特卡洛树搜索搜索空间树搜索组合优化问题

Exploring search space trees using an adapted version of Monte Carlo tree search for combinatorial optimization problems

论文作者

Jooken, Jorik, Leyman, Pieter, Wauters, Tony, De Causmaecker, Patrick

论文摘要

在本文中，我们提出了一种启发式算法，以探索与组合优化问题实例相关的搜索空间树。该算法基于蒙特卡洛树搜索，这是一种流行的游戏玩法算法，用于探索游戏树，代表许多游戏的最新算法。提出了对蒙特卡洛树搜索的几种增强功能，使该算法更适合在组合优化环境中。这些增强功能利用了问题的组合结构，并旨在通过使用启发式模拟策略修剪子树来有效地探索搜索空间树，从而通过消除主导的价值分配并使用光束宽度来减少变量的域。该算法是针对两个组合优化问题专门量身定制的：Quay Crane调度问题与非跨限制和0-1 Knapsack问题的专门定制的组件。对于第一个问题，我们的算法超过了最先进的结果，并且在基准一组实例中找到了一些新的最佳解决方案。对于第二个问题，我们的算法通常会产生近乎最佳的解决方案，这些解决方案比最先进的结果稍差，但仅需要一小部分时间才能这样做。这些结果表明，对于两个完全不同的组合优化问题，该算法与最先进的算法具有竞争力。

In this article we propose a heuristic algorithm to explore search space trees associated with instances of combinatorial optimization problems. The algorithm is based on Monte Carlo tree search, a popular algorithm in game playing that is used to explore game trees and represents the state-of-the-art algorithm for a number of games. Several enhancements to Monte Carlo tree search are proposed that make the algorithm more suitable in a combinatorial optimization context. These enhancements exploit the combinatorial structure of the problem and aim to efficiently explore the search space tree by pruning subtrees, using a heuristic simulation policy, reducing the domains of variables by eliminating dominated value assignments and using a beam width. The algorithm was implemented with its components specifically tailored to two combinatorial optimization problems: the quay crane scheduling problem with non-crossing constraints and the 0-1 knapsack problem. For the first problem our algorithm surpasses the state-of-the-art results and several new best solutions are found for a benchmark set of instances. For the second problem our algorithm typically produces near-optimal solutions that are slightly worse than the state-of-the-art results, but it needs only a small fraction of the time to do so. These results indicate that the algorithm is competitive with the state-of-the-art for two entirely different combinatorial optimization problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题