论文标题
排队依赖于动作依赖的服务器性能:利用率降低
Queueing Subject To Action-Dependent Server Performance: Utilization Rate Reduction
论文作者
论文摘要
我们考虑一个离散的时间系统,该系统包括首次登上的队列,非抢先的服务器和固定的非工作人员控制调度程序。新任务根据伯努利进程以预先指定的到达率进入队列。在每个瞬间,服务器要么忙于执行任务,要么可以使用。当服务器可用时,调度程序要么将新任务分配给服务器,要么允许其可用(休息)。除了上述可用性状态外,我们还假设服务器具有整数值的活动状态。活动状态在工作期间无保留,否则并非犯罪。在我们框架的典型应用中,随着活动状态的增加,服务器性能(被理解为任务完成概率)会恶化。在本文中,我们基于同一框架获得的最新稳定性结果。具体来说,我们建立了设计调度策略的方法,不仅可以稳定队列,而且还会降低利用率 - 被理解为服务器正在工作的无限时间平均时间。本文具有一个主要定理,导致了两个关键结果:(i)我们提出了一种可使用有限维线性程序(LP)确定的可拖动方法,这是通过安排稳定的稳定策略来实现的所有利用率的最低限度。 (ii)我们提出了一种基于有限差的LP的设计方法,以获得稳定的调度策略,该策略可以任意接近上述信息最接近的利用率。我们还建立了整个文章中使用的结构和分布收敛属性,并且本身具有重要意义。
We consider a discrete-time system comprising a first-come-first-served queue, a non-preemptive server, and a stationary non-work-conserving scheduler. New tasks enter the queue according to a Bernoulli process with a pre-specified arrival rate. At each instant, the server is either busy working on a task or is available. When the server is available, the scheduler either assigns a new task to the server or allows it to remain available (to rest). In addition to the aforementioned availability state, we assume that the server has an integer-valued activity state. The activity state is non-decreasing during work periods, and is non-increasing otherwise. In a typical application of our framework, the server performance (understood as task completion probability) worsens as the activity state increases. In this article, we build on and transcend recent stabilizability results obtained for the same framework. Specifically, we establish methods to design scheduling policies that not only stabilize the queue but also reduce the utilization rate - understood as the infinite-horizon time-averaged portion of time the server is working. This article has a main theorem leading to two key results: (i) We put forth a tractable method to determine, using a finite-dimensional linear program (LP), the infimum of all utilization rates that can be achieved by scheduling policies that are stabilizing, for a given arrival rate. (ii) We propose a design method, also based on finite-dimensional LPs, to obtain stabilizing scheduling policies that can attain a utilization rate arbitrarily close to the aforementioned infimum. We also establish structural and distributional convergence properties, which are used throughout the article, and are significant in their own right.