论文标题
加速基于深钢筋学习的基于紧急电压控制的负载脱落
Accelerated Deep Reinforcement Learning Based Load Shedding for Emergency Voltage Control
论文作者
论文摘要
负载脱落一直是针对电压不稳定性的最广泛使用和有效的紧急控制方法之一。随着电力系统中不确定性的增加和迅速变化的操作条件,现有方法在速度,适应性或可伸缩性方面都有出色的问题。近年来,深入的增强学习(DRL)被视为一种有前途的自适应网格稳定性控制的有前途的方法。但是,现有的DRL算法在应用于电力系统控制问题时显示了两个出色的问题:1)计算效率低下,需要广泛的培训和调整时间; 2)可伸缩性差,因此很难扩展到高维控制问题。为了克服这些问题,开发了一种名为PARS的加速DRL算法,并根据负载脱落为电力系统电压稳定性控制而量身定制。 PARS具有高可扩展性,并且仅使用五个主要的超参数调节。该方法在IEEE 39-BUS和IEEE 300总线系统上进行了测试,而后者是此类研究的最大规模。测试结果表明,与其他方法相比,包括模型预测性控制(MPC)和近端策略优化(PPO)方法相比,PARS显示出更好的计算效率(更快的融合),学习方面的稳健性,出色的可伸缩性和概括能力。
Load shedding has been one of the most widely used and effective emergency control approaches against voltage instability. With increased uncertainties and rapidly changing operational conditions in power systems, existing methods have outstanding issues in terms of either speed, adaptiveness, or scalability. Deep reinforcement learning (DRL) was regarded and adopted as a promising approach for fast and adaptive grid stability control in recent years. However, existing DRL algorithms show two outstanding issues when being applied to power system control problems: 1) computational inefficiency that requires extensive training and tuning time; and 2) poor scalability making it difficult to scale to high dimensional control problems. To overcome these issues, an accelerated DRL algorithm named PARS was developed and tailored for power system voltage stability control via load shedding. PARS features high scalability and is easy to tune with only five main hyperparameters. The method was tested on both the IEEE 39-bus and IEEE 300-bus systems, and the latter is by far the largest scale for such a study. Test results show that, compared to other methods including model-predictive control (MPC) and proximal policy optimization(PPO) methods, PARS shows better computational efficiency (faster convergence), more robustness in learning, excellent scalability and generalization capability.