论文标题
延迟Q-update:一种新型的信用分配技术,用于为网格连接的微电网提供最佳操作策略
Delayed Q-update: A novel credit assignment technique for deriving an optimal operation policy for the Grid-Connected Microgrid
论文作者
论文摘要
微电网是一种创新的系统,它将分布式能源集成以在电界内提供电力需求。这项研究提出了一种推导理想的微电网操作策略的方法,该方法可以使用拟议的新型信用分配技术,即延迟-Q更新,从而在微电网系统中进行复杂的控件。该技术采用新颖的功能,例如处理和解决微电网有效特性的能力,这阻止了学习代理在复杂的控制下得出合适的政策。提出的技术跟踪了充电期的历史,并追溯为ESS充电控制分配了调整值。由于技术的过程,使用所提出的方法得出的操作策略非常适合ESS操作的实际影响。因此,它支持在复杂控制的微电网环境下搜索近乎最佳的操作策略。为了验证我们的技术,我们通过将我们的政策的绩效指标与基准政策和最佳政策进行比较,模拟了现实世界中网格连接的微电网系统下的操作政策,并通过将我们的政策的绩效指标与近乎最佳的政策进行了融合。
A microgrid is an innovative system that integrates distributed energy resources to supply electricity demand within electrical boundaries. This study proposes an approach for deriving a desirable microgrid operation policy that enables sophisticated controls in the microgrid system using the proposed novel credit assignment technique, delayed-Q update. The technique employs novel features such as the ability to tackle and resolve the delayed effective property of the microgrid, which prevents learning agents from deriving a well-fitted policy under sophisticated controls. The proposed technique tracks the history of the charging period and retroactively assigns an adjusted value to the ESS charging control. The operation policy derived using the proposed approach is well-fitted for the real effects of ESS operation because of the process of the technique. Therefore, it supports the search for a near-optimal operation policy under a sophisticatedly controlled microgrid environment. To validate our technique, we simulate the operation policy under a real-world grid-connected microgrid system and demonstrate the convergence to a near-optimal policy by comparing performance measures of our policy with benchmark policy and optimal policy.