Power Generation Technology


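Optimal Scheduling of Photovoltaics, Energy Storage, Direct Current, Flexibility (PEDF) Systems Based on Spiking Experience Replay Reinforcement Learning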

曾伟1,熊健豪1,简婧1,熊俊杰1,卢恒宇2,彭春华2   

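  1. State Grid Jiangxi Electric Power Research Institute, Nanchang 330096, Jiangxi Province, China; 2. School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, Jiangxi Province, China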
  • Funding:

    Key Research and Development Program of Jiangxi Province (20212BBE51002).

Optimization Scheduling of PEDF System Based on Spiking Neural Experience Replay Reinforcement Learning

ZENG Wei1, XIONG Jianhao1, JIAN Jing1, XIONG Junjie1, LU Hengyu2, PENG Chunhua2   

  1. State Grid Jiangxi Electric Power Research Institute, Nanchang 330096, China; 2. School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China

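Abstract (translated from the Chinese):

[Objective] To achieve optimized operation of the new photovoltaics, energy storage, direct current, flexibility (PEDF) system, an optimal scheduling model is constructed that incorporates the power regulation method of DC systems and takes minimum system operating cost as its objective, so that the system operates economically while remaining secure. [Methods] The real-time power difference between photovoltaic output and load is used to dynamically adjust the DC bus voltage, which serves as a global physical signal guiding the flexible devices in the system to regulate their power; the optimal scheduling model is built with system operating cost as the objective. To solve the model efficiently online, a spiking experience replay TD3 algorithm is proposed, which achieves real-time economic dispatch through offline training and online optimization. [Results] Case simulations demonstrate the superiority of the proposed PEDF scheduling model and the effectiveness of the proposed optimization strategy. [Conclusions] The proposed control strategy effectively reduces the system's dependence on the upstream grid and improves operating economy; comparison with several other algorithms verifies that the proposed algorithm performs better.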

Abstract:

[Objective] The photovoltaics, energy storage, direct current, flexibility (PEDF) system is a new type of power distribution system that can make effective use of flexible resources and promote on-site consumption of photovoltaic power. This work aims to optimize the operation of the PEDF system. [Methods] The difference between photovoltaic output and load demand is used to adjust the DC bus voltage, and this voltage serves as a signal that guides the flexible devices in the system to adjust their power; on this basis, an optimal scheduling model for the PEDF system is constructed. To achieve efficient online economic scheduling of the flexible devices, a spiking neural experience replay reinforcement learning algorithm is proposed, with offline training followed by online optimization of the model. [Results] Numerical simulations verify the superiority of the proposed PEDF scheduling model and the effectiveness of the proposed optimization strategy. [Conclusions] The proposed control strategy effectively regulates the flexible devices of the PEDF system, reduces dependence on the upstream grid, and improves the economic efficiency of system operation. Comparisons with several other algorithms verify that the proposed algorithm performs better.
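
The voltage-signal idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual voltage-power relationship, so the linear, clamped mapping below is an assumption made purely for illustration, and every name and number (`bus_voltage_reference`, `storage_power_command`, the 375 V nominal voltage, the 10% band, the kW ratings) is hypothetical rather than taken from the paper.

```python
def clamp(x, lo, hi):
    """Limit x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def bus_voltage_reference(p_pv_kw, p_load_kw,
                          v_nominal=375.0, v_band=0.10, p_rated_kw=50.0):
    """Map the PV-load power difference to a DC bus voltage set-point.

    Assumed rule (not from the paper): the set-point moves linearly
    within +/- v_band of v_nominal as the surplus ranges over
    +/- p_rated_kw, and saturates outside that range.
    """
    surplus_ratio = clamp((p_pv_kw - p_load_kw) / p_rated_kw, -1.0, 1.0)
    return v_nominal * (1.0 + v_band * surplus_ratio)


def storage_power_command(v_bus, v_nominal=375.0, v_band=0.10, p_max_kw=20.0):
    """Derive a flexible-device power command from the bus voltage alone.

    Positive values mean charging (absorbing surplus PV power),
    negative values mean discharging. The proportional response is
    an illustrative assumption.
    """
    deviation = clamp((v_bus - v_nominal) / (v_nominal * v_band), -1.0, 1.0)
    return p_max_kw * deviation


if __name__ == "__main__":
    # PV surplus raises the bus voltage, which tells storage to charge.
    v = bus_voltage_reference(p_pv_kw=40.0, p_load_kw=15.0)
    print(f"bus voltage set-point: {v:.1f} V")
    print(f"storage command: {storage_power_command(v):.1f} kW")
```

In this sketch the device reads only the bus voltage and needs no separate communication channel, which is the sense in which the voltage acts as a global signal. A learned policy such as the one proposed in the paper would replace the fixed proportional rule.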