发电技术

• •    下一篇

基于改进软演员评论家算法的混合光伏温差发电系统动态补偿策略

周雷1, 杨博1, 桑一岩2   

  1. 1.昆明理工大学电力工程学院,云南省 昆明市 650500;2.上海电力大学电气工程学院,上海市 杨浦区 200090
  • 出版日期:2026-04-28 发布日期:2026-04-28
  • 基金资助:

    国家自然科学基金项目资助(62263014,52507123)。

Dynamic Compensation Strategy of Hybrid PV-TEG Systems via Improved Soft Actor-Critic Algorithm

ZHOU Lei 1, YANG Bo 1*, SANG Yiyan 2   

  1. 1. Faculty of Electric Power Engineering, Kunming University of Science and Technology, Kunming 650500, Yunnan Province, China; 2. College of Electrical Engineering, Shanghai University of Electric Power, Yangpu District , Shanghai 200090, China
  • Published:2026-04-28 Online:2026-04-28

摘要: 【的】光伏-温差发电(photovoltaic-thermoelectric generation,PV-TEG)混合系统虽能通过回收光伏余热提升能源利用率,但在部分遮蔽条件(partial shading conditions,PSC)下仍面临电-热参数优化异步、电池补偿机制僵化、传统算法易局部最优或策略不稳定等问题,严重制约系统效率。【方法】提出基于改进软演员评论家(improved soft actor-critic,ISAC)算法的动态补偿策略,通过优化PV阵列拓扑逻辑、TEG功率调节规律及电池自适应补偿机制,实现电-热参数同步优化与动态环境适配;采用MATLAB/Simulink仿真平台,在6×4、6×6等不同规模阵列上开展性能验证,对比传统启发式算法与常规强化学习算法的优化效果。【结果】10种PSC场景下的案例验证表明,ISAC与传统算法相比,所提策略使6×4、6×6阵列失配损耗分别降低12.3%~18.7%、13.6%~19.2%,整体功率提升4.2%~7.3%。【结论】该算法及其策略有效解决了现有混合PV-TEG系统协同性差、动态适应性弱的核心痛点,为复杂遮挡环境下太阳能高效利用提供了可靠技术支撑,对提升可再生能源系统的运行效率与稳定性具有重要意义。

关键词:

"> 动态补偿;混合PV-TEG系统;软演员评论家;部分遮蔽;强化学习;重构;失配损耗抑制;余热回收

Abstract: [Objective] Although the photovoltaic-thermoelectric generation (PV-TEG) hybrid system can improve energy utilization efficiency by recovering photovoltaic waste heat, it still faces problems such as asynchronous electro-thermal parameter optimization, rigid battery compensation mechanism, and traditional algorithms being prone to local optimality or unstable strategies under partial shading conditions (PSC), which seriously restrict the system efficiency. [Method] A dynamic compensation strategy based on the improved soft actor-critic (ISAC) algorithm is proposed. By optimizing PV array topology logic, TEG power regulation law and battery adaptive compensation mechanism, the synchronous optimization of electro-thermal parameters and dynamic environment adaptation are realized. The MATLAB/Simulink simulation platform is adopted to conduct performance verification on arrays of different scales such as 6×4 and 6×6, and the optimization effects are compared with those of traditional heuristic algorithms and conventional reinforcement learning algorithms. [Result] Case verification under 10 PSC scenarios indicates that, compared with traditional algorithms, the proposed strategy reduces the mismatch loss of 6×4 and 6×6 arrays by 12.3%~18.7% and 13.6%~19.2% respectively, with an overall power increase of 4.2%~7.3%. [Conclusion] The algorithm and its corresponding strategy effectively solve the core pain points of poor coordination and weak dynamic adaptability of existing PV-TEG systems, provide reliable technical support for the efficient utilization of solar energy under complex shading environments, and are of great significance for improving the operation efficiency and stability of renewable energy systems.

Key words:

dynamic compensation, hybrid PV-TEG systems, ISAC, PSC, reinforcement learning, reconstruction, mismatch loss suppression, waste heat recovery