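WANG Zhenyu, XU Jing, HU Wenbo, QI Bei, WAN Changying. PPO reinforcement learning strategy for interactive wind-photovoltaic-storage operation in parks under an uncertain environment [J]. Power Demand Side Management, 2022, 24(5): 44-50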
PPO-based reinforcement learning strategy for interactive operation of wind-photovoltaic-storage in an uncertain environment
Received: 2022-06-08  Revised: 2022-08-02
DOI: 10.3969/j.issn.1009-1831.2022.05.008
Keywords: park energy management system; microgrid; wind-photovoltaic-storage interaction; battery energy storage system
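WANG Zhenyu: State Grid Electric Power Research Institute Co., Ltd. (NARI Group Co., Ltd.), Nanjing 210000; State Grid Electric Power Research Institute Wuhan Energy Efficiency Evaluation Co., Ltd., Wuhan 430074
XU Jing: State Grid Electric Power Research Institute Co., Ltd. (NARI Group Co., Ltd.), Nanjing 210000; State Grid Electric Power Research Institute Wuhan Energy Efficiency Evaluation Co., Ltd., Wuhan 430074
HU Wenbo: State Grid Electric Power Research Institute Co., Ltd. (NARI Group Co., Ltd.), Nanjing 210000; State Grid Electric Power Research Institute Wuhan Energy Efficiency Evaluation Co., Ltd., Wuhan 430074
QI Bei: State Grid Electric Power Research Institute Co., Ltd. (NARI Group Co., Ltd.), Nanjing 210000; State Grid Electric Power Research Institute Wuhan Energy Efficiency Evaluation Co., Ltd., Wuhan 430074
WAN Changying: State Grid Electric Power Research Institute Co., Ltd. (NARI Group Co., Ltd.), Nanjing 210000; State Grid Electric Power Research Institute Wuhan Energy Efficiency Evaluation Co., Ltd., Wuhan 430074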
      As the energy structure continues to evolve, parks equipped with renewable power generation will play an important role in the future power system. Uncertainties such as random demand, intermittent wind and solar output, and volatile electricity market prices are coupled, making it difficult to coordinate wind and solar generation with the battery energy storage system. Considering the limitations of traditional optimization methods, a deep reinforcement learning method based on the proximal policy optimization (PPO) algorithm is proposed to solve the problem of interactive wind-solar-storage operation in parks under uncertain environments. Within the theoretical framework of reinforcement learning, a Markov decision model with continuous state and action spaces and unknown transition probabilities is constructed for the interactive operation of the park. The new load control system controls the battery energy storage system and the flexible resources in the park microgrid to achieve economic operation while fully accounting for battery degradation.
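
To make the formulation in the abstract concrete, the following is a minimal Python sketch of one transition of a park battery model with a continuous charging action, together with the standard per-sample PPO clipped surrogate term. It is not the authors' model: the names `ParkState`, `step`, and `ppo_clip_term`, the reward definition, and all parameter values (capacity, time step, degradation cost, clipping range) are illustrative assumptions. Demand, renewable output, and price are treated as externally supplied, which is one way to reflect the unknown transition probabilities.

```python
from dataclasses import dataclass


@dataclass
class ParkState:
    # Hypothetical continuous state; the paper's actual state vector may differ.
    soc: float       # battery state of charge, fraction in [0, 1]
    load_kw: float   # park demand (kW)
    renew_kw: float  # combined wind and PV output (kW)
    price: float     # electricity price (currency per kWh)


def step(state, action_kw, capacity_kwh=100.0, dt_h=1.0, degradation_cost=0.02):
    """One transition; action_kw > 0 charges the battery, < 0 discharges it."""
    # Limit the action so the state of charge stays within [0, 1].
    max_charge = (1.0 - state.soc) * capacity_kwh / dt_h
    max_discharge = state.soc * capacity_kwh / dt_h
    p = max(-max_discharge, min(max_charge, action_kw))
    new_soc = state.soc + p * dt_h / capacity_kwh
    # Net power drawn from the grid (a negative value means export).
    grid_kw = state.load_kw - state.renew_kw + p
    energy_cost = state.price * grid_kw * dt_h
    # Assumed linear wear cost per kWh of battery throughput.
    wear_cost = degradation_cost * abs(p) * dt_h
    reward = -(energy_cost + wear_cost)
    return new_soc, reward


def ppo_clip_term(ratio, advantage, eps=0.2):
    """Per-sample clipped surrogate: min(r * A, clip(r, 1 - eps, 1 + eps) * A)."""
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)
```

For example, `step(ParkState(0.5, 80.0, 30.0, 0.5), 20.0)` charges the battery at 20 kW for one hour and returns the new state of charge with the negative of the energy and wear costs. In a full training loop, the policy network would output `action_kw`, and the average of `ppo_clip_term` over sampled transitions would be maximized.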