The Predictive-Control Optimization Method for Park Integrated Energy System Considering the High Penetration of Photovoltaics and Deep Reinforcement Learning
Chen Minghao1, Zhu Yueyao1, Sun Yi1, Xie Zhiyuan2, Wu Peng3
1. School of Electrical and Electronic Engineering North China Electric Power University Beijing 102206 China;
2. School of Electrical and Electronic Engineering North China Electric Power University Baoding 071000 China;
3. State Grid Energy Research Institute Co. Ltd Beijing 102209 China
As the interface between different energy infrastructures and users, park integrated energy system (PIES) has gained universal recognition for improving the reliability, resiliency, and profitability of multi-carrier energy systems by adaptively scheduling fast energy conversion units (e.g., combined heat and power (CHP), gas boiler (GB), and electric boiler (EB)) and participating in the various energy markets (e.g., electricity, heat, and natural gas). As a promising technology for replacing the rule-based decision-making in PIES, deep reinforcement learning (DRL) is a practical solution to identify the optimal control for energy conversion equipment. However, as PIES's customers perform more casual energy-consumption behaviors, the intermittency and volatility of demands make managing multi-energy supply and storage much harder for DRL agents. To tackle this task, focusing on the utilization of high penetration photovoltaic and the optimization of PIES's benefits, this article proposes an optimization scheduling method for PIES that combines the deep reinforcement learning and the interval prediction of photovoltaic power generation, considering the uncertainty of photovoltaic power generation.
Firstly, taking the equipment of energy conversion and storage as the scheduling objects, we design the predictive-control optimization structure, which can be divided into the facility level and information level, of PIES with electricity, gas, and heat, introducing the coordination between different sub-models. Secondly, the continuous and discrete feature data are respectively normalized and encoded for deterministic and probabilistic predicting the photovoltaic power generation based on temporal convolutional networks and kernel density estimation. Thirdly, based on the theory of model predictive control, the iteratively obtained intervals of photovoltaic power generation are used to construct the operating environment state of the control agent of soft actor critic (SAC) and to obtain the scheduling actions for PIES's equipment of energy conversion and storage.
Numerical results show that the proposed PFP-SAC method is able to identify the generation of photovoltaic power, improve the utilization of PV generation, and optimize the benefit of PIES by dynamic scheduling these conversion and storage equipment and increasing their operation efficiency. Meanwhile, these results prove that the gaps of energy purchasing price is the motivation of multi-energy conversion for PIES and its cost-saving. On the contrast, in the scenario of high penetration of photovoltaic power, the multi-energy conversion and storage of PIES need to simultaneously consider the consumption demand for photovoltaic power and the price-gaps of multi energy, and improve its utilization of photovoltaic power generation as much as possible by reserving energy storage resources. Finally, taking the traditional SAC and deep deterministic policy gradient (DDPG) as the benchmarks, the same datasets are utilized to verify the performance of proposed method and benchmarks, including the scheduling benefit and SOC of storage. The results show that our proposed method is superior for each index.
The following conclusions can be drawn from the simulation analysis: (1) A PIES model with multiple kinds of energy conversion and storage units are constructed, accompanying the uncertainty of renewable generation, demands, and energy purchasing prices. In this sense, it is closer to reality than existing PIES models. (2) Model predictive control theory and deep reinforcement learning algorithm are employed to cope with the intermittent nature of multi-energy demands. This paper constructs the state space of DRL models with prediction intervals of multi-energy demands of PIES, which is obtained by TCN and KDE. (3) Taking the operating cost saving as the prioritize objective and the generation utilization of photovoltaic power as secondary goal of PIES scheduling, soft actor critic, which is a promising DRL algorithm, is applied to reduce the operational expenditures and improve the usage of multi-energy storage capacity as much as possible. Compared with traditional DRL algorithms, it owns the advantages of predicting accuracy and the economic benefits of PIES management.
陈明昊, 朱月瑶, 孙毅, 谢志远, 吴鹏. 计及高渗透率光伏消纳与深度强化学习的综合能源系统预测调控[J]. 电工技术学报, 2024, 39(19): 6054-6071.
Chen Minghao, Zhu Yueyao, Sun Yi, Xie Zhiyuan, Wu Peng. The Predictive-Control Optimization Method for Park Integrated Energy System Considering the High Penetration of Photovoltaics and Deep Reinforcement Learning. Transactions of China Electrotechnical Society, 2024, 39(19): 6054-6071.
[1] 国家能源局.我国可再生能源发电总装机突破13亿kW[EB/OL]. [2023-11-18]. http:// www.nea.gov.cn/ 2023-07/19/c_1310733273.htm.
[2] 吴孟雪, 房方. 计及风光不确定性的电-热-氢综合能源系统分布鲁棒优化[J]. 电工技术学报, 2023, 38(13): 3473-3485.
Wu Mengxue, Fang Fang.Distributionally robust optimization of electricity-heat-hydrogen integrated energy system with wind and solar uncertainties[J]. Transactions of China Electrotechnical Society, 2023, 38(13): 3473-3485.
[3] 罗潇, 任洲洋, 温紫豪, 等. 考虑氢能系统热回收的电氢区域综合能源系统日前优化运行[J]. 电工技术学报, 2023, 38(23): 6359-6372.
Luo Xiao, Ren Zhouyang, Wen Zihao, et al.A day-ahead dispatching method of regional integrated electric-hydrogen energy systems considering the heat recycle of hydrogen systems[J]. Transactions of China Electrotechnical Society, 2023, 38(23): 6359-6372.
[4] Chen Ruijun, Tsay Y S, Zhang Ting.A multi-objective optimization strategy for building carbon emission from the whole life cycle perspective[J]. Energy, 2023, 262: 125373.
[5] 曾艾东, 王佳伟, 邹宇航, 等. 考虑供热管网储热的综合能源系统多时间尺度优化调度[J]. 高电压技术, 2023, 49(10): 4192-4202.
Zeng Aidong, Wang Jiawei, Zou Yuhang, et al.Multi-time-scale optimal scheduling of integrated energy system considering heat storage characteristics of heating network[J]. High Voltage Engineering, 2023, 49(10): 4192-4202.
[6] Robert L R, Ravi Singh L R S. Economic emission dispatch of hydro-thermal-wind using CMQLSPSN technique[J]. IET Renewable Power Generation, 2020, 14(14): 2680-2692.
[7] 黄文杰, 崔雪, 陈君, 等. 基于多智能体Q学习算法的能源互联园区协调调度[J]. 武汉大学学报(工学版), 2022, 55(11): 1141-1148.
Huang Wenjie, Cui Xue, Chen Jun, et al.Coordinated scheduling of energy interconnected parks based on multi-agent Q-learning algorithm[J]. Engineering Journal of Wuhan University, 2022, 55(11): 1141-1148.
[8] Correa-Jullian C, López Droguett E, Cardemil J M.Operation scheduling in a solar thermal system: a reinforcement learning-based framework[J]. Applied Energy, 2020, 268: 114943.
[9] 张帆, 武东昊, 陈玉萍, 等. 多智能体深度强化学习的分布式园区综合能源系统经济调度策略[J]. 电力系统及其自动化学报, 2022, 34(12): 18-26.
Zhang Fan, Wu Donghao, Chen Yuping, et al.Economic scheduling strategy for integrated energy system in distributed parks based on multi-agent deep reinforcement learning[J]. Proceedings of the CSU-EPSA, 2022, 34(12): 18-26.
[10] 杨挺, 赵黎媛, 刘亚闯, 等. 基于深度强化学习的综合能源系统动态经济调度[J]. 电力系统自动化, 2021, 45(5): 39-47.
Yang Ting, Zhao Liyuan, Liu Yachuang, et al.Dynamic economic dispatch for integrated energy system based on deep reinforcement learning[J]. Automation of Electric Power Systems, 2021, 45(5): 39-47.
[11] 程义, 李更丰. 基于双层模仿学习的多园区综合能源系统分布式协同优化调度[J]. 电力系统自动化, 2022, 46(24): 16-25.
Cheng Yi, Li Gengfeng.Distributed collaborative optimal dispatch of multi-park integrated energy system based on bilayer imitation learning[J]. Automation of Electric Power Systems, 2022, 46(24): 16-25.
[12] 陈明昊, 孙毅, 胡亚杰, 等. 基于纵向联邦强化学习的居民社区综合能源系统协同训练与优化管理方法[J]. 中国电机工程学报, 2022, 42(15): 5535-5549.
Chen Minghao, Sun Yi, Hu Yajie, et al.The collaborative training and management-optimized method for residential integrated energy system based on vertical federated reinforcement learning[J]. Proceedings of the CSEE, 2022, 42(15): 5535-5549.
[13] Bag A, Subudhi B, Ray P K.A combined reinforcement learning and sliding mode control scheme for grid integration of a PV system[J]. CSEE Journal of Power and Energy Systems, 2019, 5(4): 498-506.
[14] 刘自发, 张婷, 王岩. 基于模型预测控制的主动配电网多场景变时间尺度优化调度[J]. 电力自动化设备, 2022, 42(4): 121-128.
Liu Zifa, Zhang Ting, Wang Yan.Multi-scenario variable time scale optimal scheduling of active distribution network based on model predictive control[J]. Electric Power Automation Equipment, 2022, 42(4): 121-128.
[15] 余洋, 贾浩, 陈启维, 等. 基于改进模型预测控制的电-气系统新能源功率波动平滑策略[J]. 电力建设, 2021, 42(9): 65-73.
Yu Yang, Jia Hao, Chen Qiwei, et al.Fluctuation smoothing strategy of new energy power for electricity-gas interconnected system based on improved model predictive control[J]. Electric Power Construction, 2021, 42(9): 65-73.
[16] 舒晓欣, 林其友, 张健, 等. 基于双层模型预测控制的微电网经济调度[J]. 浙江工业大学学报, 2023, 51(3): 324-329.
Shu Xiaoxin, Lin Qiyou, Zhang Jian, et al.A Two-stage model predictive control strategy for economical operation of microgrid[J]. Journal of Zhejiang University of Technology, 2023, 51(3): 324-329.
[17] 吴梦丹, 张俊礼, 吴嘉峰, 等. 基于经济模型预测控制的光伏光热一体化热泵系统动态能效优化[J]. 中国电机工程学报, 2023, 43(6): 2119-2130.
Wu Mengdan, Zhang Junli, Wu Jiafeng, et al.Dynamic energy efficiency optimization of photovoltaic/ thermal integrated heat pump system based on economic model predictive control[J]. Proceedings of the CSEE, 2023, 43(6): 2119-2130.
[18] 杨海涛, 江晶晶, 赵敏, 等. 基于模型预测控制的区域综合能源系统运行优化方法[J]. 电气技术, 2022, 23(4): 7-13.
Yang Haitao, Jiang Jingjing, Zhao Min, et al.Operational optimization method of regional integrated energy system based on model predictive control[J]. Electrical Engineering, 2022, 23(4): 7-13.
[19] 王阳, 刘希喆. 基于GRU-MPC的光储充电站日前- 日内两阶段优化控制[J]. 电力自动化设备, 2022, 42(10): 177-183.
Wang Yang, Liu Xizhe.Day-ahead and intra-day two-stage optimal control of photovoltaic-energy storage charging station based on GRU-MPC[J]. Electric Power Automation Equipment, 2022, 42(10): 177-183.
[20] 王晓霞, 俞敏, 霍泽健, 等. 基于近邻传播聚类与LSTNet的分布式光伏电站群短期功率预测[J]. 电力系统自动化, 2023, 47(6): 133-141.
Wang Xiaoxia, Yu Min, Huo Zejian, et al.Short-term power forecasting of distributed photovoltaic station clusters based on affinity propagation clustering and long short-term time-series network[J]. Automation of Electric Power Systems, 2023, 47(6): 133-141.
[21] Zhang Qingyong, Chen Jiahua, Xiao Gang, et al.TransformGraph: a novel short-term electricity net load forecasting model[J]. Energy Reports, 2023, 9: 2705-2717.
[22] Blad C, Bøgh S, Kallesøe C S.Data-driven offline reinforcement learning for HVAC-systems[J]. Energy, 2022, 261: 125290.
[23] 陈柘, 刘嘉华, 赵斌, 等. 基于GCN和TCN的多因素城市路网出租车需求预测[J]. 控制与决策, 2023, 38(4): 1031-1038.
Chen Zhe, Liu Jiahua, Zhao Bin, et al.Multi-factor taxi demand forecasting for urban road network based on GCN and TCN[J]. Control and Decision, 2023, 38(4): 1031-1038.
[24] 邢晨, 张照贝. 基于改进时间卷积网络的短期光伏出力概率预测方法[J]. 太阳能学报, 2023, 44(2): 373-380.
Xing Chen, Zhang Zhaobei.Short-term probabilistic forecasting method of photovoltaic output power based on improved temporal convolutional network[J]. Acta Energiae Solaris Sinica, 2023, 44(2): 373-380.
[25] 宋绍剑, 姜屹远, 刘斌. 一种TCN的改进模型及其在短期光伏功率区间预测的应用[J]. 计算机应用研究, 2023, 40(10): 3064-3069.
Song Shaojian, Jiang Yiyuan, Liu Bin.Improved TCN model and its application in short-term photovoltaic power interval prediction[J]. Application Research of Computers, 2023, 40(10): 3064-3069.
[26] Wang Kejun, Qi Xiaoxia, Liu Hongda.Photovoltaic power forecasting based LSTM-convolutional Network[J]. Energy, 2019, 189: 116225.
[27] 万灿, 崔文康, 宋永华. 新能源电力系统概率预测:基本概念与数学原理[J]. 中国电机工程学报, 2021, 41(19): 6493-6509.
Wan Can, Cui Wenkang, Song Yonghua.Probabilistic forecasting for power systems with renewable energy sources: basic concepts and mathematical principles[J]. Proceedings of the CSEE, 2021, 41(19): 6493-6509.
[28] Cheng Lilin, Zang Haixiang, Xu Yan, et al.Probabilistic residential load forecasting based on micrometeorological data and customer consumption pattern[J]. IEEE Transactions on Power Systems, 2021, 36(4): 3762-3775.
[29] 陈明昊, 孙毅, 谢志远. 基于双层深度强化学习的园区综合能源系统多时间尺度优化管理[J]. 电工技术学报, 2023, 38(7): 1864-1881.
Chen Minghao, Sun Yi, Xie Zhiyuan.The multi-time-scale management optimization method for park integrated energy system based on the Bi-layer deep reinforcement learning[J]. Transactions of China Electrotechnical Society, 2023, 38(7): 1864-1881.
[30] 朱振山, 陈哲盛, 盛明鼎. 基于柔性行动器-评判器的园区综合能源系统运行优化[J]. 高电压技术, 2022, 48(12): 4949-4958.
Zhu Zhenshan, Chen Zhesheng, Sheng Mingding.Operation optimization of park-level integrated energy system based on soft actor-critic[J]. High Voltage Engineering, 2022, 48(12): 4949-4958.