电工技术学报  2024, Vol. 39 Issue (5): 1327-1339    DOI: 10.19595/j.cnki.1000-6753.tces.222273
电力系统与综合能源 |
结合数据驱动与物理模型的主动配电网双时间尺度电压协调优化控制
张剑1, 崔明建2, 何怡刚3
1.合肥工业大学电气与自动化工程学院 合肥 230009;
2.天津大学电气自动化与信息工程学院 天津 300072;
3.武汉大学电气与自动化学院 武汉 430072
Dual Timescales Coordinated and Optimal Voltages Control in Distribution Systems using Data-Driven and Physical Optimization
Zhang Jian1, Cui Mingjian2, He Yigang3
1. School of Electrical and Automation Engineering Hefei University of Technology Hefei 230009 China;
2. School of Electrical and Information Engineering Tianjin University Tianjin 300072 China;
3. School of Electrical and Automation Wuhan University Wuhan 430072 China
全文: PDF (3071 KB)   HTML
输出: BibTeX | EndNote (RIS)      
摘要 高比例电动汽车、分布式风电、光伏接入配电网,导致电压频繁地剧烈波动。传统调压设备与逆变器动作速度差异巨大,如何协调是难点问题。该文结合数据驱动与物理建模方法,提出一种配电网双时间尺度电压协调优化控制策略。针对短时间尺度(min级)电压波动,以静止无功补偿器、分布式电源无功功率为决策变量,以电压二次方偏差最小为目标函数,针对平衡与不平衡配电网,基于支路潮流方程,计及物理约束构建了二次规划模型。针对长时间尺度(h级)电压波动,以电压调节器匝比、可投切电容电抗器挡位、储能系统充放电功率为动作,当前时段配电网节点功率为状态,节点电压二次方偏差为代价,构建了马尔可夫决策过程。为克服连续-离散动作空间维数灾,提出了一种基于松弛-预报-校正的深度确定性策略梯度强化学习求解算法。最后,采用IEEE 33节点平衡与123节点不平衡配电网验证了所提出方法的有效性。
服务
把本文推荐给朋友
加入我的书架
加入引用管理器
E-mail Alert
RSS
作者相关文章
张剑
崔明建
何怡刚
关键词 智能配电网电压控制深度强化学习二次规划双时间尺度    
Abstract:A large number of electric vehicles (EVs), distributed solar and/or wind turbine generators (WTGs) connected to distribution systems lead to frequent and sharp voltages fluctuations. The action rates of conventional adjustable devices and smart inverters are very different. In this context, a novel dual-timescale voltage control scheme is proposed by organically combining data-driven with physics-based optimization. On fast timescale, a quadratic programming (QP) for balanced and unbalanced distribution systems is developed based on branch flow equations. The optimal reactive power of renewable distributed generators (DGs) and static VAR compensators (SVCs) is configured on several minutes. Whereas, on slow timescale, a data-driven Markovian decision process (MDP) is developed, in which the charge/discharge power of energy storage systems (ESSs), statuses/ratios of switchable capacitors reactors (SCRs), and voltage regulators (VRs) are configured hourly to minimize long-term discounted squared voltages magnitudes deviations using an adapted deep deterministic policy gradient (DDPG) deep reinforcement learning (DRL) algorithm. The capabilities of the proposed method are validated with IEEE 33-bus balanced and 123-bus unbalanced distribution systems.
The contributions of this paper are summarized as follows: (1)Combining data-driven with physics-based methods, a strategy for coordinated control of five different types of adjustable equipment, namely VRs, SCRs, ESSs, SVCs and DGs inverters on fast and slow timescales is proposed. (2) A slow timescale (say 1 hour) MDP for active and reactive power coordination is constructed. The (near) optimal settings of ratios/statuses of VRs, SCRs and charge/discharge power of ESSs are found using DRL algorithm. As a result, the deficiency of low computing rate for conventional physics -based large scale mixed integer non-convex nonlinear stochastic programming is completely overcome. (3) The charge/discharge power of ESS is continuous variable while ratios/statues of VRs and SCRs are discrete decisions. DDPG algorithm cannot be directly applicable to discrete action while DQN algorithm cannot be applicable to continuous action. Further, when there are a large number of VRs and SCRs in distribution network, DQN algorithm leads to dimensionality curses. The existing DRL algorithm in literatures cannot deal with joint continuous-discrete action (efficiently). To eliminate dimensionality curses in joint continuous- discrete action space, ratios/statuses of VRs and SCRs are firstly relaxed to continuous variables. Then, for the proto action given by actor of DDPG agent, Knn nearest neighbors are found out in the joint continuous-discrete action space. Finally, each of the Knn actions is transferred to the critic of DDPG agent one by one to evaluate its value. The action with the greatest value is chosen to interact with the distribution network. (4) Given the (near) optimal solution of the slow timescale MDP, a QP for VVO with DGs, SVCs inverters settings on fast timescale (say several minutes or seconds) to minimize squared voltages magnitudes deviations are developed for balanced and unbalanced distribution systems. As a result, voltage violations on fast timescale caused by sizable, rapid and frequent power fluctuations from renewable DGs and fast charged EVs can be mitigated in real time. (5) One of the outstanding advantages of the proposed method is very easy to perform in practice with near optimal solution. Further, when Knn =20 or Knn =40, the proposed method has much more stable training process than multi-agent DQN algorithm and much higher computing rate than conventional multi-slot single fast timescale mixed integer QP by about 18.0~36.7 times.
Key wordsSmart distribution systems    voltage control    deep reinforcement learning(DRL)    quadratic programming(QP)    dual-timescale   
收稿日期: 2022-12-12     
PACS: TM732  
基金资助:国家自然科学基金资助项目(52207130)
通讯作者: 张 剑 1982年生,男,博士,讲师,研究方向为电力系统建模、主动配电网技术、电动汽车有序充电等。E-mail:z_jj1219@sina.com   
作者简介: 崔明建 1987年生,男,博士,教授,研究方向为风力预测、机组组合、配电网物理信息系统等。E-mail:mingjian.cui@ieee.org
引用本文:   
张剑, 崔明建, 何怡刚. 结合数据驱动与物理模型的主动配电网双时间尺度电压协调优化控制[J]. 电工技术学报, 2024, 39(5): 1327-1339. Zhang Jian, Cui Mingjian, He Yigang. Dual Timescales Coordinated and Optimal Voltages Control in Distribution Systems using Data-Driven and Physical Optimization. Transactions of China Electrotechnical Society, 2024, 39(5): 1327-1339.
链接本文:  
https://dgjsxb.ces-transaction.com/CN/10.19595/j.cnki.1000-6753.tces.222273          https://dgjsxb.ces-transaction.com/CN/Y2024/V39/I5/1327