基于强化学习的电网调度研究 | |
王威![]() | |
2024-05-16 | |
页数 | 92 |
学位类型 | 硕士 |
中文摘要 | 随着用电需求的逐渐增加和电源形式的日益丰富,传统电网调度方法的局限性日益凸显,电网运行的质量和效率也亟需进一步提升。人工智能(Artifcial Intelligence, AI)方法凭借其强大的学习能力,近年来在多个领域表现出色,为
为了提高有功出力调度任务中电网调度的稳定性,同时兼顾调度的质量,本文提出融合了人类先验知识的分层安全强化学习(Hierarchical Safe Reinforcement Learning)的方法,在调度过程中同时兼顾稳定性以及原始任务性能。并且构建了新能源机组占比大的有功出力调度环境,验证了所提方法的性能,分层安全强化学习的方法具有更高的稳定性,同时任务性能也不会明显下降。 为了解决拓扑结构优化中的大规模不均衡离散动作空间问题,本文提出了基于搜索排序的强化学习(Reinforcement Learning based Searching and Ranking )的方法,通过多阶段逐级降维的方法获得更小的高质量动作空间,并使用强化学习对候选动作集合进行长期视角的排序。在标准的拓扑优化环境中对所提算法进行了全面的评估,本文所提方法可以在保证存活时间和累计奖励的前提下,大幅降低智能体的单步模拟次数,提高调度的效率。 最后,为了直观地展现本文研究成果,本文开发了针对有功出力调度和拓扑结构调度的电网运行可视化系统。该系统集成了本文讨论的两种关键调度场景,通过图形界面展示了算法在实际电网运行过程中的工作流程和效果,并允许用户通过交互修改调度进程,直观地对比不同策略对电网运行的影响。 |
英文摘要 | As electricity demand continues to increase and the diversity of power sources expands, the limitations of traditional grid dispatching methods have become increasingly apparent. There is an urgent need to enhance the quality and efciency of grid operation. Leveraging its powerful learning capabilities, Artifcial Intelligence (AI) has demonstrated outstanding performance in various felds in recent years, offering new solutions for the intelligent and digital transformation of power systems. Against this backdrop, Reinforcement Learning (RL), as a representative AI method, has gained attention for its remarkable performance in complex decision-making problems. RL, through interaction with the environment, can autonomously learn optimal strategies and has achieved signifcant success in felds such as go, video games, and robotic control. Its strong This thesis aims to explore the application of reinforcement learning in the optimization of automated dispatching in modern power grids, focusing primarily on two areas that have garnered signifcant attention in recent years: active power dispatching optimization under steady-state conditions and topology optimization. The objective is to enhance the operational quality of the grid under steady-state conditions. Active To improve the stability of active power dispatching tasks while maintaining dispatching quality, this paper proposes a Hierarchical Safe Reinforcement Learning method that integrates human prior knowledge. This approach balances stability and original task performance during the dispatching process. Additionally, an active power dispatching environment with a high proportion of renewable energy units was constructed to validate the performance of the proposed method. The results demonstrate that the proposed method provides higher stability without a signifcant decline in task performance. To address the issue of large-scale imbalanced discrete action spaces in topology optimization, this thesis proposes a Reinforcement Learning based Searching and Ranking method. This method employs a multi-stage dimensionality reduction approach to obtain a smaller, high-quality action space and uses reinforcement learning to rank the candidate actions from a long-term perspective. Comprehensive evaluations of the proposed algorithm were conducted in a standard topology optimization environments, demonstrating that the method signifcantly reduces the number of single-step simulations required by the agent while maintaining survival time and cumulative rewards, thus improving dispatching efciency.
|
关键词 | 电网自动化调度 电网有功出力调度 电网拓扑结构优化 强化学习 |
语种 | 中文 |
文献类型 | 学位论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/56906 |
专题 | 毕业生_硕士学位论文 |
推荐引用方式 GB/T 7714 | 王威. 基于强化学习的电网调度研究[D],2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
基于强化学习的电网调度研究.pdf(18647KB) | 学位论文 | 限制开放 | CC BY-NC-SA |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[王威]的文章 |
百度学术 |
百度学术中相似的文章 |
[王威]的文章 |
必应学术 |
必应学术中相似的文章 |
[王威]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论