CASIA OpenIR

Browse/search results: 8 items in total, showing 1-8

Deep Reinforcement Learning With Part-Aware Exploration Bonus in Video Games (Journal article)
IEEE TRANSACTIONS ON GAMES, 2022, Volume: 14, Issue: 4, Pages: 644-653
Authors:  Xu, Pei;  Yin, Qiyue;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(1480Kb)  |  Views/Downloads: 320/80  |  Submitted: 2023/02/22
Deep learning; exploration; reinforcement learning; video game
Offline reinforcement learning with representations for actions (Journal article)
INFORMATION SCIENCES, 2022, Volume: 610, Pages: 746-758
Authors:  Lou, Xingzhou;  Yin, Qiyue;  Zhang, Junge;  Yu, Chao;  He, Zhaofeng;  Cheng, Nengjie;  Huang, Kaiqi
Views/Downloads: 182/0  |  Submitted: 2022/11/14
Offline reinforcement learning; Action embedding
Black swan event small-sample transfer learning (BEST-L) and its case study on electrical power prediction in COVID-19 (Journal article)
APPLIED ENERGY, 2022, Volume: 309, Pages: 10
Authors:  Hu, Chenxi;  Zhang, Jun;  Yuan, Hongxia;  Gao, Tianlu;  Jiang, Huaiguang;  Yan, Jing;  Gao, David Wenzhong;  Wang, Fei-Yue
Views/Downloads: 198/0  |  Submitted: 2022/07/25
Transfer learning; Black swan event; Small-sample learning; COVID-19; Load forecasting
Mutually trustworthy human-machine knowledge automation and hybrid augmented intelligence: mechanisms and applications of cognition, management, and control for complex systems (Journal article)
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, Pages: 16
Authors:  Wang, Fei-Yue;  Guo, Jianbo;  Bu, Guangquan;  Zhang, Jun Jason
Views/Downloads: 182/0  |  Submitted: 2022/07/25
Complex systems; Human-machine knowledge automation; Parallel systems; Bulk power grid dispatch; Artificial intelligence; Internet of Minds (IoM); TP11
Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning (Conference paper)
Italy, 2022-07
Authors:  Yang GK(杨光开);  Chenhao(陈皓);  Junge Zhang(张俊格);  Qiyue Yin(尹奇跃);  Kaiqi Huang(黄凯奇)
Adobe PDF(2924Kb)  |  Views/Downloads: 276/60  |  Submitted: 2022/07/12
Supervised assisted deep reinforcement learning for emergency voltage control of power systems (Journal article)
NEUROCOMPUTING, 2022, Volume: 475, Pages: 69-79
Authors:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  Views/Downloads: 337/68  |  Submitted: 2022/06/06
Deep reinforcement learning; Behavioral cloning; Dynamic demonstration; Emergency control
HackGAN: Harmonious Cross-Network Mapping Using CycleGAN With Wasserstein-Procrustes Learning for Unsupervised Network Alignment (Journal article)
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, Pages: 14
Authors:  Yang, Linyao;  Wang, Xiao;  Zhang, Jun;  Yang, Jun;  Xu, Yancai;  Hou, Jiachen;  Xin, Kejun;  Wang, Fei-Yue
Adobe PDF(4053Kb)  |  Views/Downloads: 307/52  |  Submitted: 2022/03/17
Task analysis; Optimization; Generative adversarial networks; Computational modeling; Automation; Training; Standards; Embedding; generative adversarial network; network alignment (NA); optimal transport; unsupervised learning
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning (Journal article)
NEUROCOMPUTING, 2022, Volume: 467, Pages: 300-309
Authors:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  Views/Downloads: 315/71  |  Submitted: 2021/12/28
Deep reinforcement learning; Behavioral cloning; Dynamic demonstration; Double DQN