CASIA OpenIR

浏览/检索结果: 共184条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:192/65  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
HMDRL: Hierarchical Mixed Deep Reinforcement Learning to Balance Vehicle Supply and Demand 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 页码: 12
作者:  Xi, Jinhao;  Zhu, Fenghua;  Ye, Peijun;  Lv, Yisheng;  Tang, Haina;  Wang, Fei-Yue
Adobe PDF(3316Kb)  |  收藏  |  浏览/下载:223/29  |  提交时间:2022/09/19
deep reinforcement learning  online ride-hailing system  hierarchical repositioning framework  parallel coordination mechanism  mixed state  
3D激光雷达的定位与建图研究 学位论文
, 中国科学院自动化研究所: 中国科学院自动化研究所, 2022
作者:  梁爽
Adobe PDF(32289Kb)  |  收藏  |  浏览/下载:319/9  |  提交时间:2022/06/28
3D激光雷达  同时定位与建图  有向几何点特征  3D激光里程计  滤波和平滑  3D激光-惯性里程计  
Supervised assisted deep reinforcement learning for emergency voltage control of power systems 期刊论文
NEUROCOMPUTING, 2022, 卷号: 475, 页码: 69-79
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  收藏  |  浏览/下载:265/56  |  提交时间:2022/06/06
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Emergency control  
HackGAN: Harmonious Cross-Network Mapping Using CycleGAN With Wasserstein-Procrustes Learning for Unsupervised Network Alignment 期刊论文
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 页码: 14
作者:  Yang, Linyao;  Wang, Xiao;  Zhang, Jun;  Yang, Jun;  Xu, Yancai;  Hou, Jiachen;  Xin, Kejun;  Wang, Fei-Yue
Adobe PDF(4053Kb)  |  收藏  |  浏览/下载:244/39  |  提交时间:2022/03/17
Task analysis  Optimization  Generative adversarial networks  Computational modeling  Automation  Training  Standards  Embedding  generative adversarial network  network alignment (NA)  optimal transport  unsupervised learning  
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 467, 页码: 300-309
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:260/58  |  提交时间:2021/12/28
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Double DQN  
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:85/34  |  提交时间:2023/04/26
Empirical Learning of Decision Parameters for Agent-Based Model 会议论文
, Macau, China, 2022
作者:  Song B(宋冰);  Xiong G(熊刚);  Zhu F(朱凤华);  Wu X(武许可);  Lv Y(吕宜生);  Ye P(叶佩军)
Adobe PDF(1359Kb)  |  收藏  |  浏览/下载:103/40  |  提交时间:2023/06/26
Parallel crop planning based on price forecast 期刊论文
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 期号: 1, 页码: 22
作者:  Fan, Menghan;  Kang, Mengzhen;  Wang, Xiujuan;  Hua, Jing;  He, Chaoxing;  Wang, Fei-Yue
Adobe PDF(2074Kb)  |  收藏  |  浏览/下载:252/51  |  提交时间:2021/12/28
agent-based modeling  crop planning  parallel agricultural management  price prediction  
宽度神经架构搜索 学位论文
工学博士, 中国科学院自动化研究所智能化大厦三层: 中国科学院大学人工智能学院, 2021
作者:  丁子祥
Adobe PDF(5152Kb)  |  收藏  |  浏览/下载:183/6  |  提交时间:2022/01/06
神经架构搜索  宽度卷积神经网络  宽度神经架构搜索