CASIA OpenIR

浏览/检索结果: 共38条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:196/67  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:112/35  |  提交时间:2023/06/29
Multi-Objective Bayesian Optimization using Deep Gaussian Processes with Applications to Copper Smelting Optimization 会议论文
, 新加坡, 2022-12
作者:  Kang, Liwen;  Wang, Xuelei;  Wu, Zhiheng;  Wang, Ruihua
Adobe PDF(607Kb)  |  收藏  |  浏览/下载:90/22  |  提交时间:2023/06/29
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:167/67  |  提交时间:2023/07/06
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization 会议论文
, 线上举办, 2021-10-11
作者:  Weihan Chen;  Peisong Wang;  Jian Cheng
Adobe PDF(696Kb)  |  收藏  |  浏览/下载:87/30  |  提交时间:2023/06/20
Adaptive Event-triggered Tracking Control for A Manipulator Based on Dynamic Neural Network 会议论文
, 重庆, 2021-7
作者:  Gao jie;  Zhang xiaodong;  Qiao hong
Adobe PDF(3222Kb)  |  收藏  |  浏览/下载:180/57  |  提交时间:2022/06/14
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:310/121  |  提交时间:2022/06/14
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:197/37  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:86/28  |  提交时间:2023/06/27
Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology 会议论文
, 北京, 2020-12-5
作者:  Wang Xin;  Wei Qinglai;  Song Ruizhuo
Adobe PDF(898Kb)  |  收藏  |  浏览/下载:71/30  |  提交时间:2023/06/28