CASIA OpenIR

浏览/检索结果: 共61条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:141/34  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:112/35  |  提交时间:2023/06/29
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:120/37  |  提交时间:2023/06/28
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:168/67  |  提交时间:2023/07/06
Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture 会议论文
, 美国路易斯安那州新奥尔良, 2022.06.19
作者:  Zhang, Chenghao;  Tian, Kun;  Fan, Bin;  Meng, Gaofeng;  Zhang, Zhaoxiang;  Pan, Chunhong
Adobe PDF(2647Kb)  |  收藏  |  浏览/下载:142/54  |  提交时间:2023/04/25
MiaoSuan Wargame: A Multi-Mode Integrated Platform for Imperfect Information Game 会议论文
, Beijing, China, August 21-24, 2022
作者:  Jiale Xu;  Jian Hu;  Shixian Wang;  Xuyang Yang;  Wancheng Ni
Adobe PDF(726Kb)  |  收藏  |  浏览/下载:58/16  |  提交时间:2023/06/28
open platform  human-computer gaming  AI evaluation  Turing test  imperfect information game  wargame  
Class-Incremental Learning via Dual Augmentation 会议论文
, Virtual, Dec 6-14, 2021
作者:  Zhu Fei (朱飞);  Zhen Cheng;  Xu-Yao Zhang;  Cheng-Lin Liu
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:131/55  |  提交时间:2023/09/12
DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning 会议论文
, Indianapolis, IN, USA, 19-22 September 2021
作者:  Jinhao Xi;  Fenghua Zhu;  Yuanyuan Chen;  Yisheng Lv;  Chang Tan;  Feiyue Wang
Adobe PDF(1652Kb)  |  收藏  |  浏览/下载:91/19  |  提交时间:2023/06/26
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文
, 西安, 2021.5.30-2021.6.5
作者:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:170/32  |  提交时间:2022/06/14
Trajectory-based Split Hindsight Reverse Curriculum Learning 会议论文
, Prague, Czech Republic, 2021-9
作者:  Wu, Jiaxi;  Zhang, Dianmin;  Zhong, Shanlin;  Qiao, Hong
Adobe PDF(5094Kb)  |  收藏  |  浏览/下载:200/43  |  提交时间:2022/06/14
Reinforcement Learning  Curriculum Learning