CASIA OpenIR

Browse/Search results: 27 items in total, showing items 1-10

Token-level Direct Preference Optimization (Conference paper)
Vienna, Austria, 2024/7/21-27
Authors:  Zeng, Yongcheng;  Liu, Guoqing;  Ma, Weiyu;  Yang, Ning;  Zhang, Haifeng;  Wang, Jun
Adobe PDF(883Kb)  |  Views/Downloads: 73/24  |  Submitted: 2024/06/05
Human-robot object handover: Recent progress and future direction (Journal article)
Biomimetic Intelligence and Robotics, 2024, Volume: 4, Page: 100145
Authors:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  Views/Downloads: 49/17  |  Submitted: 2024/05/29
Human–robot interactions  Object handover  
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning (Conference paper)
New Orleans, LA, USA, November 28 - December 9, 2022
Authors:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  Views/Downloads: 39/8  |  Submitted: 2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  Views/Downloads: 214/51  |  Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation (Conference paper)
Online, 2021-02
Authors:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  Views/Downloads: 187/66  |  Submitted: 2023/06/28
Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
Thailand, November 18–22
Authors:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 135/38  |  Submitted: 2023/06/27
LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING (Conference paper)
Taipei, Taiwan, 26 August 2022
Authors:  Zhan Yuan;  Xu Zhiwei;  Fan Guoliang
Adobe PDF(2093Kb)  |  Views/Downloads: 176/55  |  Submitted: 2023/06/08
Adversarial analysis for source camera identification (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2021, Volume: 31, Issue: 11, Pages: 4174-4186
Authors:  Bo Wang;  Mengnan Zhao;  Wei Wang;  Xiaorui Dai;  Yi Li;  Yanqing Guo
Adobe PDF(6167Kb)  |  Views/Downloads: 173/45  |  Submitted: 2023/04/26
A KG-based Enhancement Framework for Fact Checking Using Category Information (Conference paper)
Online, November 2020
Authors:  Wang S(王帅);  Wang L(王磊);  Mao WJ(毛文吉)
Adobe PDF(1222Kb)  |  Views/Downloads: 212/52  |  Submitted: 2022/07/01
fact checking  knowledge graph  
Wd3: Taming the estimation bias in deep reinforcement learning (Conference paper)
Baltimore, MD, USA, 2020-12
Authors:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  Views/Downloads: 240/53  |  Submitted: 2022/06/27
deep reinforcement learning  estimation bias  neural networks