CASIA OpenIR

浏览/检索结果: 共67条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/05/28
Unsupervised Domain Adaptation on Sentence Matching Through Self-Supervision 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 6, 页码: 1237-1249
作者:  Bai, Gui-Rong;  Liu, Qing-Bin;  He, Shi-Zhu;  Liu, Kang;  Zhao, Jun
收藏  |  浏览/下载:28/0  |  提交时间:2024/03/26
unsupervised domain adaptation  sentence matching  self-supervision  
Unsupervised Dialogue State Tracking for End-to-End Task-Oriented Dialogue with a Multi-Span Prediction Network 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 4, 页码: 834-852
作者:  Liu, Qing-Bin;  He, Shi-Zhu;  Liu, Cao;  Liu, Kang;  Zhao, Jun
收藏  |  浏览/下载:29/0  |  提交时间:2024/02/22
end-to-end task-oriented dialogue  dialogue state tracking (DST)  unsupervised learning  reinforcement learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:142/29  |  提交时间:2023/06/21
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:154/62  |  提交时间:2023/06/20
Marine autonomous navigation for biomimetic underwater robots based on deep stereo attention network 会议论文
, Prague, Czech Republic, 2021年9月27日-2021年10月1日
作者:  Yan, Shuaizheng;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(4783Kb)  |  收藏  |  浏览/下载:145/58  |  提交时间:2023/06/12
Autonomous underwater vehicles  Visualization  Navigation  Biological system modeling  Real-time systems  
LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING 会议论文
, Taipei, Taiwan, 26 August 2022
作者:  Zhan Yuan;  Xu Zhiwei;  Fan Guoliang
Adobe PDF(2093Kb)  |  收藏  |  浏览/下载:147/49  |  提交时间:2023/06/08