CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:40/17  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning 会议论文
, 河南开封, 2024年5月17-19日
作者:  Huang, Shangjing;  Zhao, Zijie;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(15515Kb)  |  收藏  |  浏览/下载:32/11  |  提交时间:2024/06/26
VECTOR QUANTIZATION KNOWLEDGE TRANSFER FOR END-TO-END TEXT IMAGE MACHINE TRANSLATION 会议论文
Proceedings of 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, 14-19 April 2024
作者:  Ma, Cong;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1090Kb)  |  收藏  |  浏览/下载:48/21  |  提交时间:2024/06/26
V2X-BGN: Camera-based V2X-Collaborative 3D Object Detection with BEV Global Non-Maximum Suppression 会议论文
, Jeju Island, South Korea, June 2-5, 2024
作者:  Zhang Caiji;  Tian Bin;  Meng Shi;  Qi Shuangying;  Sun Yang;  Ai Yunfeng;  Chen Long
Adobe PDF(1659Kb)  |  收藏  |  浏览/下载:26/9  |  提交时间:2024/06/25
V2X  
Conditional Diffusion Guided by Part-level Latent for Dental Crown Point Cloud Generation 会议论文
, 昆明, 2024-3
作者:  Ao,Zhang;  Zhen,Shen;  Jian,Yang;  Qihang,Fang;  Gang,Xiong;  Xisong,Dong
Adobe PDF(9444Kb)  |  收藏  |  浏览/下载:46/16  |  提交时间:2024/06/25
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:42/18  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文
, Chongqing, China, 2023-11
作者:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  收藏  |  浏览/下载:46/20  |  提交时间:2024/06/24
—Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
基于视觉表征的深度强化学习方法 学位论文
, 2024
作者:  刘民颂
Adobe PDF(10778Kb)  |  收藏  |  浏览/下载:49/4  |  提交时间:2024/06/22
深度强化学习,视觉表征学习,自监督学习,状态抽象,Transformer神经网络  
Learning to Deliberate: Multi-Pass Decoding for Document-Grounded Conversations 会议论文
, YOKOHAMA, JAPAN, 2024-07
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(1033Kb)  |  收藏  |  浏览/下载:44/14  |  提交时间:2024/06/17
dialogue system  document-grounded conversations  deliberation network  sequence-to-sequence framework  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning