CASIA OpenIR

Browse/Search results: 257 items in total, showing items 1-10

Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation Conference paper
New Orleans, 2023-12-9 to 2023-12-15
Authors:  Keji He;  Chenyang Si;  Zhihe Lu;  Yan Huang;  Liang Wang;  Xinchao Wang
Adobe PDF(2505Kb)  |  Views/Downloads: 9/5  |  Submitted: 2024/06/26
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning Conference paper
Kaifeng, Henan, May 17-19, 2024
Authors:  Huang, Shangjing;  Zhao, Zijie;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(15515Kb)  |  Views/Downloads: 8/3  |  Submitted: 2024/06/26
User Response Modeling in Reinforcement Learning for Ads Allocation Conference paper
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 9/4  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning Conference paper
Chongqing, China, 2023-11
Authors:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  Views/Downloads: 9/4  |  Submitted: 2024/06/24
Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
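Deep Reinforcement Learning Methods Based on Visual Representation (基于视觉表征的深度强化学习方法) Thesis
2024
Author:  Liu Minsong (刘民颂)
Adobe PDF(10778Kb)  |  Views/Downloads: 13/1  |  Submitted: 2024/06/22
deep reinforcement learning  visual representation learning  self-supervised learning  state abstraction  Transformer neural network  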
A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection Conference paper
Seattle, United States, 2024-06-17 to 2024-06-21
Authors:  Wang, Hanshi;  Zhang, Zhipeng;  Gao, Jin;  Hu, Weiming
Adobe PDF(2903Kb)  |  Views/Downloads: 18/4  |  Submitted: 2024/06/21
Visual Tracking via Spatially Aligned Correlation Filters Network Conference paper
Munich, Germany, September 8, 2018 - September 14, 2018
Authors:  Zhang, Mengdan;  Wang, Qiang;  Xing, Junliang;  Gao, Jin;  Peng, Peixi;  Hu, Weiming;  Maybank, Steve
Adobe PDF(1118Kb)  |  Views/Downloads: 16/3  |  Submitted: 2024/06/21
Memory-based Error Label Suppression for Embodied Self-Improving Object Detection Conference paper
Bari, Italy, 2024-8-28
Authors:  Deng JR(邓杰仁);  Zhang HJ(张好剑);  Hu JH(胡建华);  Wang YK(王云宽)
Adobe PDF(2603Kb)  |  Views/Downloads: 10/5  |  Submitted: 2024/06/20
Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning Conference paper
YOKOHAMA, JAPAN, 2024-07
Authors:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(1596Kb)  |  Views/Downloads: 18/6  |  Submitted: 2024/06/17
large language models  supervised fine-tuning  instruct tuning  stylized generation  
Learning to Deliberate: Multi-Pass Decoding for Document-Grounded Conversations Conference paper
YOKOHAMA, JAPAN, 2024-07
Authors:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(1033Kb)  |  Views/Downloads: 17/4  |  Submitted: 2024/06/17
dialogue system  document-grounded conversations  deliberation network  sequence-to-sequence framework