CASIA OpenIR

浏览/检索结果: 共75条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:16/8  |  提交时间:2024/07/04
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms 会议论文
, Taiyuan, Shanxi, China, 2024-07-27
作者:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF(2254Kb)  |  收藏  |  浏览/下载:21/12  |  提交时间:2024/06/26
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/25
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:24/11  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Bi-Directional and Early Interaction Transformers for Bird’s Eye View Semantic Segmentation 会议论文
, Vancouver Convention Center, 2023 年 6 月 18 日 – 2023 年 6 月 22 日
作者:  Pan Cong;  He Yonghao;  Peng Junran;  Zhang Qian;  Sui Wei;  Zhang Zhaoxiang
Adobe PDF(2215Kb)  |  收藏  |  浏览/下载:41/21  |  提交时间:2024/06/12
CoDRMA: Collaborative Depth Refinement via Dual-Mask and Dual-Attention for Bird’s Eye View Collaborative 3D Object Detection 会议论文
, Bari,Italy, 2024年8月28
作者:  Yang,Kang;  Wang, Yongcai;  Han, Yunjun;  Jia,Qingshan
Adobe PDF(1601Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/06/11
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:41/15  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
Content Based Deep Learning Image Retrieval: A Survey 会议论文
, Lingshui, China, 2023-12-14
作者:  Chi, Zhang;  JIe, Liu
Adobe PDF(504Kb)  |  收藏  |  浏览/下载:37/10  |  提交时间:2024/05/28
Content Based Image Retrieval  Deep Learning  Convolution Neural Network