CASIA OpenIR

浏览/检索结果: 共37条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:25/11  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition 会议论文
, 线上, 2021-10
作者:  chen yuxin;  zhang ziqi;  yuan chunfeng;  li bing;  deng ying;  hu weiming
Adobe PDF(7181Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/25
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文
, Chongqing, China, 2023-11
作者:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  收藏  |  浏览/下载:34/18  |  提交时间:2024/06/24
—Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:41/10  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Surface and Edge Detection for Primitive Fitting of Point Clouds 会议论文
, Los Angeles CA USA, 2023-8-6至2023-8-10
作者:  Li, Yuanqi;  Liu, Shun;  Yang, Xinran;  Guo, Jianwei;  Guo, Jie;  Guo, Yanwen
Adobe PDF(8381Kb)  |  收藏  |  浏览/下载:42/14  |  提交时间:2024/06/03
Primitive fitting  point cloud  shape reconstruction  deep neural network  
Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文
Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104
作者:  Yang YM(杨依明);  Wang ZC(王泽昌);  Xing DP(邢登鹏);  Wang P(王鹏)
Adobe PDF(3500Kb)  |  收藏  |  浏览/下载:29/12  |  提交时间:2024/05/30
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:41/15  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
D2AH-PPO: Playing ViZDoom With Object-Aware Hierarchical Reinforcement Learning 会议论文
, 中国重庆, 2024.5.7-5.9
作者:  Niu LY(钮龙宇);  Wan J(万军)
Adobe PDF(1645Kb)  |  收藏  |  浏览/下载:41/9  |  提交时间:2024/05/28
深度强化学习  表征学习  分层学习  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:33/19  |  提交时间:2024/05/28
Logic Traps in Evaluating Attribution Scores 会议论文
, Dublin, 22nd - 27th May 2022
作者:  Ju YM(鞠一鸣);  Zhang YZ(张元哲);  Yang C(杨朝);  Jiang ZT(江忠涛);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(1073Kb)  |  收藏  |  浏览/下载:154/56  |  提交时间:2023/06/29