CASIA OpenIR

Browse/Search results: 167 items in total, showing items 1-10

An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game (Conference paper)
Online, 2020-5
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  Views/Downloads: 23/10  |  Submitted: 2024/07/04
User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13-17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 37/16  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
A Portable Robot-Assisted Device With Built-In Intelligence for Autonomous Ultrasound Acquisitions in Follow-Up Diagnosis (Journal article)
IEEE Transactions on Instrumentation and Measurement, 2024, Volume: 73, Pages: 1-10
Authors:  Deng ZK(邓兆锟);  Hou XL(侯西龙);  Chen C(陈晨);  Gu XL(谷晓林);  Hou ZG(侯增广);  Wang SY(王双翌)
Adobe PDF(6984Kb)  |  Views/Downloads: 25/11  |  Submitted: 2024/06/25
Robots  Ultrasonic imaging  Probes  Robot sensing systems  Robot kinematics  Force  Safety  Autonomous US acquisition  medical ultrasound (US)  reinforcement learning (RL)  US robotic device  
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning (Conference paper)
Chongqing, China, 2023-11
Authors:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  Views/Downloads: 38/20  |  Submitted: 2024/06/24
Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations (Journal article)
IEEE Transactions on Artificial Intelligence, 2024, Pages: 1-12
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF(1162Kb)  |  Views/Downloads: 30/8  |  Submitted: 2024/06/24
UAV Path Planning with Terrain Constraints for Aerial Scanning (Journal article)
IEEE Transactions on Intelligent Vehicles, 2024, Volume: 9, Issue: 1, Pages: 1189-1203
Authors:  Jinbiao Yuan;  Zhenbao Liu;  Xiaoyu Xiong;  Yunfeng Ai;  Long Chen;  Bin Tian
Adobe PDF(3939Kb)  |  Views/Downloads: 75/20  |  Submitted: 2024/06/20
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model (Journal article)
IEEE Transactions on Intelligent Transportation Systems, 2024, DOI: 10.1109/TITS.2024.3400227
Authors:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF(3954Kb)  |  Views/Downloads: 42/16  |  Submitted: 2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
Boosting On-Policy Actor–Critic With Shallow Updates in Critic (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2024, Pages: 1-10
Authors:  Luntong Li;  Yuanheng Zhu
Adobe PDF(9953Kb)  |  Views/Downloads: 45/15  |  Submitted: 2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 1-12
Authors:  Boyu Li;  Haoran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  Views/Downloads: 32/10  |  Submitted: 2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game (Journal article)
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, Pages: 1-13
Authors:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  Views/Downloads: 44/8  |  Submitted: 2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games