CASIA OpenIR

浏览/检索结果: 共39条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:29/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25
Invisible Intruders: Label-Consistent Backdoor Attack using Re-parameterized Noise Trigger 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 14, 期号: 8, 页码: 1-13
作者:  Bo Wang;  Fei Yu;  Fei Wei;  Yi Li;  Wei Wang
Adobe PDF(1364Kb)  |  收藏  |  浏览/下载:45/15  |  提交时间:2024/06/21
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:40/16  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:38/12  |  提交时间:2024/06/05
Learning Multi-Resolution Features for Unsupervised Anomaly Localization on Industrial Textured Surfaces 期刊论文
IEEE Transactions on Artificial Intelligence, 2024, 页码: 1-13
作者:  Tao X(陶显);  Shaohua Yan;  Xinyi Gong;  Chandranath Adak
Adobe PDF(6034Kb)  |  收藏  |  浏览/下载:42/12  |  提交时间:2024/06/04
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
A deep learning-based computational prediction model for characterizing cellular biomarker distribution in tumor microenvironment 期刊论文
SPIE, 2022, 卷号: 12039, 页码: 1605-7422
作者:  Zhengyao Peng;  Chang Bian;  Yang Du;  Jie Tian
Adobe PDF(572Kb)  |  收藏  |  浏览/下载:50/22  |  提交时间:2024/05/28
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:208/79  |  提交时间:2023/06/26
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:175/34  |  提交时间:2023/06/21