CASIA OpenIR

浏览/检索结果: 共116条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:28/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision 期刊论文
International Journal of Computer Vision, 2024, 卷号: 132, 页码: 1659-1684
作者:  Xin Zhao;  Shiyu Hu;  Yipei Wang;  Zhang Jing;  Yimin Hu;  Rongshuai Liu;  Haibin Ling;  Yin Li;  Renshu Li;  Kun Liu;  Jiadong Li
Adobe PDF(9076Kb)  |  收藏  |  浏览/下载:26/7  |  提交时间:2024/06/21
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:39/17  |  提交时间:2024/06/12
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:39/15  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
作者:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  收藏  |  浏览/下载:47/11  |  提交时间:2024/06/07
A Fish-like Binocular Vision System for Underwater Perception of Robotic Fish 期刊论文
Biomimetics, 2024, 页码: 171
作者:  Tong Ru;  Wu Zhengxing;  Wang Jinge;  Huang Yupei;  Chen Di;  Yu Junzhi
Adobe PDF(4134Kb)  |  收藏  |  浏览/下载:39/15  |  提交时间:2024/06/06
Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions 期刊论文
IEEE Transactions on Fuzzy Systems, 2024, 页码: 1
作者:  Qingxu Fu;  Zhiqiang Pu;  Yi Pan;  Tenghai Qiu;  Jianqiang Yi
Adobe PDF(4975Kb)  |  收藏  |  浏览/下载:37/12  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:38/12  |  提交时间:2024/06/05
Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 卷号: 23, 期号: 2, 页码: 1-19
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Yiming Ju;  Jun Zhao;  Kang Liu
Adobe PDF(1250Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/05/30
Explanation  knowledge distillation  model compression