CASIA OpenIR

浏览/检索结果: 共81条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:21/5  |  提交时间:2024/07/12
Uncertainty-aware Boundary Attention Network for Real-time Semantic Segmentation 会议论文
, 中国福建厦门, 2023年10月13日
作者:  Zhu YB(朱袁兵);  Zhu BK(朱炳科);  Chen YY(陈盈盈);  Wang JQ(王金桥)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:26/12  |  提交时间:2024/06/27
Uncertainty Estimation  Real-time Semantic Segmentation  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/25
LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文
, Singapore, 2023-12
作者:  Zhitao He;  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(1153Kb)  |  收藏  |  浏览/下载:17/4  |  提交时间:2024/06/25
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/24
Learning to Correct Erroneous Words for Document Grounded Conversations 会议论文
, Kuantan, Malaysia, 2023.02.23-2023.02.25
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  收藏  |  浏览/下载:43/18  |  提交时间:2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:45/18  |  提交时间:2024/06/11
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文
, Singapore, 2023/8/24-27
作者:  Chen,Shuo;  Yang,Ning;  Zhang,Meng;  Wang,Jun
Adobe PDF(1413Kb)  |  收藏  |  浏览/下载:50/11  |  提交时间:2024/06/05
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing 会议论文
, Singapore, 2023/8/24-27
作者:  Yang,Ning;  Wen,Junrui;  Zhang,Meng;  Tang,Ming
Adobe PDF(499Kb)  |  收藏  |  浏览/下载:52/18  |  提交时间:2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:41/12  |  提交时间:2024/06/05