CASIA OpenIR

浏览/检索结果: 共158条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/07/12
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25
MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文
, Bangkok, Thailand, 2024-08
作者:  Wang, Chenhao;  Cao, Pengfei;  Jin, Zhuoran;  Chen, Yubo;  Zeng, Daojian;  Liu, Kang;  Zhao, Jun
Adobe PDF(571Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/25
A Survey of Recent Advances in Commonsense Knowledge Acquisition: Methods and Resources 期刊论文
Machine Intelligence Research, 2024, 页码: 1
作者:  Wang, Chenhao;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Zhao, Jun
Adobe PDF(1228Kb)  |  收藏  |  浏览/下载:31/8  |  提交时间:2024/06/25
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:83/45  |  提交时间:2024/06/24
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision 期刊论文
International Journal of Computer Vision, 2024, 卷号: 132, 页码: 1659-1684
作者:  Xin Zhao;  Shiyu Hu;  Yipei Wang;  Zhang Jing;  Yimin Hu;  Rongshuai Liu;  Haibin Ling;  Yin Li;  Renshu Li;  Kun Liu;  Jiadong Li
Adobe PDF(9076Kb)  |  收藏  |  浏览/下载:40/11  |  提交时间:2024/06/21
Prediction and Calibration: Complex Reasoning over Knowledge Graph with Bi-directional Directed Acyclic Graph Neural Network 会议论文
, Toronto, Canada, 2023.07.09-2023.07.14
作者:  Yao Xu;  Shizhu HE;  Li Cai;  Kang Liu;  Jun Zhao
Adobe PDF(628Kb)  |  收藏  |  浏览/下载:44/14  |  提交时间:2024/06/20
Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering 会议论文
, Bangkok, Thailand, 2024.08.11-2024.08.16
作者:  Xiang Li;  Shizhu HE;  Fangyu Lei;  Jun Yang;  Tianhuang Su;  Kang Liu;  Jun Zhao
Adobe PDF(873Kb)  |  收藏  |  浏览/下载:51/18  |  提交时间:2024/06/20
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:57/22  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning