CASIA OpenIR

浏览/检索结果: 共143条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:21/5  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:36/15  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
On the Effects of Structural Modeling for Neural Semantic Parsing 会议论文
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(730Kb)  |  收藏  |  浏览/下载:36/20  |  提交时间:2024/06/27
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:65/34  |  提交时间:2024/06/24
Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs 会议论文
, Singapore, 2023.11.06-2023.11.10
作者:  Yao Xu;  Shizhu HE;  Cunguang Wang;  Li Cai;  Kang Liu;  Jun Zhao
Adobe PDF(811Kb)  |  收藏  |  浏览/下载:34/11  |  提交时间:2024/06/20
MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文
, Torino (Italia), 2024.5.20 - 2024.5.25
作者:  Xiang Li;  Shizhu He;  Jiayu Wu;  Zhao Yang;  Yao Xu;  Yang Jun;  Haifeng Liu;  Kang Liu;  Jun Zhao
Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:35/9  |  提交时间:2024/06/20
Controller Design and Stability Analysis for Spinning Missile Via Tensor Product 期刊论文
Aerospace Science and Technology, 2022, 页码: 107877
作者:  Zhiming Zhou;  Zhen Liu;  Yi Pan;  Jianqiang Yi
Adobe PDF(1047Kb)  |  收藏  |  浏览/下载:46/17  |  提交时间:2024/06/20
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:46/17  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Mixspeech: Data augmentation for low-resource automatic speech recognition 会议论文
, Toronto, Canada, 2021.6.6-2021.6.11
作者:  Meng Linghui;  Xu Jin;  Tan Xu;  Wang Jindong;  Qin Tao;  Xu Bo
Adobe PDF(1111Kb)  |  收藏  |  浏览/下载:34/8  |  提交时间:2024/06/11
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:46/18  |  提交时间:2024/06/11