CASIA OpenIR

Browse/Search Results:  1-10 of 16 Help

Selected(0)Clear Items/Page:    Sort:
基于视觉-语言引导的机器人导航研究 学位论文
, 2024
Authors:  何科技
Adobe PDF(29796Kb)  |  Favorite  |  View/Download:65/5  |  Submit date:2024/06/25
视觉语言导航、数据稀缺、时序信息挖掘噪声、跨模态对齐、异常行为  
LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文
, Singapore, 2023-12
Authors:  Zhitao He;  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(1153Kb)  |  Favorite  |  View/Download:16/4  |  Submit date:2024/06/25
基于基础模型的分层强化学习 学位论文
, 2024
Authors:  吴俣桥
Adobe PDF(16716Kb)  |  Favorite  |  View/Download:34/0  |  Submit date:2024/06/21
强化学习  分层强化学习  基础模型  
基于脉冲神经网络的类脑情感共情与利他决策计算模型 学位论文
, 2024
Authors:  冯慧
Adobe PDF(18253Kb)  |  Favorite  |  View/Download:60/1  |  Submit date:2024/06/11
情感共情,利他决策,脉冲神经网络,突触可塑性,多脑区协同  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
Authors:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  Favorite  |  View/Download:41/16  |  Submit date:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
类脑心理揣测脉冲神经网络模型研究 学位论文
, 2024
Authors:  Zhao,Zhuoya
Adobe PDF(23946Kb)  |  Favorite  |  View/Download:27/2  |  Submit date:2024/06/04
类脑心理揣测模型  脉冲神经网络  多智能体社会交互  区分自我和他人  类脑心理揣测模型  脉冲神经网络  多智能体社会交互  区分自我和他人  类脑心理揣测模型  脉冲神经网络  多智能体社会交互  区分自我和他人  
事件因果关系挖掘关键技术研究 学位论文
, 2024
Authors:  何致涛
Adobe PDF(3575Kb)  |  Favorite  |  View/Download:83/3  |  Submit date:2024/05/28
事件因果关系识别  事件因果关系解释生成  预训练语言模型  多智能体  
Enhancing Multi-agent Coordination via Dual-channel Consensus 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 349-368
Authors:  Qingyang Zhang;  Kaishen Wang;  Jingqing Ruan;  Yiming Yang;  Dengpeng Xing;  Bo Xu
Adobe PDF(4997Kb)  |  Favorite  |  View/Download:71/26  |  Submit date:2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency  
Multi-robot cooperative target encirclement through learning distributed transferable policy 会议论文
, Online, July 19-24
Authors:  Zhang Tianle;  Liu Zhen;  Wu Shiguang;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(949Kb)  |  Favorite  |  View/Download:224/70  |  Submit date:2022/06/16
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
Authors:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  Favorite  |  View/Download:262/12  |  Submit date:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)