CASIA OpenIR

Browse/search results: 76 items in total, showing items 1-10

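Research on Intelligent Air Combat Algorithms Based on a Self-Play Framework in Sparse-Reward Environments [Thesis]
2024
Author:  何少钦
Adobe PDF (4570Kb)  |  Views/Downloads: 10/0  |  Submitted: 2024/05/30
Keywords: reinforcement learning, offline reinforcement learning, air combat, intelligent decision-making, curiosity mechanism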
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control [Journal article]
Biomimetics, 2024, Issue: 9, Pages: 126
Authors:  Wang, Yu;  Wang, Jian;  Yu Lianyi;  Kong Shihan;  Yu Junzhi
Adobe PDF (1171Kb)  |  Views/Downloads: 7/1  |  Submitted: 2024/05/30
SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis [Conference paper]
Yokohama, Japan, 2024-6-30 - 2024-7-5
Authors:  Zefa Hu;  Linghui Meng;  Yunlong Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF (307Kb)  |  Views/Downloads: 12/1  |  Submitted: 2024/05/29
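Research on Task-Oriented Dialogue Systems in the Medical Domain [Thesis]
2024
Author:  胡泽发
Adobe PDF (3935Kb)  |  Views/Downloads: 12/0  |  Submitted: 2024/05/29
Keywords: medical dialogue systems, task-oriented dialogue systems, dialogue understanding, dialogue reasoning, hallucination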
Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning [Conference paper]
London, United Kingdom, 2023-5
Authors:  Yang, Chen;  Yang, Guangkai;  Zhang, Junge
Adobe PDF (2419Kb)  |  Views/Downloads: 6/1  |  Submitted: 2024/05/29
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning [Conference paper]
Queensland, Australia, 2023-6
Authors:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF (3027Kb)  |  Views/Downloads: 4/1  |  Submitted: 2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL [Conference paper]
New Orleans, LA, USA, December 10-16, 2023
Authors:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF (8700Kb)  |  Views/Downloads: 5/0  |  Submitted: 2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning [Conference paper]
Washington, DC, USA, February 7-14, 2023
Authors:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF (4141Kb)  |  Views/Downloads: 2/0  |  Submitted: 2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism [Conference paper]
Washington, DC, USA, February 7-14, 2023
Authors:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF (3345Kb)  |  Views/Downloads: 4/0  |  Submitted: 2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning [Conference paper]
New Orleans, LA, USA, November 28 - December 9, 2022
Authors:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF (4367Kb)  |  Views/Downloads: 2/0  |  Submitted: 2024/05/28