CASIA OpenIR

Browse/Search results: 25 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 19/7  |  Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors: Qingxu, Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan; Zhiqiang Pu
Adobe PDF(996Kb)  |  Views/Downloads: 31/9  |  Submitted: 2024/06/05
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment (Conference paper)
Xiamen International Conference Center, 2023-10-13
Authors: Chen ZP(陈忠鹏); Guan Q(关强)
Adobe PDF(2260Kb)  |  Views/Downloads: 29/9  |  Submitted: 2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning (Conference paper)
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  Views/Downloads: 45/14  |  Submitted: 2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning (Conference paper)
Queensland, Australia, 2023-6
Authors: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge
Adobe PDF(3027Kb)  |  Views/Downloads: 46/19  |  Submitted: 2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL (Conference paper)
New Orleans, LA, USA, December 10-16, 2023
Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan
Adobe PDF(8700Kb)  |  Views/Downloads: 40/12  |  Submitted: 2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism (Conference paper)
Washington, DC, USA, February 7-14, 2023
Authors: Zhiwei Xu; Yunpeng Bai; Bin Zhang; Dapeng Li; Guoliang Fan
Adobe PDF(3345Kb)  |  Views/Downloads: 29/7  |  Submitted: 2024/05/28
Spatial Domain Image Fusion with Particle Swarm Optimization and Lightweight AlexNet for Robotic Fish Sensor Fault Diagnosis (Journal article)
BIOMIMETICS, 2023, Volume: 8, Issue: 6, Pages: 489
Authors: Fan, Xuqing; Deng, Sai; Wu, Zhengxing; Fan, Junfeng; Zhou, Chao
Adobe PDF(5062Kb)  |  Views/Downloads: 121/8  |  Submitted: 2023/12/21
image fusion  lightweight AlexNet  particle swarm optimization  fault diagnosis  robotic fish  
Large sequence models for sequential decision-making: a survey (Journal article)
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors: Wen, Muning; Lin, Runji; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan
Adobe PDF(1351Kb)  |  Views/Downloads: 144/4  |  Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
Blood tau-PT217 contributes to the anesthesia/surgery-induced delirium-like behavior in aged mice (Journal article)
ALZHEIMERS & DEMENTIA, 2023, Volume: 19, Issue: 9, Pages: 17
Authors: Lu, Jing; Liang, Feng; Bai, Ping; Liu, Chenghao; Xu, Miao; Sun, Zhengwang; Tian, Wenjie; Dong, Yuanlin; Zhang, Yiying; Quan, Qimin; Khatri, Ashok; Shen, Yuan; Marcantonio, Edward; Crosby, Gregory; Culley, Deborah J.; Wang, Changning; Yang, Guang; Xie, Zhongcong
Views/Downloads: 170/0  |  Submitted: 2023/11/17
anesthesia  delirium  phosphorylated tau at threonine 217  surgery  tau  tau phosphorylation