CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:8/5  |  提交时间:2024/06/25
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/06/11
Generative Calibration for In-context Learning 会议论文
, Singapore, 2023-10-6
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(763Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/06
Fault Diagnosis for Robotic Fish Sensors based on Spatial Domain Image Fusion and Convolution Neural Network 会议论文
, Tianjin, China, 2023-7
作者:  Xuqing Fan;  Sai Deng;  Junfeng Fan;  Chao Zhou;  Zhengxing Wu;  Yaming Ou;  Bin Zhang
Adobe PDF(1492Kb)  |  收藏  |  浏览/下载:24/7  |  提交时间:2024/06/05
Fault Diagnosis  GAF Fusion  CNN  Robotic Fish  
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:25/4  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:41/12  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:39/15  |  提交时间:2024/05/29
Spatial Domain Image Fusion with Particle Swarm Optimization and Lightweight AlexNet for Robotic Fish Sensor Fault Diagnosis 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 6, 页码: 489
作者:  Fan, Xuqing;  Deng, Sai;  Wu, Zhengxing;  Fan, Junfeng;  Zhou, Chao
Adobe PDF(5062Kb)  |  收藏  |  浏览/下载:117/6  |  提交时间:2023/12/21
image fusion  lightweight AlexNet  particle swarm optimization  fault diagnosis  robotic fish  
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 12, 页码: 12670-12679
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1587Kb)  |  收藏  |  浏览/下载:155/9  |  提交时间:2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:136/1  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system