CASIA OpenIR

Browse/Search results: 19 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
, Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF(7948Kb) | Views/Downloads: 9/5 | Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb) | Views/Downloads: 8/5 | Submitted: 2024/06/25
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction (Conference paper)
, Greece, 2023-5
Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华)
Adobe PDF(7610Kb) | Views/Downloads: 7/4 | Submitted: 2024/06/21
Bidirectional Sentence Ordering with Interactive Decoding (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, Volume: 22, Issue: 2, Pages: 1-15
Authors: Guirong Bai; Shizhu HE; Kang Liu; Jun Zhao
Adobe PDF(1080Kb) | Views/Downloads: 18/5 | Submitted: 2024/06/20
P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification (Conference paper)
, Dublin, Ireland, 2023.08.24
Authors: Wang XY(王溪源); Wang FY(王方圆); Xu B(徐波); Xu L(徐亮); Xiao J(肖京)
Adobe PDF(1542Kb) | Views/Downloads: 41/10 | Submitted: 2024/06/12
M3: Modularization for Multi-task and Multi-agent Offline Pre-training (Conference paper)
, London, United Kingdom, 2023.5.29-2023.6.2
Authors: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo
Adobe PDF(1302Kb) | Views/Downloads: 18/4 | Submitted: 2024/06/11
Fault Diagnosis for Robotic Fish Sensors based on Spatial Domain Image Fusion and Convolution Neural Network (Conference paper)
, Tianjin, China, 2023-7
Authors: Xuqing Fan; Sai Deng; Junfeng Fan; Chao Zhou; Zhengxing Wu; Yaming Ou; Bin Zhang
Adobe PDF(1492Kb) | Views/Downloads: 24/7 | Submitted: 2024/06/05
Fault Diagnosis  GAF Fusion  CNN  Robotic Fish  
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning (Conference paper)
, Washington, DC, USA, February 7-14, 2023
Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Zeren Zhang; Guangchong Zhou; Hao Chen; Guoliang Fan
Adobe PDF(4141Kb) | Views/Downloads: 26/8 | Submitted: 2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship (Conference paper)
, New Orleans, 2023-12
Authors: Shiyu, Hu; Dailing, Zhang; Meiqi, Wu; Xiaokun, Feng; Xuchen, Li; Xin, Zhao; Kaiqi, Huang
Adobe PDF(6215Kb) | Views/Downloads: 108/23 | Submitted: 2024/01/22
Spatial Domain Image Fusion with Particle Swarm Optimization and Lightweight AlexNet for Robotic Fish Sensor Fault Diagnosis (Journal article)
BIOMIMETICS, 2023, Volume: 8, Issue: 6, Pages: 489
Authors: Fan, Xuqing; Deng, Sai; Wu, Zhengxing; Fan, Junfeng; Zhou, Chao
Adobe PDF(5062Kb) | Views/Downloads: 117/6 | Submitted: 2023/12/21
image fusion  lightweight AlexNet  particle swarm optimization  fault diagnosis  robotic fish