CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:12/6  |  提交时间:2024/06/25
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:29/10  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:30/8  |  提交时间:2024/06/05
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:168/33  |  提交时间:2023/06/21
鸡尾酒会问题与相关听觉模型的研究现状与展望 期刊论文
自动化学报, 2019, 卷号: 45, 期号: 2, 页码: 234-251
作者:  黄雅婷;  石晶;  许家铭;  徐波
Adobe PDF(3009Kb)  |  收藏  |  浏览/下载:220/75  |  提交时间:2022/09/17
Compressing Speaker Extraction Model with Ultra-low Precision Quantization and Knowledge Distillation 期刊论文
Neural Networks, 2022, 卷号: 154, 页码: 13-21
作者:  Yating Huang;  Yunzhe Hao;  Jiaming Xu;  Bo Xu
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:230/60  |  提交时间:2022/09/17
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation 期刊论文
Algorithms, 2021, 期号: 14, 页码: 98
作者:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  收藏  |  浏览/下载:262/53  |  提交时间:2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
一种针对德州扑克AI的对手建模与策略集成框架 期刊论文
自动化学报, 2021, 期号: 0, 页码: 0
作者:  张蒙;  李凯;  吴哲;  臧一凡;  徐航;  兴军亮
Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:428/120  |  提交时间:2021/06/21
不完美信息博弈  德州扑克  演化学习  在线对手建模  种群策略集成  
Implementation of a multi-link robotic dolphin with two 3-DOF flippers 期刊论文
Journal of Computational Information Systems, 2011, 卷号: 7, 期号: 7, 页码: 2601-2607
作者:  Shen Fei;  Wei Changming;  Cao Zhiqiang;  Xu De;  Yu Junzhi;  Zhou Chao
浏览  |  Adobe PDF(311Kb)  |  收藏  |  浏览/下载:318/76  |  提交时间:2015/08/12
Multi-link Structure  3-dof Flipper  Motion Control  Robotic Dolphin