CASIA OpenIR

浏览/检索结果: 共35条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/06/25
强化学习,分层强化学习  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/06/05
Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing 会议论文
, Dublin, May 22–27, 2022
作者:  Sun, Runxin;  He, Shizhu;  Zhu, Chong;  He, Yaohan;  Li, Jinlong;  Zhao, Jun;  Liu, Kang
Adobe PDF(528Kb)  |  收藏  |  浏览/下载:54/18  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:50/15  |  提交时间:2024/05/28
A Performance Optimization Strategy Based on Improved NSGA-II for a Flexible Robotic Fish 会议论文
, 英国伦敦, 2023.5.29
作者:  Lu, Ben;  Wang, Jian;  Liao, Xiaocun;  Zou, Qianqian;  Tan, Min;  Zhou, Chao
Adobe PDF(1449Kb)  |  收藏  |  浏览/下载:64/17  |  提交时间:2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:203/47  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Evolution of opinions with estimation and interference 会议论文
Proceedings of 41st Chinese Control Conference, Hefei, 2022.7.25-27
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(214Kb)  |  收藏  |  浏览/下载:156/55  |  提交时间:2023/06/28
Opinion dynamics, Self-cognition, Estimation  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:172/34  |  提交时间:2023/06/21
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文
, Washington D.C., USA, 2023-2-9
作者:  Qingyu Wang;  Tielin Zhang;  Minglun Han;  Yi Wang;  Duzhen Zhang;  Bo Xu
Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:182/52  |  提交时间:2023/06/20
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:252/77  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent