已选(0)清除
条数/页: 排序方式: |
| BEVBert: Multimodal Map Pre-training for Language-guided Navigation 会议论文 Proceedings of the IEEE International Conference on Computer Vision, Paris, France, 2023-10-2 作者: Dong An; Yuankai Qi; Yangguang Li; Yan Huang; Liang Wang; Tieniu Tan; Jing Shao Adobe PDF(1722Kb)  |  收藏  |  浏览/下载:15/2  |  提交时间:2024/05/28 |
| Explainable Reinforcement Learning via a Causal World Model 会议论文 Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22 作者: Yu ZY(余忠蔚); Ruan JQ(阮景晴); Xing DP(邢登鹏) Adobe PDF(850Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/05/28 强化学习 可解释人工智能 因果推理 |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/05/28 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文 , Auckland, New Zealand, May 9-13, 2022 作者: Zhiwei Xu; Yunpeng Bai; Dapeng Li; Bin Zhang; Guoliang Fan Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/05/28 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Shenzhen, China, 18-22 July 2021 作者: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/05/28 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:174/39  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮) Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:147/55  |  提交时间:2023/06/29 |
| Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文 , Washington D.C., USA, 2023-2-9 作者: Qingyu Wang; Tielin Zhang; Minglun Han; Yi Wang; Duzhen Zhang; Bo Xu Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:156/48  |  提交时间:2023/06/20 |
| LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING 会议论文 , Taipei, Taiwan, 26 August 2022 作者: Zhan Yuan; Xu Zhiwei; Fan Guoliang Adobe PDF(2093Kb)  |  收藏  |  浏览/下载:149/50  |  提交时间:2023/06/08 |
| Wd3: Taming the estimation bias in deep reinforcement learning 会议论文 , Baltimore, MD, USA, 2020-12 作者: He Q(何强); Hou XW(侯新文) Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:203/39  |  提交时间:2022/06/27 deep reinforcement learning estimation bias neural networks |