已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang ; Yang Yiming ; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(7948Kb)  |   收藏  |  浏览/下载:42/16  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文 , Greece, 2023-5 作者: Xu YF(徐一凡) ; Pu ZQ(蒲志强) ; Cai QA(蔡奇昂) ; Li FM(李非墨) ; Chai XH(柴兴华)
Adobe PDF(7610Kb)  |   收藏  |  浏览/下载:27/12  |  提交时间:2024/06/21 |
| P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification 会议论文 , Dublin, Ireland, 2023.08.24 作者: Wang XY(王溪源) ; Wang FY(王方圆) ; Xu B(徐波) ; Xu L(徐亮); Xiao J(肖京)
Adobe PDF(1542Kb)  |   收藏  |  浏览/下载:64/16  |  提交时间:2024/06/12 |
| Generative Calibration for In-context Learning 会议论文 , Singapore, 2023-10-6 作者: Zhongtao Jiang ; Yuanzhe Zhang ; Cao Liu ; Jun Zhao ; Kang Liu
Adobe PDF(763Kb)  |   收藏  |  浏览/下载:45/21  |  提交时间:2024/06/06 |
| Fault Diagnosis for Robotic Fish Sensors based on Spatial Domain Image Fusion and Convolution Neural Network 会议论文 , Tianjin, China, 2023-7 作者: Xuqing Fan ; Sai Deng ; Junfeng Fan ; Chao Zhou ; Zhengxing Wu ; Yaming Ou; Bin Zhang
Adobe PDF(1492Kb)  |   收藏  |  浏览/下载:43/11  |  提交时间:2024/06/05 Fault Diagnosis GAF Fusion CNN Robotic Fish |
| Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦) ; Gao Y(高阳) ; Zhang BF(张保丰); Chang H(常惠) ; Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |   收藏  |  浏览/下载:65/22  |  提交时间:2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play. |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu ; Bin Zhang; Dapeng Li ; Guangchong Zhou; Zeren Zhang; Guoliang Fan![](/image/person.jpg)
Adobe PDF(8700Kb)  |   收藏  |  浏览/下载:56/15  |  提交时间:2024/05/28 |
| Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文 , Washington, DC, USA, February 7-14, 2023 作者: Zhiwei Xu ; Bin Zhang; Dapeng Li ; Zeren Zhang; Guangchong Zhou; Hao Chen ; Guoliang Fan![](/image/person.jpg)
Adobe PDF(4141Kb)  |   收藏  |  浏览/下载:48/19  |  提交时间:2024/05/28 |
| SOTVerse: A User-Defined Task Space of Single Object Tracking 期刊论文 International Journal of Computer Vision, 2023, 卷号: 132, 期号: 3, 页码: 1-59 作者: Shiyu, Hu ; Xin, Zhao; Kaiqi Huang![](/image/person.jpg)
Adobe PDF(53048Kb)  |   收藏  |  浏览/下载:90/8  |  提交时间:2024/01/22 Single object tracking Experimental environment Evaluation system Performance analysis |