| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. Conference paper, Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14. Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博). Adobe PDF(1663Kb)  |  Views/Downloads: 155/35  |  Submitted: 2023/07/05. Keywords: Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Pseudo Value Network Distillation for High-Performance Exploration. Conference paper, Australia, 2023-06. Authors: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品). Adobe PDF(5874Kb)  |  Views/Downloads: 126/37  |  Submitted: 2023/06/28 |
| Second-Order Global Attention Networks for Graph Classification and Regression. Conference paper, Beijing, China, August 27-28, 2022. Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu. Adobe PDF(69424Kb)  |  Views/Downloads: 174/67  |  Submitted: 2023/07/06 |
| MiaoSuan Wargame: A Multi-Mode Integrated Platform for Imperfect Information Game. Conference paper, Beijing, China, August 21-24, 2022. Authors: Jiale Xu; Jian Hu; Shixian Wang; Xuyang Yang; Wancheng Ni. Adobe PDF(726Kb)  |  Views/Downloads: 61/17  |  Submitted: 2023/06/28. Keywords: open platform human-computer gaming AI evaluation Turing test imperfect information game wargame |
| DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning. Conference paper, Indianapolis, IN, USA, 19-22 September 2021. Authors: Jinhao Xi; Fenghua Zhu; Yuanyuan Chen; Yisheng Lv; Chang Tan; Feiyue Wang. Adobe PDF(1652Kb)  |  Views/Downloads: 99/19  |  Submitted: 2023/06/26 |
| DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise. Conference paper, Xi'an, 2021.5.30-2021.6.5. Authors: Li, Jiayi; Li, Boyao; Lu, Tao; Lu, Ning; Cai, Yinghao; Wang, Shuo. Adobe PDF(5599Kb)  |  Views/Downloads: 177/33  |  Submitted: 2022/06/14 |
| Trajectory-based Split Hindsight Reverse Curriculum Learning. Conference paper, Prague, Czech Republic, 2021-9. Authors: Wu, Jiaxi; Zhang, Dianmin; Zhong, Shanlin; Qiao, Hong. Adobe PDF(5094Kb)  |  Views/Downloads: 203/43  |  Submitted: 2022/06/14. Keywords: Reinforcement Learning Curriculum Learning |
| Continual Learning for Fake Audio Detection. Conference paper, Online (Czech Republic), 2021-9. Authors: Ma Haoxin; Yi Jiangyan; Tao Jianhua; Bai Ye; Tian Zhengkun; Wang Chenglong. Adobe PDF(2113Kb)  |  Views/Downloads: 225/58  |  Submitted: 2022/06/20. Keywords: fake audio detection continual learning detecting fake without forgetting |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games. Conference paper, Shenzhen, China, 05-09 July 2021. Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹). Adobe PDF(2780Kb)  |  Views/Downloads: 208/38  |  Submitted: 2022/06/27. Keywords: Video Game Reinforcement Learning Quantile Regression Bellman residual Wasserstein Distance |
| Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning. Conference paper, online, 2021-2. Authors: Huang, Wenzhen; Yin Qiyue; Zhang Junge; Huang, Kaiqi. Adobe PDF(5676Kb)  |  Views/Downloads: 158/36  |  Submitted: 2022/01/11 |