CASIA OpenIR

浏览/检索结果: 共65条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading 会议论文
, 台湾台北, 20240507-20240510
作者:  Cheng X(程曦);  Zhang JH(张景昊);  Ceng YN(曾宇楠);  Xue WF(薛文芳)
Adobe PDF(739Kb)  |  收藏  |  浏览/下载:52/13  |  提交时间:2024/06/03
Centralized Cooperative Exploration Policy for Continuous Control Tasks 会议论文
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  收藏  |  浏览/下载:63/21  |  提交时间:2024/05/30
continuous control tasks  cooperative exploration  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:48/14  |  提交时间:2024/05/30
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:139/40  |  提交时间:2023/06/27
Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文
, Austin TX, USA, December 5-9, 2022
作者:  Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh;  Xu BW(徐博文);  Lo David;  Hou XW(侯新文);  Fan GL(范国梁)
Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:137/54  |  提交时间:2023/06/27
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution 会议论文
, Padua, Italy, 18-23 July 2022
作者:  Yunpeng Bai;  Chen Gong;  Bin Zhang;  Guoliang Fan;  Xinwen Hou;  Yu Liu
Adobe PDF(8946Kb)  |  收藏  |  浏览/下载:153/42  |  提交时间:2023/06/14
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:257/46  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
Towards Unconstrained Pointing Problem of Visual Question Answering: A Retrieval-based Method 会议论文
, 北京国际会议中心, 2018-08
作者:  Cheng, Wenlong;  Huang, Yan;  Wang, Liang
Adobe PDF(351Kb)  |  收藏  |  浏览/下载:227/47  |  提交时间:2022/06/14
Graph-to-Graph: Towards Accurate and Interpretable Online Handwritten Mathematical Expression Recognition. 会议论文
, 线上会议, 2021-2-2至2021-2-9
作者:  Jin-Wen Wu;  Fei Yin;  Yan-Ming Zhang;  Xu-Yao Zhang;  Cheng-Lin Liu
Adobe PDF(369Kb)  |  收藏  |  浏览/下载:218/58  |  提交时间:2022/01/20
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting 会议论文
, 首尔, 2019.10.27
作者:  Wei Feng;  Wenhao He;  Fei Yin;  Xu-Yao Zhang;  Cheng-Lin Liu
Adobe PDF(2843Kb)  |  收藏  |  浏览/下载:238/59  |  提交时间:2021/07/06