CASIA OpenIR

Browse/search results: 33 items in total, showing items 1-10

Centralized Cooperative Exploration Policy for Continuous Control Tasks [Conference paper]
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
Authors:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  Views/Downloads: 9/2  |  Submitted: 2024/05/30
continuous control tasks  cooperative exploration  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control [Conference paper]
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
Authors:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  Views/Downloads: 9/2  |  Submitted: 2024/05/30
Stable Training of Bellman Error in Reinforcement Learning [Conference paper]
, Thailand, November 18–22
Authors:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 105/31  |  Submitted: 2023/06/27
Curiosity-Driven and Victim-Aware Adversarial Policies [Conference paper]
, Austin TX, USA, December 5-9, 2022
Authors:  Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh;  Xu BW(徐博文);  Lo David;  Hou XW(侯新文);  Fan GL(范国梁)
Adobe PDF(4090Kb)  |  Views/Downloads: 117/47  |  Submitted: 2023/06/27
ADTIDO: Detecting the Tired Deck Officer with Fusion Feature Methods [Journal article]
SENSORS, 2022, Volume: 22, Issue: 17, Pages: 16
Authors:  Li, Chenghao;  Fu, Yuhui;  Ouyang, Ruihong;  Liu, Yu;  Hou, Xinwen
Views/Downloads: 169/0  |  Submitted: 2022/11/14
EEG  deck officer  fatigue detection  ECD-EEG fusion features  Bi-GRU neural network classifier  
Wd3: Taming the estimation bias in deep reinforcement learning [Conference paper]
, Baltimore, MD, USA, 2020-12
Authors:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  Views/Downloads: 204/39  |  Submitted: 2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games [Conference paper]
, Shenzhen, China, 05-09 July 2021
Authors:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  Views/Downloads: 227/41  |  Submitted: 2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
POPO: Pessimistic Offline Policy Optimization [Conference paper]
, Singapore, Singapore, 23-27 May 2022
Authors:  He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
Adobe PDF(1200Kb)  |  Views/Downloads: 192/39  |  Submitted: 2022/06/27
reinforcement learning  offline optimization  out-of-distribution  
An Improvement based on Wasserstein GAN for Alleviating Mode Collapsing [Conference paper]
, Virtual, Glasgow, United Kingdom, July 19, 2020 - July 24, 2020
Authors:  Yingying Chen;  Xinwen Hou
Adobe PDF(913Kb)  |  Views/Downloads: 268/50  |  Submitted: 2021/06/22
An intelligent identification system combining image and DNA sequence methods for fruit flies with economic importance (Diptera: Tephritidae) [Journal article]
PEST MANAGEMENT SCIENCE, 2021, Pages: 14
Authors:  Wang, Jiangning;  Chen, Yingying;  Hou, Xinwen;  Wang, Yong;  Zhou, Libing;  Chen, Xiaolin
Views/Downloads: 226/0  |  Submitted: 2021/05/17
fruit fly pests  intelligent identification system  image  deep learning  DNA sequence