CASIA OpenIR

Browse/Search results: 26 items in total, showing items 1-10

Centralized Cooperative Exploration Policy for Continuous Control Tasks (Conference paper)
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
Authors: Chao Li; Chen Gong; Qiang He; Xinwen Hou; Yu Liu
Adobe PDF (2175Kb) | Views/Downloads: 17/5 | Submitted: 2024/05/30
continuous control tasks  cooperative exploration  
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 13
Authors: Wei, Qinglai; Zhou, Tianmin; Lu, Jingwei; Liu, Yu; Su, Shuai; Xiao, Jun
Views/Downloads: 135/0 | Submitted: 2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
A multi-source behavioral and physiological recording system for cognitive assessment (Journal article)
SCIENTIFIC REPORTS, 2023, Volume: 13, Issue: 1, Pages: 12
Authors: Wang, Zi-yang; Liu, Li; Liu, Yu
Views/Downloads: 31/0 | Submitted: 2023/11/17
Data-driven adaptive-critic optimal output regulation towards water level control of boiler-turbine systems (Journal article)
Expert Systems with Applications, 2022, Pages: 117883
Authors: Wei Qinglai; Wang Xin; Liu Yu; Xiong Gang
Adobe PDF (2135Kb) | Views/Downloads: 167/59 | Submitted: 2023/05/23
CASIA’s System for IWSLT 2020 Open Domain Translation (Conference paper)
, Online, 2020-6-9
Authors: Wang, Qian; Liu, Yuchen; Ma, Cong; Lu, Yu; Wang, Yining; Zhou, Long; Zhao, Yang; Zhang, Jiajun; Zong, Chengqing
Adobe PDF (547Kb) | Views/Downloads: 181/58 | Submitted: 2023/01/16
A Virtual Reality Platform for Context-Dependent Cognitive Research in Rodents (Journal article)
NEUROSCIENCE BULLETIN, 2022, Pages: 14
Authors: Qu, Xue-Tong; Wu, Jin-Ni; Wen, Yunqing; Chen, Long; Lv, Shi-Lei; Liu, Li; Zhan, Li-Jie; Liu, Tian-Yi; He, Hua; Liu, Yu; Xu, Chun
Views/Downloads: 251/0 | Submitted: 2022/12/27
Virtual reality  Spatial context  Contextual behavior  Hippocampus  Place cell  Learning and memory  Ca2+ imaging  
Pruning the Communication Bandwidth between Reinforcement Learning Agents through Causal Inference: An Innovative Approach to Designing a Smart Grid Power System (Journal article)
SENSORS, 2022, Volume: 22, Issue: 20, Pages: 24
Authors: Zhang, Xianjie; Liu, Yu; Li, Wenjun; Gong, Chen
Views/Downloads: 148/0 | Submitted: 2022/11/21
smart grid  deep reinforcement learning  cooperative agents  communication  causal model  estimating ITE  variational auto-encoder  
ADTIDO: Detecting the Tired Deck Officer with Fusion Feature Methods (Journal article)
SENSORS, 2022, Volume: 22, Issue: 17, Pages: 16
Authors: Li, Chenghao; Fu, Yuhui; Ouyang, Ruihong; Liu, Yu; Hou, Xinwen
Views/Downloads: 178/0 | Submitted: 2022/11/14
EEG  deck officer  fatigue detection  ECD-EEG fusion features  Bi-GRU neural network classifier  
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games (Conference paper)
, Shenzhen, China, 05-09 July 2021
Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹)
Adobe PDF (2780Kb) | Views/Downloads: 234/41 | Submitted: 2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
POPO: Pessimistic Offline Policy Optimization (Conference paper)
, Singapore, Singapore, 23-27 May 2022
Authors: He Q(何强); Hou XW(侯新文); Liu Y(刘禹)
Adobe PDF (1200Kb) | Views/Downloads: 197/39 | Submitted: 2022/06/27
reinforcement learning  offline optimization  out-of-distribution