CASIA OpenIR

Browse/Search results: 100 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF (7948Kb) | Views/Downloads: 4/3 | Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13-17, 2024
Authors: Zhang, Zhiyuan; Zhang, Qichao; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, Xingxing; Zhao, Dongbin
Adobe PDF (2077Kb) | Views/Downloads: 9/4 | Submitted: 2024/06/25
Ads Allocation, Reinforcement Learning, User Response Modeling
Visual Tracking via Spatially Aligned Correlation Filters Network (Conference paper)
Munich, Germany, September 8-14, 2018
Authors: Zhang, Mengdan; Wang, Qiang; Xing, Junliang; Gao, Jin; Peng, Peixi; Hu, Weiming; Maybank, Steve
Adobe PDF (1118Kb) | Views/Downloads: 15/3 | Submitted: 2024/06/21
A Novel Approach to the Analysis of Altered Human Motor Synergistic Structures (Conference paper)
Macau, China, November 9-11
Authors: Jingyao Chen; Chen Wang; Ningcun Xu; Zeng-Guang Hou; Liang Peng; Pingye Deng; Pu Zhang; Chutian Zhang
Adobe PDF (543Kb) | Views/Downloads: 11/5 | Submitted: 2024/06/21
MILP Models for Flexible Job Shop Scheduling with Spatial Constraints and Sequence Flexibility (Conference paper)
2024 IEEE 20th International Conference on Automation Science and Engineering, Bari, Italy, August 28, 2024
Authors: Han, Yunjun (韩云君); Peng, Shaoming; Shen, Zhen; Tao, Zhikun; Xiong, Gang
Adobe PDF (397Kb) | Views/Downloads: 28/7 | Submitted: 2024/06/11
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning (Conference paper)
Padua, Italy, July 2022
Authors: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Wanmai Yuan
Adobe PDF (2650Kb) | Views/Downloads: 19/5 | Submitted: 2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors: Qingxu Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan; Zhiqiang Pu
Adobe PDF (996Kb) | Views/Downloads: 21/5 | Submitted: 2024/06/05
Enhanced long-range communication among large scale brain networks during the pre-microsaccadic period (Conference paper)
Orlando, Florida, USA, July 15-19, 2024
Authors: Gao, Ying; He, Huiguang; Sabel, Bernhard
Adobe PDF (861Kb) | Views/Downloads: 32/10 | Submitted: 2024/06/03
Traffic Signal Control Based on Reinforcement Learning and Fuzzy Neural Network (Conference paper)
Macau, China, October 8-12, 2022
Authors: Zhao, Hongxia; Chen, Songhang; Zhu, Fenghua; Tang, Haina
Adobe PDF (565Kb) | Views/Downloads: 19/10 | Submitted: 2024/06/03
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning (Conference paper)
Queensland, Australia, 2023-6
Authors: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge
Adobe PDF (3027Kb) | Views/Downloads: 36/14 | Submitted: 2024/05/29