CASIA OpenIR

浏览/检索结果: 共102条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:48/20  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Visual Tracking via Spatially Aligned Correlation Filters Network 会议论文
, Munich, Germany, September 8, 2018 - September 14, 2018
作者:  Zhang, Mengdan;  Wang, Qiang;  Xing, Junliang;  Gao, Jin;  Peng, Peixi;  Hu, Weiming;  Maybank, Steve
Adobe PDF(1118Kb)  |  收藏  |  浏览/下载:49/17  |  提交时间:2024/06/21
A Novel Approach to the Analysis of Altered Human Motor Synergistic Structures 会议论文
, Macau, China, November 9-11
作者:  Jingyao Chen;  Chen Wang;  Ningcun Xu;  Zeng-Guang Hou;  Liang Peng;  Pingye Deng;  Pu Zhang;  Chutian Zhang
Adobe PDF(543Kb)  |  收藏  |  浏览/下载:48/18  |  提交时间:2024/06/21
MILP Models for Flexible Job Shop Scheduling with Spatial Constraints and Sequence Flexibility 会议论文
2024 IEEE 20th International Conference on Automation Science and Engineering, Bari,Italy, 2024年8月28
作者:  Han, Yunjun(韩云君);  Peng,Shaoming;  Shen, Zhen;  Tao,Zhikun;  Xiong, Gang
Adobe PDF(397Kb)  |  收藏  |  浏览/下载:63/19  |  提交时间:2024/06/11
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:51/18  |  提交时间:2024/06/05
Traffic Signal Control Based on Reinforcement Learning and Fuzzy Neural Network 会议论文
, Macau, China, October 8-12, 2022
作者:  Zhao, Hongxia;  Chen, Songhang;  Zhu, Fenghua;  Tang, Haina
Adobe PDF(565Kb)  |  收藏  |  浏览/下载:46/20  |  提交时间:2024/06/03
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:66/23  |  提交时间:2024/05/29
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:62/29  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:59/17  |  提交时间:2024/05/28