CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 104 items in total, showing items 1-10

User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13-17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 30/14  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations (Journal article)
IEEE Transactions on Artificial Intelligence, 2024, pages: 1-12
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF(1162Kb)  |  Views/Downloads: 29/8  |  Submitted: 2024/06/24
Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator (Conference paper)
Siem Reap, Cambodia, December 13-16, 2018
Authors:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(697Kb)  |  Views/Downloads: 27/14  |  Submitted: 2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game (Journal article)
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, pages: 1-13
Authors:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  Views/Downloads: 38/7  |  Submitted: 2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2024, pages: 1-16
Authors:  Liu, Yuqi;  Zhang, Qichao;  Gao, Yinfeng;  Zhao, Dongbin
Adobe PDF(22863Kb)  |  Views/Downloads: 38/13  |  Submitted: 2024/06/03
Reinforcement Learning  Autonomous Driving  Intersection Navigating  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems (Journal article)
IEEE Transactions on Emerging Topics in Computational Intelligence, 2023, pages: 12
Authors:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
Views/Downloads: 109/0  |  Submitted: 2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning (Conference paper)
Queensland, 2023-06
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  Views/Downloads: 255/81  |  Submitted: 2023/06/29
multi-agent  reinforcement learning  policy gradient  
Dense Attention: A Densely Connected Attention Mechanism for Vision Transformer (Conference paper)
Queensland, Australia, June 18-23, 2023
Authors:  Nannan Li;  Yaran Chen;  Dongbin Zhao
Adobe PDF(3683Kb)  |  Views/Downloads: 171/41  |  Submitted: 2023/06/28
Multi-Objective Neural Architecture Search for Light-Weight Model (Conference paper)
Hangzhou, China, 22-24 November 2019
Authors:  Nannan Li;  Yaran Chen;  Zixiang Ding;  Dongbin Zhao;  Zhonghua Pang;  Ruisheng Qin
Adobe PDF(430Kb)  |  Views/Downloads: 164/54  |  Submitted: 2023/06/27
Neural architecture search  light-weight  multi-objective  reinforcement learning  image classification  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming (Conference paper)
Budapest, Hungary, 2019-7-14
Authors:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  Views/Downloads: 73/35  |  Submitted: 2023/05/22