CASIA OpenIR

Browse/Search results: 22 items in total, showing items 1-10

Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology (Conference paper)
Beijing, 2020-12-5
Authors:  Wang Xin;  Wei Qinglai;  Song Ruizhuo
Adobe PDF(898Kb)  |  Views/Downloads: 85/34  |  Submitted: 2023/06/28
Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
Thailand, November 18–22
Authors:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 99/28  |  Submitted: 2023/06/27
Learning Individual Features to Decompose State Space for Robotic Skill Learning (Conference paper)
Online, 2020-8
Authors:  Fengyi Zhang;  Fangzhou Xiong;  Zhiyong Liu
Adobe PDF(622Kb)  |  Views/Downloads: 142/53  |  Submitted: 2023/01/12
Keywords: Robotic Skill Learning; Graph Neural Networks; State Decomposition
Collaborative Optimization of Cyber Physical Social Systems for Urban Transportation Based on Knowledge Automation (Conference paper)
Beijing, 2020-12-5
Authors:  Xiong, Gang;  Chen, Xiaoyu;  Shuo, Nan;  Lv, Yisheng;  Zhu, Fenghua;  Qu, Tianci;  Ye, Peijun
Adobe PDF(616Kb)  |  Views/Downloads: 218/78  |  Submitted: 2022/06/16
Pedestrian Choice Modeling and Simulation of Staged Evacuation Strategies in Daya Bay Nuclear Power Plant (Journal article)
IEEE Transactions on Computational Social Systems, 2020, Volume: 7, Issue: 3, Pages: 686-695
Authors:  Yang, Linyao;  Wang, Xiao;  Zhang, Jun Jason;  Zhou, Min;  Wang, Fei-Yue
Adobe PDF(2409Kb)  |  Views/Downloads: 149/49  |  Submitted: 2022/06/15
Keywords: Agent-based modeling; exit choice; pedestrian crowd evacuation; random forest (RF); staged evacuation
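Single-Intersection Traffic Signal Control Based on Deep Reinforcement Learning (基于深度强化学习的单路口交通信号控制) (Journal article)
交通工程 (Traffic Engineering), 2020, Volume: 20, Issue: 2, Pages: 54-59
Authors:  Liu Hao (刘皓);  Lv Yisheng (吕宜生)
Adobe PDF(3074Kb)  |  Views/Downloads: 229/90  |  Submitted: 2021/07/02
Keywords: deep reinforcement learning; deep Q-network; traffic signal control; intelligent transportation systems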
Continuous-Time Time-Varying Policy Iteration (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  Views/Downloads: 269/50  |  Submitted: 2021/03/02
Keywords: Optimal control; Nonlinear systems; Time-varying systems; Mathematical model; Dynamic programming; Approximation algorithms; Iterative algorithms; Adaptive critic designs; adaptive dynamic programming (ADP); neuro-dynamic programming; nonlinear systems; optimal control; policy iteration
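Research on Shared Control and Manipulation Skill Learning Methods for Intelligent Robots (智能机器人共享控制与操作技能学习方法研究) (Thesis)
Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2020
Author:  Xi Bao (席宝)
Adobe PDF(9051Kb)  |  Views/Downloads: 327/20  |  Submitted: 2021/02/01
Keywords: pose detection; shared control; reinforcement learning; policy gradient; demonstration guidance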
Acting As A Decision Maker: Traffic-Condition-Aware Ensemble Learning for Traffic Flow Prediction (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, Issue: Accepted, Pages: Accepted
Authors:  Yuanyuan Chen;  Hongyu Chen;  Peijun Ye;  Yisheng Lv;  Fei-Yue Wang
Adobe PDF(2382Kb)  |  Views/Downloads: 272/57  |  Submitted: 2020/10/16
Keywords: Traffic Flow Prediction; Ensemble Learning; Deep Learning
On Iterative Proportional Updating: Limitations and Improvements for General Population Synthesis (Journal article)
IEEE Transactions on Cybernetics, 2020, Volume: 99, Issue: 1, Pages: 1-10
Authors:  Peijun Ye;  Bin Tian;  Yisheng Lv;  Qijie Li;  Fei-Yue Wang
Adobe PDF(1310Kb)  |  Views/Downloads: 164/42  |  Submitted: 2020/10/15
Keywords: Agent-based simulation; bilevel optimization; iterative proportional updating (IPU); population synthesis