CASIA OpenIR

浏览/检索结果: 共27条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:9/1  |  提交时间:2024/07/12
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:53/17  |  提交时间:2024/06/05
Joint caching and transmission in the mobile edge network: An multi-agent learning approach 会议论文
, Madrid, Spain, 2021-12-7
作者:  Mi,Qirui;  Yang,Ning;  Zhang,Haifeng;  Zhang,Haijun;  Wang,Jun
Adobe PDF(1724Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/06/05
Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning 期刊论文
Physics of Fluids, 2024, 卷号: 36, 期号: 3, 页码: 031910
作者:  Cui,Xinyu;  Sun,Boai;  Zhu,Yi;  Yang,Ning;  Zhang,Haifeng;  Cui,Weicheng;  Fan,Dixia;  Wang,Jun
Adobe PDF(4056Kb)  |  收藏  |  浏览/下载:54/19  |  提交时间:2024/06/02
bio-mimetic robotic fish  deep reinforcement learning  
An Empirical Study on Google Research Football Multi-agent Scenarios 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 549-570
作者:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  收藏  |  浏览/下载:49/11  |  提交时间:2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
基于门限和环签名的抗自适应攻击拜占庭容错共识算法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 7, 页码: 1471-1482
作者:  孙海锋;  张文芳;  王小敏;  马征;  黄路非;  李暄
Adobe PDF(2182Kb)  |  收藏  |  浏览/下载:56/19  |  提交时间:2024/04/25
区块链  拜占庭容错  共识算法  自适应攻击  环签名  门限签名  
Offline Pre-trained Multi-agent Decision Transformer 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248
作者:  Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing;  Weinan Zhang;  Ying Wen;  Haifeng Zhang;  Jun Wang;  Yaodong Yang;  Bo Xu
Adobe PDF(2121Kb)  |  收藏  |  浏览/下载:49/13  |  提交时间:2024/04/23
Pre-training model  multi-agent reinforcement learning (MARL)  decision making  transformer  offline reinforcement learning  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:146/4  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
A GAN-Based Short-Term Link Traffic Prediction Approach for Urban Road Networks Under a Parallel Learning Framework 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 页码: 12
作者:  Jin, Junchen;  Rong, Dingding;  Zhang, Tong;  Ji, Qingyuan;  Guo, Haifeng;  Lv, Yisheng;  Ma, Xiaoliang;  Wang, Fei-Yue
收藏  |  浏览/下载:275/0  |  提交时间:2022/06/06
Roads  Predictive models  Data models  Recurrent neural networks  Generators  Computer architecture  Deep learning  Short-term link speed prediction  signalized urban networks  Wasserstein generative adversarial network  
Integration of Train Control and Online Rescheduling for High-Speed Railways in Case of Emergencies 期刊论文
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 页码: 9
作者:  Dong, Hairong;  Liu, Xuan;  Zhou, Min;  Zheng, Wei;  Xun, Jing;  Gao, Shigen;  Song, Haifeng;  Li, Yidong;  Wang, Fei-Yue
收藏  |  浏览/下载:209/0  |  提交时间:2022/01/27
Rail transportation  Control systems  Delays  Dispatching  Trajectory  Wind  Optimization  Carrying capacity  emergencies  high-speed railways (HSRs)  integration  online rescheduling  train control