CASIA OpenIR

Browse/Search Results:  1-10 of 20

Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL (Conference paper)
Nanjing, 2023-11-27
Authors:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  Favorite  |  View/Download:19/4  |  Submit date:2024/07/12
NIR-II fluorescence imaging-guided colorectal cancer surgery targeting CEACAM5 by a nanobody (Journal article)
EBioMedicine, 2023, Pages: 104476
Authors:  Guo XY(郭晓勇);  Li ZJ(李长剑);  Jia XH(贾晓花);  Qu Yawei;  Li Miaomiao;  Cao Caiguang;  Zhang Zeyu;  Qu Qiaojun;  Luo Shuangling;  Tang Jianqiang;  Liu Haifeng;  Hu Zhenhua;  Tian Jie
Adobe PDF(4688Kb)  |  Favorite  |  View/Download:30/10  |  Submit date:2024/06/25
Learning Robust Communication by Adversarial Training in Networked System Control (Journal article)
Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4
Authors:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  Favorite  |  View/Download:41/16  |  Submit date:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Token-level Direct Preference Optimization (Conference paper)
Vienna, Austria, 2024/7/21-27
Authors:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  Favorite  |  View/Download:65/22  |  Submit date:2024/06/05
Joint caching and transmission in the mobile edge network: An multi-agent learning approach (Conference paper)
Madrid, Spain, 2021-12-7
Authors:  Mi,Qirui;  Yang,Ning;  Zhang,Haifeng;  Zhang,Haijun;  Wang,Jun
Adobe PDF(1724Kb)  |  Favorite  |  View/Download:41/11  |  Submit date:2024/06/05
Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning (Journal article)
Physics of Fluids, 2024, Volume: 36, Issue: 3, Pages: 031910
Authors:  Cui,Xinyu;  Sun,Boai;  Zhu,Yi;  Yang,Ning;  Zhang,Haifeng;  Cui,Weicheng;  Fan,Dixia;  Wang,Jun
Adobe PDF(4056Kb)  |  Favorite  |  View/Download:65/26  |  Submit date:2024/06/02
bio-mimetic robotic fish  deep reinforcement learning  
An Empirical Study on Google Research Football Multi-agent Scenarios (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 549-570
Authors:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  Favorite  |  View/Download:55/15  |  Submit date:2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
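Research on a Hierarchical Architecture Model for the Operation and Maintenance of High-Speed Railway Signaling Systems (Journal article)
Acta Automatica Sinica (自动化学报), 2022, Volume: 48, Issue: 1, Pages: 152-161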
Authors:  林鹏;  田宇;  袁志明;  张琦;  董海荣;  宋海锋;  阳春华
Adobe PDF(2180Kb)  |  Favorite  |  View/Download:55/22  |  Submit date:2024/05/20
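High-speed railway signaling system operation and maintenance  hierarchical architecture model  quantitative evaluation  risk early warning  fault diagnosis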
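A Byzantine Fault Tolerant Consensus Algorithm Resistant to Adaptive Attacks Based on Threshold and Ring Signatures (Journal article)
Acta Automatica Sinica (自动化学报), 2023, Volume: 49, Issue: 7, Pages: 1471-1482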
Authors:  孙海锋;  张文芳;  王小敏;  马征;  黄路非;  李暄
Adobe PDF(2182Kb)  |  Favorite  |  View/Download:62/22  |  Submit date:2024/04/25
Blockchain  Byzantine fault tolerance  consensus algorithm  adaptive attack  ring signature  threshold signature
Offline Pre-trained Multi-agent Decision Transformer (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 233-248
Authors:  Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing;  Weinan Zhang;  Ying Wen;  Haifeng Zhang;  Jun Wang;  Yaodong Yang;  Bo Xu
Adobe PDF(2121Kb)  |  Favorite  |  View/Download:57/14  |  Submit date:2024/04/23
Pre-training model  multi-agent reinforcement learning (MARL)  decision making  transformer  offline reinforcement learning