CASIA OpenIR

浏览/检索结果: 共27条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:71/24  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:65/22  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Design of a Robotic Fish Based on a Passive Flexible Mechanism 会议论文
, 云南大理, 2019.12.6
作者:  Lu Ben;  Yuzhuo Fu;  Qianqian Zou;  Sai Deng;  Chao Zhou
Adobe PDF(10378Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/05/28
Energy Based Optimal Dynamic Stealth False Data Injection Attacks on the Smart Grid 会议论文
Proceedings of 2020 7th International Conference on Information, Cybernetics, and Computational Social Systems, Guangzhou, China, 2020.11.13-15
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(1191Kb)  |  收藏  |  浏览/下载:150/52  |  提交时间:2023/06/29
smart grid, security, false data injection attack, optimal control  
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:186/66  |  提交时间:2023/06/28
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:201/74  |  提交时间:2023/06/20
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:197/57  |  提交时间:2023/06/12
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:166/70  |  提交时间:2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文
, Online, 05 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  收藏  |  浏览/下载:142/57  |  提交时间:2023/06/12
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:177/41  |  提交时间:2023/06/12