CASIA OpenIR

Browse/Search results: 45 items in total, showing items 1-10

User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 21/8  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Modifying State Modeling for Simultaneous Machine Translation (Conference paper)
Bangkok, Thailand, August 11–16, 2024
Authors:  Donglei, Yu;  Xiaomian, Kang;  Yuchen, Liu;  YU, Zhou;  Chengqing, Zong
Adobe PDF(924Kb)  |  Views/Downloads: 17/8  |  Submitted: 2024/06/20
Learning to Correct Erroneous Words for Document Grounded Conversations (Conference paper)
Kuantan, Malaysia, 2023.02.23-2023.02.25
Authors:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  Views/Downloads: 29/13  |  Submitted: 2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Learning Robust Communication by Adversarial Training in Networked System Control (Journal article)
Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4
Authors:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  Views/Downloads: 29/10  |  Submitted: 2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  Views/Downloads: 30/8  |  Submitted: 2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning (Conference paper)
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  Views/Downloads: 45/14  |  Submitted: 2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Spiking Adaptive Dynamic Programming with Poisson Process (Conference paper)
Qingdao, Shandong Province, China, 2021-07-18
Authors:  Wei QL(魏庆来);  Han LY(韩立元);  Zhang TL(张铁林)
Adobe PDF(2334Kb)  |  Views/Downloads: 44/14  |  Submitted: 2024/05/28
Explainable Reinforcement Learning via a Causal World Model (Conference paper)
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macao, China, 2023-08-22
Authors:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  Views/Downloads: 31/13  |  Submitted: 2024/05/28
Reinforcement Learning  Explainable Artificial Intelligence  Causal Inference  
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  Views/Downloads: 196/44  |  Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Robust Graph Neural Networks Against Adversarial Attacks via Jointly Adversarial Training (Conference paper)
Shanghai, 2020-12-3
Authors:  Tian Hu;  Ye Bowei;  Zheng Xiaolong;  Zhang Xingwei;  Wu Dash Desheng
Adobe PDF(443Kb)  |  Views/Downloads: 160/53  |  Submitted: 2023/07/04