CASIA OpenIR

Browse/search results: 85 items in total, showing items 1-10

A cooperation and decision-making framework in dynamic confrontation for multi-agent systems [Journal article]
Computers and Electrical Engineering, 2024, Pages: 118
Authors: Lexing Wang; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi
Adobe PDF(1302Kb) | Views/Downloads: 13/2 | Submitted: 2024/06/06
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning [Conference paper]
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰)
Adobe PDF(1496Kb) | Views/Downloads: 20/9 | Submitted: 2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process [Conference paper]
Singapore, 2023-12
Authors: Zhao Yang; Yuanzhe Zhang; Dianbo Sui; Cao Liu; Jun Zhao; Kang Liu
Adobe PDF(592Kb) | Views/Downloads: 23/9 | Submitted: 2024/05/30
Information bottleneck based knowledge selection for commonsense reasoning [Journal article]
Information Sciences, 2024, Volume: 660, Pages: 120134
Authors: Zhao Yang; Yuanzhe Zhang; Pengfei Cao; Cao Liu; Jiansong Chen; Jun Zhao; Kang Liu
Adobe PDF(1069Kb) | Views/Downloads: 18/7 | Submitted: 2024/05/30
SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis [Conference paper]
Yokohama, Japan, 2024-6-30 - 2024-7-5
Authors: Zefa Hu; Linghui Meng; Yunlong Zhao; Yuanyuan Zhao; Shuang Xu; Bo Xu
Adobe PDF(307Kb) | Views/Downloads: 25/5 | Submitted: 2024/05/29
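Research on Task-Oriented Dialogue Systems in the Medical Domain (医疗领域任务型对话系统研究) [Thesis]
2024
Author: 胡泽发
Adobe PDF(3935Kb) | Views/Downloads: 38/3 | Submitted: 2024/05/29
Medical dialogue systems  Task-oriented dialogue systems  Dialogue understanding  Dialogue reasoning  Hallucination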
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning [Conference paper]
Washington, DC, USA, February 7-14, 2023
Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Zeren Zhang; Guangchong Zhou; Hao Chen; Guoliang Fan
Adobe PDF(4141Kb) | Views/Downloads: 11/3 | Submitted: 2024/05/28
Contrastive Correlation Preserving Replay for Online Continual Learning [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 124-139
Authors: Yu, Da; Zhang, Mingyi; Li, Mantian; Zha, Fusheng; Zhang, Junge; Sun, Lining; Huang, Kaiqi
Views/Downloads: 39/0 | Submitted: 2024/03/26
Task analysis  Correlation  Knowledge transfer  Training  Memory management  Data models  Mutual information  Continual learning  catastrophic forgetting  class-incremental learning  experience replay  
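Weight-Adaptive Generative Adversarial Imitation Learning Based on Noise Contrastive Estimation (基于噪声对比估计的权重自适应对抗生成式模仿学习) [Journal article]
Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2023, Volume: 36, Issue: 4, Pages: 300-312
Authors: 关伟凡; 张希
Adobe PDF(1849Kb) | Views/Downloads: 129/44 | Submitted: 2023/06/29
Reinforcement learning  Imitation learning  Noise contrastive estimation  Adaptive weights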
Potential Driven Reinforcement Learning for Hard Exploration Tasks [Conference paper]
Online, 2020-4
Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(1999Kb) | Views/Downloads: 93/35 | Submitted: 2023/06/29