CASIA OpenIR

浏览/检索结果: 共43条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:4/1  |  提交时间:2024/06/11
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:28/8  |  提交时间:2024/06/05
Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning 期刊论文
Physics of Fluids, 2024, 卷号: 36, 期号: 3, 页码: 031910
作者:  Cui,Xinyu;  Sun,Boai;  Zhu,Yi;  Yang,Ning;  Zhang,Haifeng;  Cui,Weicheng;  Fan,Dixia;  Wang,Jun
Adobe PDF(4056Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/06/02
bio-mimetic robotic fish  deep reinforcement learning  
Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 卷号: 23, 期号: 2, 页码: 1-19
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Yiming Ju;  Jun Zhao;  Kang Liu
Adobe PDF(1250Kb)  |  收藏  |  浏览/下载:22/7  |  提交时间:2024/05/30
Information bottleneck based knowledge selection for commonsense reasoning 期刊论文
Information Sciences, 2024, 卷号: 660, 页码: 120134
作者:  Zhao Yang;  Yuanzhe Zhang;  Pengfei Cao;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(1069Kb)  |  收藏  |  浏览/下载:20/8  |  提交时间:2024/05/30
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control 期刊论文
Biomimetics, 2024, 期号: 9, 页码: 126
作者:  Wang, Yu;  Wang, Jian;  Yu Lianyi;  Kong Shihan;  Yu Junzhi
Adobe PDF(1171Kb)  |  收藏  |  浏览/下载:23/6  |  提交时间:2024/05/30
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:14/4  |  提交时间:2024/05/28
Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems 会议论文
, 意大利, 2022-7
作者:  Chen YQ(陈逸群);  Yang Wei;  Tianle Zhang;  Shiguang Wu;  Hongxing Chang
Adobe PDF(15907Kb)  |  收藏  |  浏览/下载:147/33  |  提交时间:2023/06/28
基于扩散模型的生成图像质量改善方法研究 学位论文
, 2023
作者:  殷月琴
Adobe PDF(28050Kb)  |  收藏  |  浏览/下载:250/7  |  提交时间:2023/06/26
生成模型  图像生成  扩散模型  
Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文
, Macao, China, 2023-8
作者:  Pei Xu;  Junge Zhang;  Kaiqi Huang
Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:243/74  |  提交时间:2023/06/19