CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Bidirectional Sentence Ordering with Interactive Decoding 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 2, 页码: 1-15
作者:  Guirong Bai;  Shizhu HE;  Kang Liu;  Jun Zhao
Adobe PDF(1080Kb)  |  收藏  |  浏览/下载:18/5  |  提交时间:2024/06/20
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:15/5  |  提交时间:2024/06/03
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding 会议论文
, Rhodes, Greece, 2023-6-6 - 2023-6-10
作者:  Zefa Hu;  Xiuyi Chen;  Haoran Wu;  Minglun Han;  Ziyi Ni;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1049Kb)  |  收藏  |  浏览/下载:37/12  |  提交时间:2024/05/29
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/05/29
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:29/7  |  提交时间:2024/05/28
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 12, 页码: 12670-12679
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1587Kb)  |  收藏  |  浏览/下载:155/9  |  提交时间:2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:136/1  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15
作者:  Li, Kai;  Xu, Hang;  Zhao, Enmin;  Wu, Zhe;  Xing, Junliang
收藏  |  浏览/下载:123/0  |  提交时间:2023/11/17
Artificial intelligence (AI)  benchmark  imperfect-information game  Nash equilibrium  no-limit Texas hold'em (NLTH)  
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:150/43  |  提交时间:2023/06/28
TinyNeRF: Towards 100 times Compression of Volume Radiance Fields 会议论文
, 线上, 2023-02
作者:  Zhao TL(赵天理);  Chen JY(陈嘉园);  Leng C(冷聪);  Cheng J(程健)
Adobe PDF(2855Kb)  |  收藏  |  浏览/下载:189/38  |  提交时间:2023/06/21
Neural Radiance Fields  Discrete Cosine Transformation  Frequency Domain