CASIA OpenIR

Browse/Search Results:  1-10 of 345 Help

Selected(0)Clear Items/Page:    Sort:
跨语言语义关联增强的无监督机器翻译方法研究 学位论文
, 2024
Authors:  陆金梁
Adobe PDF(3544Kb)  |  Favorite  |  View/Download:26/1  |  Submit date:2024/06/13
神经机器翻译,跨语言预训练,译文质量估计,译文回翻,互信息  
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
Authors:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  Favorite  |  View/Download:6/3  |  Submit date:2024/06/12
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
Authors:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  Favorite  |  View/Download:4/2  |  Submit date:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
Authors:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  Favorite  |  View/Download:8/1  |  Submit date:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
Authors:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  Favorite  |  View/Download:3/1  |  Submit date:2024/06/11
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文
, Seoul, Korea, 2024.4.14-2024.4.19
Authors:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  Favorite  |  View/Download:2/0  |  Submit date:2024/06/11
基于预训练模型的决策序列化建模研究 学位论文
, 2024
Authors:  林润基
Adobe PDF(7811Kb)  |  Favorite  |  View/Download:33/0  |  Submit date:2024/06/07
预训练模型  决策序列化  序列模型  
Learn to flap: foil non-parametric path planning via deep reinforcement learning 期刊论文
Journal of Fluid Mechanics, 2024, 卷号: 984, 页码: A9
Authors:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang,Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  Favorite  |  View/Download:17/2  |  Submit date:2024/06/07
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1591-1604
Authors:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF(2128Kb)  |  Favorite  |  View/Download:7/2  |  Submit date:2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
A survey of approaches for implementing optical neural networks 期刊
创刊日期: 2021, 收录类别: SCI, 出版者: ELSEVIER SCI LTD,
Sponsors:  Runqin Xu
Adobe PDF(9982Kb)  |  Favorite  |  View/Download:13/2  |  Submit date:2024/06/06
Artificial intelligence  Optics  Optical neural network