CASIA OpenIR

浏览/检索结果: 共177条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Exploiting Curriculum Learning in Unsupervised Neural Machine Translation 会议论文
, Online, November 7–11, 2021
作者:  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(866Kb)  |  收藏  |  浏览/下载:18/2  |  提交时间:2024/06/13
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:9/5  |  提交时间:2024/06/12
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:6/2  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow 期刊论文
The Visual Computer, 2024, 页码: 1-15
作者:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF(2471Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/06/11
Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation 会议论文
, London, UK, May 29 - June 2, 2023
作者:  Lu Zhang;  Siqi Zhang;  Xu Yang;  Hong Qiao;  Zhiyong Li
Adobe PDF(963Kb)  |  收藏  |  浏览/下载:13/4  |  提交时间:2024/06/06
Alignment Rationale for Natural Language Inference 会议论文
, Online, 2021-8-1
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Zhao Yang;  Jun Zhao;  Kang Liu
Adobe PDF(1280Kb)  |  收藏  |  浏览/下载:7/3  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:26/6  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:16/4  |  提交时间:2024/05/30
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:16/4  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation