验证码:

换一张

忘记密码？记住我

取消登录

切换中国科技网通行证登录

切换中国科技网通行证登录

取消

中文版 | English

中国科学院自动化研究所机构知识库

Knowledge Commons of Institute of Automation，CAS

登录注册

图片搜索

粘贴图片网址

首页
研究单元&专题
作者
文献类型
知识图谱
新闻&公告

在结果中检索

研究单元&专题

多模态人工智能系统... [95]

复杂系统认知与决策... [27]

脑图谱与类脑智能实... [18]

智能感知与计算研究... [13]

中科院工业视觉智能装... [9]

紫东太初大模型研究中... [7]

作者

文献类型

期刊论文 [125]

会议论文 [51]

发表日期

语种

英语 [177]

出处

NEUROCOMPU... [7]

IEEE TRANS... [5]

IEEE TRANS... [5]

PATTERN RE... [5]

IEEE TRANS... [4]

IEEE TRANS... [4]

资助项目

Major Proj... [4]

National N... [3]

National N... [3]

National N... [3]

National N... [3]

National N... [3]

收录类别

EI [43]

导师

资助机构

National ... [38]

National ... [18]

National K... [4]

National N... [4]

Beijing Na... [3]

Major Proj... [3]

知识图谱

CASIA OpenIR

已提交作品

待认领作品

已认领作品

未提交全文

浏览/检索结果: 共177条，第1-10条

帮助

限定条件

语种：英语

已选(0)清除条数/页：排序方式：
	Exploiting Curriculum Learning in Unsupervised Neural Machine Translation 会议论文 , Online, November 7–11, 2021 作者: Lu JL(陆金梁); Zhang JJ(张家俊) Adobe PDF(866Kb) \| 收藏 \| 浏览/下载：18/2 \| 提交时间：2024/06/13
	Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文 IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219 作者: Zhang,Haijun; Yang,Ning; Huangfu,Wei; Long,Keping; Leung,VictorCM Adobe PDF(1925Kb) \| 收藏 \| 浏览/下载：9/5 \| 提交时间：2024/06/12
	Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文 /, Orlando, FL, USA, 2023-11 作者: Wang, Yuxiao; Dai, Xingyuan; Wang, Kara; Ali, Hub; Zhu, Fenghua Adobe PDF(1410Kb) \| 收藏 \| 浏览/下载：6/2 \| 提交时间：2024/06/11 Imitation Learning Trajectory Planning Deep Reinforcement Learning Autonomous Driving
	Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow 期刊论文 The Visual Computer, 2024, 页码: 1-15 作者: Yu T(余挺); Meng WL(孟维亮); Wu ZQ(吴仲琦); Guo JW(郭建伟); Zhang XP(张晓鹏) Adobe PDF(2471Kb) \| 收藏 \| 浏览/下载：5/2 \| 提交时间：2024/06/11
	Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation 会议论文 , London, UK, May 29 - June 2, 2023 作者: Lu Zhang; Siqi Zhang; Xu Yang; Hong Qiao; Zhiyong Li Adobe PDF(963Kb) \| 收藏 \| 浏览/下载：13/4 \| 提交时间：2024/06/06
	Alignment Rationale for Natural Language Inference 会议论文 , Online, 2021-8-1 作者: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu Adobe PDF(1280Kb) \| 收藏 \| 浏览/下载：7/3 \| 提交时间：2024/06/06
	Token-level Direct Preference Optimization 会议论文 , Vienna, Austria, 2024/7/21-27 作者: Zeng,Yongcheng; Liu,Guoqing; Ma,Weiyu; Yang,Ning; Zhang,Haifeng; Wang,Jun Adobe PDF(883Kb) \| 收藏 \| 浏览/下载：26/6 \| 提交时间：2024/06/05
	Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文 Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8 作者: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰) Adobe PDF(1496Kb) \| 收藏 \| 浏览/下载：20/9 \| 提交时间：2024/05/31 Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.
	Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文 Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10 作者: Chao Li; Chen Gong; Qiang He; Xinwen Hou Adobe PDF(1457Kb) \| 收藏 \| 浏览/下载：16/4 \| 提交时间：2024/05/30
	Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文 Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078 作者: Qiu JY(邱俊彦); Haidong Zhang; Yiping Yang Adobe PDF(831Kb) \| 收藏 \| 浏览/下载：16/4 \| 提交时间：2024/05/29 reinforcement learning dialogue policy learning curriculum learning knowledge distillation

首页
研究单元产出分布图
收录类型分布图
论文引用排行
作者
文献类型
学科分类
关于网站
使用帮助
联系我们

条目量25195
全文量13145
访问量5340974
下载量818722

版权所有 @2018 - 2024 中国科学院自动化研究所 - Powered by CSpace

地址邮编: 北京市海淀区中关村东路95号（100190）
电话: 010－82544495