CASIA OpenIR

Browse/Search results: 45 items in total, showing items 1-10

Generating Relevant Article Comments via Variational Multi-Layer Fusion (Conference paper)
Yokohama, Japan, 2024-7
Authors: Zou HY(邹瀚仪); Xu HF(徐会芳); Kong QC(孔庆超); Cao YL(曹艺琳); Mao WJ(毛文吉)
Adobe PDF(354Kb) | Views/Downloads: 25/8 | Submitted: 2024/06/24
article comment generation  variational auto-encoder  relevant information extraction  multi-layer fusion  
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning (Conference paper)
Chongqing, China, 2023-11
Authors: Shen Liancheng; Su Jianhua; Zhang Xiaodong
Adobe PDF(254Kb) | Views/Downloads: 34/18 | Submitted: 2024/06/24
Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog (Conference paper)
Beijing, China, 2018-8-2
Authors: Yang Minghao; Zhang Ke; NaShengRuoYang; Tao Jianhua
Adobe PDF(540Kb) | Views/Downloads: 47/12 | Submitted: 2024/06/24
SynDG: Syntax-aware Dialogue Generation (Conference paper)
Tianjin, China, March 17-20, 2023
Authors: Junyan Qiu; Yiping Yang; Haitao Wang
Adobe PDF(903Kb) | Views/Downloads: 40/13 | Submitted: 2024/06/17
dialogue system  natural language generation  dependency parsing  graph attention network  
Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning (Conference paper)
Yokohama, Japan, 2024-07
Authors: Junyan Qiu; Haitao Wang; Yiping Yang
Adobe PDF(1596Kb) | Views/Downloads: 40/17 | Submitted: 2024/06/17
large language models  supervised fine-tuning  instruct tuning  stylized generation  
Learning to Correct Erroneous Words for Document Grounded Conversations (Conference paper)
Kuantan, Malaysia, 2023.02.23-2023.02.25
Authors: Junyan Qiu; Haitao Wang; Yiping Yang
Adobe PDF(773Kb) | Views/Downloads: 37/15 | Submitted: 2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Learning to Deliberate: Multi-Pass Decoding for Document-Grounded Conversations (Conference paper)
Yokohama, Japan, 2024-07
Authors: Junyan Qiu; Haitao Wang; Yiping Yang
Adobe PDF(1033Kb) | Views/Downloads: 35/11 | Submitted: 2024/06/17
dialogue system  document-grounded conversations  deliberation network  sequence-to-sequence framework  
Bridging the Gap between Different Vocabularies for LLM Ensemble (Conference paper)
Mexico City, Mexico, June 16–21, 2024
Authors: 徐杨一帆; Lu JL(陆金梁); Zhang JJ(张家俊)
Adobe PDF(1982Kb) | Views/Downloads: 58/18 | Submitted: 2024/06/13
Token-level Direct Preference Optimization (Conference paper)
Vienna, Austria, 2024/7/21-27
Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF(883Kb) | Views/Downloads: 63/22 | Submitted: 2024/06/05
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language (Conference paper)
Beijing Huateng Mercure Hotel (北京华腾美居酒店), 2023-12-9
Authors: Zhourui Guo; Meng Yao; Yang Yu; Qiyue Yin
Adobe PDF(2302Kb) | Views/Downloads: 32/12 | Submitted: 2024/06/03