CASIA OpenIR

Browse/Search Results:  1-10 of 74 Help

Selected(0)Clear Items/Page:    Sort:
自然语言嵌入的深度强化学习探索方法研究 学位论文
, 2024
Authors:  郭洲蕊
Adobe PDF(7588Kb)  |  Favorite  |  View/Download:33/1  |  Submit date:2024/06/26
深度强化学习  自然语言  探索  
LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文
, Singapore, 2023-12
Authors:  Zhitao He;  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(1153Kb)  |  Favorite  |  View/Download:13/4  |  Submit date:2024/06/25
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文
, Chongqing, China, 2023-11
Authors:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  Favorite  |  View/Download:34/18  |  Submit date:2024/06/24
—Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文
, Beijing, China, 2018-8-2
Authors:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF(540Kb)  |  Favorite  |  View/Download:47/12  |  Submit date:2024/06/24
Learning to Correct Erroneous Words for Document Grounded Conversations 会议论文
, Kuantan, Malaysia, 2023.02.23-2023.02.25
Authors:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  Favorite  |  View/Download:37/15  |  Submit date:2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
基于预训练模型的决策序列化建模研究 学位论文
, 2024
Authors:  林润基
Adobe PDF(7811Kb)  |  Favorite  |  View/Download:61/1  |  Submit date:2024/06/07
预训练模型  决策序列化  序列模型  
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
Authors:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  Favorite  |  View/Download:62/22  |  Submit date:2024/06/05
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
Authors:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  Favorite  |  View/Download:32/12  |  Submit date:2024/06/03
基于预训练语言模型的端到端概念体系构建方法 会议论文
, 中国哈尔滨市, 2023-8-5
Authors:  王思懿;  何世柱;  刘康;  赵军
Adobe PDF(794Kb)  |  Favorite  |  View/Download:44/20  |  Submit date:2024/05/31
基于序列展开模型的多智能体方法研究 学位论文
, 2024
Authors:  Luo ZX(罗正昕)
Adobe PDF(13451Kb)  |  Favorite  |  View/Download:51/1  |  Submit date:2024/05/30
多智能体  强化学习  序列展开模型  信度分配  非平稳性