CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Dynamic datasets and market environments for financial reinforcement learning 期刊论文
MACHINE LEARNING, 2024, 页码: 45
作者:  Liu, Xiao-Yang;  Xia, Ziyi;  Yang, Hongyang;  Gao, Jiechao;  Zha, Daochen;  Zhu, Ming;  Wang, Christina Dan;  Wang, Zhaoran;  Guo, Jian
收藏  |  浏览/下载:4/0  |  提交时间:2024/07/03
Financial reinforcement learning  FinRL  Dynamic dataset  Market environment  AI4Finance  Open finance  
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:66/22  |  提交时间:2024/06/05
Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 卷号: 23, 期号: 2, 页码: 1-19
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Yiming Ju;  Jun Zhao;  Kang Liu
Adobe PDF(1250Kb)  |  收藏  |  浏览/下载:56/20  |  提交时间:2024/05/30
Explanation  knowledge distillation  model compression  
Information bottleneck based knowledge selection for commonsense reasoning 期刊论文
Information Sciences, 2024, 卷号: 660, 页码: 120134
作者:  Zhao Yang;  Yuanzhe Zhang;  Pengfei Cao;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(1069Kb)  |  收藏  |  浏览/下载:51/17  |  提交时间:2024/05/30
Commonsense reasoning  Knowledge selection  Information bottleneck  KG-augmented model