CASIA OpenIR

Browse/Search results: 108 items in total, showing items 1-10

Dynamic datasets and market environments for financial reinforcement learning (journal article)
MACHINE LEARNING, 2024, Pages: 45
Authors:  Liu, Xiao-Yang;  Xia, Ziyi;  Yang, Hongyang;  Gao, Jiechao;  Zha, Daochen;  Zhu, Ming;  Wang, Christina Dan;  Wang, Zhaoran;  Guo, Jian
Views/Downloads: 9/0  |  Submitted: 2024/07/03
Financial reinforcement learning  FinRL  Dynamic dataset  Market environment  AI4Finance  Open finance  
Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms (conference paper)
Baoding, Hebei, 2023-8-16
Authors:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(541Kb)  |  Views/Downloads: 36/14  |  Submitted: 2024/07/01
Bidirectional Sentence Ordering with Interactive Decoding (journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, Volume: 22, Issue: 2, Pages: 1-15
Authors:  Guirong Bai;  Shizhu He;  Kang Liu;  Jun Zhao
Adobe PDF(1080Kb)  |  Views/Downloads: 44/16  |  Submitted: 2024/06/20
Learning to Correct Erroneous Words for Document Grounded Conversations (conference paper)
Kuantan, Malaysia, 2023.02.23-2023.02.25
Authors:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  Views/Downloads: 50/20  |  Submitted: 2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Exploiting Curriculum Learning in Unsupervised Neural Machine Translation (conference paper)
Online, November 7–11, 2021
Authors:  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(866Kb)  |  Views/Downloads: 67/23  |  Submitted: 2024/06/13
Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator (conference paper)
Siem Reap, Cambodia, December 13-16, 2018
Authors:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(697Kb)  |  Views/Downloads: 46/20  |  Submitted: 2024/06/05
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning (conference paper)
Padua, Italy, July 2022
Authors:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  Views/Downloads: 53/19  |  Submitted: 2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game (journal article)
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, Pages: 1-13
Authors:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  Views/Downloads: 53/11  |  Submitted: 2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 4334-4347
Authors:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
Views/Downloads: 39/0  |  Submitted: 2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  vision-language models  
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning (journal article)
Connection Science, 2023, Volume: 35, Issue: 1, Pages: 2174078
Authors:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  Views/Downloads: 64/23  |  Submitted: 2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation