CASIA OpenIR

A Deep Model for Partial Multi-label Image Classification with Curriculum-based Disambiguation (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 801-814
Authors:  Feng Sun;  Ming-Kun Xie;  Sheng-Jun Huang
Submit date: 2024/07/18
Partial multi-label image classification  curriculum-based disambiguation  consistency regularization  label difficulty  candidate label set.  
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 782-800
Authors:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Submit date: 2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
Research on Player Policy Methods for Football Agents Based on Deep Reinforcement Learning (基于深度强化学习的足球智能体球员策略方法研究) (Thesis)
2024
Authors:  刘博寅
Submit date: 2024/07/12
Football  multi-agent systems  deep reinforcement learning  mutual information  intrinsic motivation  pretraining  
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1913-1931
Authors:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Submit date: 2024/07/08
Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning (Conference paper)
Chengdu, China, 2021-10
Authors:  Zhang X(张熙);  Feifei Zhang;  Changsheng Xu
Submit date: 2024/07/08
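Research on Object Detection Models and Algorithms for Optical Remote Sensing Images with Limited Annotations (标注受限的光学遥感图像目标检测模型与算法研究) (Thesis)
2024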
Authors:  任至达
Submit date: 2024/07/08
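Object detection in optical remote sensing images  limited annotation  weakly supervised learning  saliency detection  feature enhancement  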
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks (Conference paper)
Queensland, Australia, 2023-6
Authors:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Submit date: 2024/07/04
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game (Conference paper)
Online, 2020-5
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Submit date: 2024/07/04
Dynamic datasets and market environments for financial reinforcement learning (Journal article)
Machine Learning, 2024, Pages: 45
Authors:  Liu, Xiao-Yang;  Xia, Ziyi;  Yang, Hongyang;  Gao, Jiechao;  Zha, Daochen;  Zhu, Ming;  Wang, Christina Dan;  Wang, Zhaoran;  Guo, Jian
Submit date: 2024/07/03
Financial reinforcement learning  FinRL  Dynamic dataset  Market environment  AI4Finance  Open finance  
Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms (Conference paper)
Baoding, Hebei, 2023-8-16
Authors:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Submit date: 2024/07/01