CASIA OpenIR

Browse/Search results: 172 items in total, showing items 1-10

Coordinating explicit and implicit knowledge for knowledge-based VQA (Journal article)
PATTERN RECOGNITION, 2024, Volume: 151, Pages: 9
Authors:  Wang, Qunbo;  Liu, Jing;  Wu, Wenjun
Views/Downloads: 6/0  |  Submitted: 2024/07/03
Knowledge retrieval  Pre-trained model  Knowledge-based VQA  
Memory-Adaptive Vision-and-Language Navigation (Journal article)
Pattern Recognition, 2024, Volume: 153, Pages: 110511
Authors:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF(3831Kb)  |  Views/Downloads: 49/20  |  Submitted: 2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  Views/Downloads: 35/18  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 (Journal article)
Information Fusion, 2024, Volume: 107, Pages: 1-16
Authors:  Xiao Wang;  Yutong Wang;  Jing Yang;  Xiaofeng Jia;  Lijun Li;  Weiping Ding;  Fei-Yue Wang
Adobe PDF(4446Kb)  |  Views/Downloads: 49/8  |  Submitted: 2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation (Conference paper)
Istanbul, Turkiye, Dec 5-8
Authors:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  Views/Downloads: 47/18  |  Submitted: 2024/06/05
Tri-relational multi-faceted graph neural networks for automatic question tagging (Journal article)
Neurocomputing, 2024, Volume: 576, Pages: 127250
Authors:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  Views/Downloads: 49/21  |  Submitted: 2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 4, Pages: 2041-2055
Authors:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  Views/Downloads: 48/1  |  Submitted: 2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Hierarchical Attention Network for Open-Set Fine-Grained Recognition (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Pages: 1-14
Authors:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(2596Kb)  |  Views/Downloads: 60/18  |  Submitted: 2024/05/28
Improved Video Emotion Recognition with Alignment of CNN and Human Brain Representations (Journal article)
IEEE Transactions on Affective Computing, 2023, Pages: 1-15
Authors:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(3907Kb)  |  Views/Downloads: 70/23  |  Submitted: 2024/05/28
CNN-brain Alignment  Brain-guided Deep Learning  Video Emotion Recognition  Representation Similarity Analysis  
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 6155-6167
Authors:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Adobe PDF(1634Kb)  |  Views/Downloads: 60/2  |  Submitted: 2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error