CASIA OpenIR

浏览/检索结果: 共122条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931
作者:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  收藏  |  浏览/下载:23/6  |  提交时间:2024/07/08
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
A Survey of Recent Advances in Commonsense Knowledge Acquisition: Methods and Resources 期刊论文
Machine Intelligence Research, 2024, 页码: 1
作者:  Wang, Chenhao;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Zhao, Jun
Adobe PDF(1228Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/06/25
Pro-tuning: Unified Prompt Tuning for Vision Tasks 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 34, 期号: 6, 页码: 4653 - 4667
作者:  Xing Nie;  Bolin Ni;  Jianlong Chang;  Gaofeng Meng;  Chunlei Huo;  Shiming Xiang;  Qi Tian
Adobe PDF(2224Kb)  |  收藏  |  浏览/下载:27/8  |  提交时间:2024/06/21
Improving diversity of speech‐driven gesture generation with memory networks as dynamic dictionaries. 期刊论文
CAAI Transactions on Intelligence Technology., 2024, 页码: 1–15
作者:  Zeyu Zhao;  Nan Gao;  Zhi Zeng;  Guixuan Zhang;  Jie Liu;  Shuwu Zhang
Adobe PDF(2067Kb)  |  收藏  |  浏览/下载:44/16  |  提交时间:2024/06/20
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 期刊论文
Information Fusion, 2024, 卷号: 107, 页码: 1-16
作者:  Xiao Wang;  Yutong Wang;  Jing Yang;  Xiaofeng Jia;  Lijun Li;  Weiping Ding;  Fei-Yue Wang
Adobe PDF(4446Kb)  |  收藏  |  浏览/下载:43/6  |  提交时间:2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
Tri-relational multi-faceted graph neural networks for automatic question tagging 期刊论文
Neurocomputing, 2024, 卷号: 576, 页码: 127250
作者:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  收藏  |  浏览/下载:39/17  |  提交时间:2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:48/11  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:40/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
基于显著性特征提取的图像描述算法 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 3, 页码: 735-746
作者:  王鑫;  宋永红;  张元林
Adobe PDF(4402Kb)  |  收藏  |  浏览/下载:39/14  |  提交时间:2024/05/20
图像描述  显著性特征提取  语言模型  编码器  解码器