CASIA OpenIR

浏览/检索结果: 共61条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931
作者:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  收藏  |  浏览/下载:25/6  |  提交时间:2024/07/08
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:31/16  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval 会议论文
, 加拿大温哥华, 2023-6
作者:  chen yuxin;  ma zongyang;  zhang ziqi;  qi zhongang;  yuan chunfeng;  shan ying;  li bing;  hu weiming;  qie xiaohu;  wu jianping
Adobe PDF(1379Kb)  |  收藏  |  浏览/下载:32/8  |  提交时间:2024/06/25
交互场景下多模态抑郁程度评估与可解释性研究 学位论文
, 2023
作者:  蔡聪
Adobe PDF(5243Kb)  |  收藏  |  浏览/下载:12/0  |  提交时间:2024/06/25
抑郁程度评估  多模态  交互场景  机器学习  可解释性  
Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining 会议论文
, Paris France, 2023-10
作者:  Benjia Zhou;  Zhigang Chen;  Albert Clapes;  Jun Wan;  Yanyan Liang;  Sergio Escalera;  Zhen Lei;  Du Zhang
Adobe PDF(827Kb)  |  收藏  |  浏览/下载:31/7  |  提交时间:2024/06/06
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation 会议论文
, Istanbul, Turkiye, Dec 5-8
作者:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  收藏  |  浏览/下载:43/18  |  提交时间:2024/06/05
High-Fidelity Clothed Avatar Reconstruction from a Single Image 会议论文
, Canada, Vancouver, 2023年6月18日-6月22日
作者:  Tingting Liao;  Xiaomei Zhang;  Yuliang Xiu;  Hongwei Yi;  Xudong Liu;  Guo-Jun Qi;  Yong Zhang;  Xuan Wang;  Xiangyu Zhu;  Zhen Lei
Adobe PDF(9282Kb)  |  收藏  |  浏览/下载:43/14  |  提交时间:2024/06/03
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:66/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning 会议论文
, Ottawa, ON, Canada, October 29-November 3, 2023
作者:  Zheng Lian;  Haiyang Sun;  Licai Sun;  Kang Chen;  Mingyu Xu;  Kexin Wang;  Ke Xu;  Yu He;  Ying Li;  Jinming Zhao;  Ye Liu;  Bin Liu;  Jiangyan Yi;  Meng Wang;  Erik Cambria;  Guoying Zhao;  Björn W. Schuller;  Jianhua Tao
Adobe PDF(993Kb)  |  收藏  |  浏览/下载:51/16  |  提交时间:2024/05/31
PCEN: Potential Correlation-Enhanced Network for Multimodal Named Entity Recognition 会议论文
, Charlotte, NC, USA, 02-03 October 2023
作者:  Jiakai Geng;  Chenyang Zhang;  Linjing Li;  Qing Yang;  Daniel Zeng
Adobe PDF(4985Kb)  |  收藏  |  浏览/下载:61/9  |  提交时间:2024/05/31
named entity recognition  multimodal learning  vision-language pre-trained model  inconsistency loss