CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 卷号: 19, 期号: 4, 页码: 17
作者:  Ma, Xuan;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:57/0  |  提交时间:2023/11/17
Knowledge reasoning  multi-modal commonsense inference  graph neural network  
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:185/56  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Instance-aware Prompt Learning for Language Understanding and Generation 期刊论文
TALLIP, 2023, 页码: 19
作者:  Jin feihu;  Lu jinliang;  Zhang jiajun;  Zong chengqing
Adobe PDF(1091Kb)  |  收藏  |  浏览/下载:151/46  |  提交时间:2023/06/14
Topic-Oriented Dialogue Summarization 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 卷号: 31, 页码: 1797 - 1810
作者:  Lin, Haitao;  Zhu, Junnan;  Xiang, Lu;  Zhai, Feifei;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(3037Kb)  |  收藏  |  浏览/下载:190/77  |  提交时间:2023/06/13
dialogue summarization  abstractive summarization  controllable text generation  natural language processing  
无权访问的条目 期刊论文
作者:  Bo Zhou;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(602Kb)  |  收藏  |  浏览/下载:28/5  |  提交时间:2023/07/03
Cross-Modality Synergy Network for Referring Expression Comprehension and Segmentation 期刊论文
Neurocomputing, 2022, 卷号: 467, 期号: /, 页码: 99-114
作者:  Li, Qianzhong;  Zhang, Yujia;  Sun, Shiying;  Wu, Jinting;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(4555Kb)  |  收藏  |  浏览/下载:299/44  |  提交时间:2021/12/28
Referring expression comprehension  Referring expression segmentation  Cross-modality synergy  Attention mechanism  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
收藏  |  浏览/下载:282/0  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:260/33  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:303/75  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:261/58  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions