CASIA OpenIR
(This search is based on the user's claimed works)

Browse/Search results: 20 items in total, showing items 1-10

Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  Views/Downloads: 121/22  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 0
Authors:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  Views/Downloads: 310/75  |  Submitted: 2022/01/27
video question answering  attention mechanism  metric learning  
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering (Conference paper)
Online, 2021-10
Authors:  Liu, Fei;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(1174Kb)  |  Views/Downloads: 171/35  |  Submitted: 2022/06/15
Visual Question Answering With Dense Inter- and Intra-Modality Interactions (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3518-3529
Authors:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  Views/Downloads: 271/58  |  Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering (Conference paper)
Online, 2020-10
Authors:  Liu, Fei;  Liu, Jing;  Zhu, Xinxin;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2797Kb)  |  Views/Downloads: 319/172  |  Submitted: 2022/06/15
Show, Tell, and Polish: Ruminant Decoding for Image Captioning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 8, Pages: 2149-2162
Authors:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  Views/Downloads: 201/30  |  Submitted: 2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Erasing-based Attention Learning for Visual Question Answering (Conference paper)
Nice, France, 2019-10
Authors:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2319Kb)  |  Views/Downloads: 151/48  |  Submitted: 2022/06/15
Language and Visual Relations Encoding for Visual Question Answering (Conference paper)
Taiwan, China, 2019-9
Authors:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Lu, Hanqing
Adobe PDF(694Kb)  |  Views/Downloads: 144/53  |  Submitted: 2022/06/15
Structure Preserving Convolutional Attention for Image Captioning (Journal article)
APPLIED SCIENCES-BASEL, 2019, Volume: 9, Issue: 14, Pages: 10
Authors:  Lu, Shichen;  Hu, Ruimin;  Liu, Jing;  Guo, Longteng;  Zheng, Fei
Adobe PDF(2351Kb)  |  Views/Downloads: 267/37  |  Submitted: 2019/12/16
image captioning  attention  spatial structure  deep learning  computer vision  
Improving visual question answering using dropout and enhanced question encoder (Journal article)
PATTERN RECOGNITION, 2019, Volume: 90, Issue: 1, Pages: 404-414
Authors:  Fang, Zhiwei;  Liu, Jing;  Li, Yong;  Qiao, Yanyuan;  Lu, Hanqing
Adobe PDF(1624Kb)  |  Views/Downloads: 459/121  |  Submitted: 2019/04/23
Visual question answering  Coherent dropout  Siamese dropout  Enhanced question encoder