CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共7条,第1-7条 帮助

限定条件                                
已选(0)清除 条数/页:   排序方式:
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:165/34  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:366/93  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:238/38  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Structure Preserving Convolutional Attention for Image Captioning 期刊论文
APPLIED SCIENCES-BASEL, 2019, 卷号: 9, 期号: 14, 页码: 10
作者:  Lu, Shichen;  Hu, Ruimin;  Liu, Jing;  Guo, Longteng;  Zheng, Fei
Adobe PDF(2351Kb)  |  收藏  |  浏览/下载:302/42  |  提交时间:2019/12/16
image captioning  attention  spatial structure  deep learning  computer vision  
Image Captioning with Word Gate and Adaptive Self-Critical Learning 期刊论文
APPLIED SCIENCES-BASEL, 2018, 卷号: 8, 期号: 6, 页码: 13
作者:  Zhu, Xinxin;  Li, Lixiang;  Liu, Jing;  Guo, Longteng;  Fang, Zhiwei;  Peng, Haipeng;  Niu, Xinxin
Adobe PDF(3312Kb)  |  收藏  |  浏览/下载:413/65  |  提交时间:2019/12/16
image caption  image understanding  deep learning  computer vision  
Improving visual question answering using dropout and enhanced question encoder 期刊论文
PATTERN RECOGNITION, 2019, 卷号: 90, 期号: 1, 页码: 404-414
作者:  Fang, Zhiwei;  Liu, Jing;  Li, Yong;  Qiao, Yanyuan;  Lu, Hanqing
浏览  |  Adobe PDF(1624Kb)  |  收藏  |  浏览/下载:504/132  |  提交时间:2019/04/23
Visual question answering  Coherent dropout  Siamese dropout  Enhanced question encoder  
Captioning Transformer with Stacked Attention Modules 期刊论文
APPLIED SCIENCES-BASEL, 2018, 卷号: 8, 期号: 5
作者:  Zhu, Xinxin;  Li, Lixiang;  Liu, Jing;  Peng, Haipeng;  Niu, Xinxin
收藏  |  浏览/下载:226/0  |  提交时间:2018/10/10
Image Caption  Image Understanding  Deep Learning  Computer Vision