CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共22条,第1-10条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:134/16  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:123/1  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Semi-supervised Temporal Action Proposal Generation via Exploiting 2-d Proposal Map 期刊论文
IEEE Transactions on Multimedia, 2021, 页码: 3624 - 3635
作者:  Wang, Weining;  Lin, Tianwei;  He, Dongliang;  Li, Fu;  Wen, Shilei;  Wang, Liang;  Liu, Jing
Adobe PDF(4851Kb)  |  收藏  |  浏览/下载:142/22  |  提交时间:2023/05/03
Semi-supervised learning  proposal map oriented mean-teacher  pseudo label  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:139/26  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:332/81  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:300/63  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Normalized and Geometry-Aware Self-Attention Network for Image Captioning 会议论文
, 线上, 2020.06.14
作者:  Guo LT(郭龙腾);  Liu J(刘静);  Zhu XX(朱欣鑫);  Yao P(姚鹏);  Lu SC(卢诗晨);  Lu HQ(卢汉清)
Adobe PDF(574Kb)  |  收藏  |  浏览/下载:312/74  |  提交时间:2021/06/25
Image captioning  Self-attention  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:213/32  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Chat with illustration 期刊论文
MULTIMEDIA SYSTEMS, 2016, 卷号: 22, 期号: 1, 页码: 5-16
作者:  Jiang, Yu;  Liu, Jing;  Lu, Hanqing
浏览  |  Adobe PDF(1788Kb)  |  收藏  |  浏览/下载:324/91  |  提交时间:2016/03/19
Instant Messaging Service  Text-to-picture  Layout  
Robust Structured Subspace Learning for Data Representation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 卷号: 37, 期号: 10, 页码: 2085-2098
作者:  Li, Zechao;  Liu, Jing;  Tang, Jinhui;  Lu, Hanqing
浏览  |  Adobe PDF(525Kb)  |  收藏  |  浏览/下载:964/418  |  提交时间:2015/10/13
Data Representation  Latent Subspace  Image Understanding  Feature Learning  Structure Preserving