CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共30条,第1-10条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
Temporal Action Proposal Generation With Action Frequency Adaptive Network 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 2340 - 2353
作者:  Yepeng Tang;  Weining Wang;  Chunjie Zhang;  Jing Liu;  Yao Zhao
Adobe PDF(10095Kb)  |  收藏  |  浏览/下载:66/19  |  提交时间:2024/03/26
Proposals  Task analysis  Data models  Time-frequency analysis  Representation learning  Predictive models  Information science  Temporal action proposal generation  expert learning  fine-gained detection  action frequency  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:170/24  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:162/13  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:162/32  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
An Efficient Sampling-Based Attention Network for Semantic Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2850-2863
作者:  He, Xingjian;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(3252Kb)  |  收藏  |  浏览/下载:392/79  |  提交时间:2022/06/10
Stochastic processes  Sampling methods  Semantics  Image segmentation  Computational complexity  Pattern recognition  Convolution  Semantic segmentation  stochastic sampling-based attention  deterministic sampling-based attention  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:361/91  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:332/73  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Structure Preserving Convolutional Attention for Image Captioning 期刊论文
APPLIED SCIENCES-BASEL, 2019, 卷号: 9, 期号: 14, 页码: 10
作者:  Lu, Shichen;  Hu, Ruimin;  Liu, Jing;  Guo, Longteng;  Zheng, Fei
Adobe PDF(2351Kb)  |  收藏  |  浏览/下载:300/41  |  提交时间:2019/12/16
image captioning  attention  spatial structure  deep learning  computer vision  
Image Captioning with Word Gate and Adaptive Self-Critical Learning 期刊论文
APPLIED SCIENCES-BASEL, 2018, 卷号: 8, 期号: 6, 页码: 13
作者:  Zhu, Xinxin;  Li, Lixiang;  Liu, Jing;  Guo, Longteng;  Fang, Zhiwei;  Peng, Haipeng;  Niu, Xinxin
Adobe PDF(3312Kb)  |  收藏  |  浏览/下载:411/65  |  提交时间:2019/12/16
image caption  image understanding  deep learning  computer vision  
Improving visual question answering using dropout and enhanced question encoder 期刊论文
PATTERN RECOGNITION, 2019, 卷号: 90, 期号: 1, 页码: 404-414
作者:  Fang, Zhiwei;  Liu, Jing;  Li, Yong;  Qiao, Yanyuan;  Lu, Hanqing
浏览  |  Adobe PDF(1624Kb)  |  收藏  |  浏览/下载:501/132  |  提交时间:2019/04/23
Visual question answering  Coherent dropout  Siamese dropout  Enhanced question encoder