CASIA OpenIR

浏览/检索结果: 共35条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 页码: early access
作者:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(4407Kb)  |  收藏  |  浏览/下载:4/3  |  提交时间:2024/06/03
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 131, 页码: 3170–3192
作者:  Zhang, Libo;  Jiang, Lutao;  Ji, Ruyi;  Fan, Heng
Adobe PDF(3227Kb)  |  收藏  |  浏览/下载:39/0  |  提交时间:2023/11/17
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:123/1  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-12
作者:  Zhu Kuan;  Guo Haiyun;  Liu Songyan;  Wang Jinqiao;  Tang Ming
Adobe PDF(4384Kb)  |  收藏  |  浏览/下载:124/33  |  提交时间:2023/06/08
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:139/26  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
An Efficient Sampling-Based Attention Network for Semantic Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2850-2863
作者:  He, Xingjian;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(3252Kb)  |  收藏  |  浏览/下载:368/77  |  提交时间:2022/06/10
Stochastic processes  Sampling methods  Semantics  Image segmentation  Computational complexity  Pattern recognition  Convolution  Semantic segmentation  stochastic sampling-based attention  deterministic sampling-based attention  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:332/81  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:298/62  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Gesture recognition based on deep deformable 3D convolutional neural networks 期刊论文
PATTERN RECOGNITION, 2020, 期号: 107, 页码: 12
作者:  Zhang, Yifan;  Shi, Lei;  Wu, Yi;  Cheng, Ke;  Cheng, Jian;  Lu, Hanqing
浏览  |  Adobe PDF(1310Kb)  |  收藏  |  浏览/下载:466/134  |  提交时间:2020/08/31
Gesture recognition  Spatiotemporal deformable convolution  Spatiotemporal convolutional neural network  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:213/32  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination