CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 页码: early access
作者:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(4407Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/06/03
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 131, 页码: 3170–3192
作者:  Zhang, Libo;  Jiang, Lutao;  Ji, Ruyi;  Fan, Heng
Adobe PDF(3227Kb)  |  收藏  |  浏览/下载:41/0  |  提交时间:2023/11/17
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:141/7  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:147/29  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer