CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 页码: early access
作者:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(4407Kb)  |  收藏  |  浏览/下载:49/16  |  提交时间:2024/06/03
Temporal Action Proposal Generation With Action Frequency Adaptive Network 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 2340 - 2353
作者:  Yepeng Tang;  Weining Wang;  Chunjie Zhang;  Jing Liu;  Yao Zhao
Adobe PDF(10095Kb)  |  收藏  |  浏览/下载:89/29  |  提交时间:2024/03/26
Proposals  Task analysis  Data models  Time-frequency analysis  Representation learning  Predictive models  Information science  Temporal action proposal generation  expert learning  fine-gained detection  action frequency  
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 131, 页码: 3170–3192
作者:  Zhang, Libo;  Jiang, Lutao;  Ji, Ruyi;  Fan, Heng
Adobe PDF(3227Kb)  |  收藏  |  浏览/下载:58/5  |  提交时间:2023/11/17
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:187/20  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition 会议论文
, Greece, 2023-6-4
作者:  Liu, Jiawei;  Wang, Hao;  Wang, Weining;  He, Xingjian;  Liu, Jing
Adobe PDF(1673Kb)  |  收藏  |  浏览/下载:187/42  |  提交时间:2023/07/06
文本指导的视频生成方法研究 学位论文
, 2023
作者:  刘佳伟
Adobe PDF(15246Kb)  |  收藏  |  浏览/下载:167/6  |  提交时间:2023/06/08
基于人工智能的内容生成  多模态  视频生成  
ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation 会议论文
, Queensland, Australia, 2023-6-18
作者:  Liu, Jiawei;  Wang, Weining;  Liu, Wei;  He, Qian;  Liu, Jing
Adobe PDF(4537Kb)  |  收藏  |  浏览/下载:229/49  |  提交时间:2023/05/04
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:180/40  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer