CASIA OpenIR

Browse/Search results: 23 items in total, showing items 1-10

SlowFastFormer for 3D human pose estimation [Journal article]
Computer Vision and Image Understanding, 2024, Issue: 243, Pages: 103992
Authors: Zhou Lu; Chen Yingying; Wang Jinqiao
Adobe PDF(989Kb) | Views/Downloads: 7/3 | Submitted: 2024/06/03
Temporal Action Proposal Generation With Action Frequency Adaptive Network [Journal article]
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 2340-2353
Authors: Yepeng Tang; Weining Wang; Chunjie Zhang; Jing Liu; Yao Zhao
Adobe PDF(10095Kb) | Views/Downloads: 39/15 | Submitted: 2024/03/26
Anchor-free temporal action localization via Progressive Boundary-aware Boosting [Journal article]
Information Processing & Management, 2022, Volume: 60, Issue: 1, Pages: 103141
Authors: Tang, Yepeng; Wang, Weining; Yang, Yanwu; Zhang, Chunjie; Liu, Jing
Adobe PDF(1559Kb) | Views/Downloads: 131/46 | Submitted: 2023/05/03
Temporal action localization  Anchor-free  Video understanding  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation [Journal article]
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF(7741Kb) | Views/Downloads: 139/26 | Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
A Peer-to-Peer Distributed Bisecting K-means [Conference paper]
Online, 2022-2-19
Author: Gao HY (高浩元)
Adobe PDF(4307Kb) | Views/Downloads: 178/50 | Submitted: 2022/06/17
Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering [Conference paper]
Online, 2020-10
Authors: Liu, Fei; Liu, Jing; Zhu, Xinxin; Hong, Richang; Lu, Hanqing
Adobe PDF(2797Kb) | Views/Downloads: 340/175 | Submitted: 2022/06/15
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 0
Authors: Liu, Fei; Liu, Jing; Hong, Richang; Lu, Hanqing
Adobe PDF(3550Kb) | Views/Downloads: 332/81 | Submitted: 2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3518-3529
Authors: Liu, Fei; Liu, Jing; Fang, Zhiwei; Hong, Richang; Lu, Hanqing
Adobe PDF(2891Kb) | Views/Downloads: 298/62 | Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 8, Pages: 2149-2162
Authors: Guo, Longteng; Liu, Jing; Lu, Shichen; Lu, Hanqing
Adobe PDF(4378Kb) | Views/Downloads: 213/32 | Submitted: 2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Multiview Label Sharing for Visual Representations and Classifications [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 4, Pages: 903-913
Authors: Zhang, Chunjie; Cheng, Jian; Tian, Qi
Adobe PDF(615Kb) | Views/Downloads: 363/101 | Submitted: 2018/10/10
Multi-view Learning  Linear Transformation  Shared Space  Image Representation  Visual Classification