CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Dual-Path Transformer for 3D Human Pose Estimation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 5, 页码: 3260-3270
作者:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(2410Kb)  |  收藏  |  浏览/下载:47/20  |  提交时间:2024/06/03
Transformers  Three-dimensional displays  Pose estimation  Task analysis  Solid modeling  Feature extraction  Benchmark testing  3D human pose estimation  transformer  motion  distillation  
Anchor-free temporal action localization via Progressive Boundary-aware Boosting 期刊论文
Information Processing & Management, 2022, 卷号: 60, 期号: 1, 页码: 103141
作者:  Tang, Yepeng;  Wang, Weining;  Yang, Yanwu;  Zhang, Chunjie;  Liu, Jing
Adobe PDF(1559Kb)  |  收藏  |  浏览/下载:165/61  |  提交时间:2023/05/03
Temporal action localization  Anchor-free  Video understanding  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:169/36  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
A Peer-to-Peer Distributed Bisecting K-means 会议论文
, 线上, 2022-2-19
作者:  Gao HY(高浩元)
Adobe PDF(4307Kb)  |  收藏  |  浏览/下载:208/60  |  提交时间:2022/06/17
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:368/93  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:338/76  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
MSCap: Multi-Style Image Captioning with Unpaired Stylized Text 会议论文
, 美国长滩, 2019.06.16
作者:  Longteng, Guo;  Jing, Liu;  Peng, Yao;  Jiangwei, Li;  Hanqing, Lu
Adobe PDF(914Kb)  |  收藏  |  浏览/下载:142/33  |  提交时间:2021/06/25
Spatialflow: Bridging all tasks for panoptic segmentation 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2020, 卷号: 31, 期号: 6, 页码: 2288-2300
作者:  Chen, Qiang;  Cheng, Anda;  He, Xiangyu;  Wang, Peisong;  Cheng, Jian
Adobe PDF(4643Kb)  |  收藏  |  浏览/下载:201/43  |  提交时间:2020/10/20
panoptic segmentation  
人脸与人体结构化视觉分析 学位论文
工学博士, 中科院自动化所: 中国科学院大学, 2020
作者:  刘智威
Adobe PDF(6223Kb)  |  收藏  |  浏览/下载:245/10  |  提交时间:2020/09/10
人脸关键点定位,人体姿态估计,人脸识别  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:240/38  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination