CASIA OpenIR

Browse/Search results: 62 items in total, showing items 1-10

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (Conference paper)
New Orleans, Louisiana & Online, 2022-11-28
Authors:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Li, Wei;  Wang, Haixin;  Zhao, Chaoyang;  Wu, Liwei;  Zhao, Rui;  Wang, Jinqiao;  Tang, Ming
Adobe PDF (1289 KB)  |  Views/Downloads: 6/2  |  Submitted: 2024/05/28
transformer  general visual framework  sequence prediction  multi-task  
Temporal Action Proposal Generation With Action Frequency Adaptive Network (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 2340-2353
Authors:  Yepeng Tang;  Weining Wang;  Chunjie Zhang;  Jing Liu;  Yao Zhao
Adobe PDF (10095 KB)  |  Views/Downloads: 37/15  |  Submitted: 2024/03/26
Anchor-free temporal action localization via Progressive Boundary-aware Boosting (Journal article)
Information Processing & Management, 2022, Volume: 60, Issue: 1, Pages: 103141
Authors:  Tang, Yepeng;  Wang, Weining;  Yang, Yanwu;  Zhang, Chunjie;  Liu, Jing
Adobe PDF (1559 KB)  |  Views/Downloads: 130/45  |  Submitted: 2023/05/03
Temporal action localization  Anchor-free  Video understanding  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF (7741 KB)  |  Views/Downloads: 136/24  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer  
A Peer-to-Peer Distributed Bisecting K-means (Conference paper)
Online, 2022-2-19
Author:  Gao HY (高浩元)
Adobe PDF (4307 KB)  |  Views/Downloads: 177/49  |  Submitted: 2022/06/17
Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering (Conference paper)
Online, 2020-10
Authors:  Liu, Fei;  Liu, Jing;  Zhu, Xinxin;  Hong, Richang;  Lu, Hanqing
Adobe PDF (2797 KB)  |  Views/Downloads: 336/175  |  Submitted: 2022/06/15
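Research on Representation Learning for Human Pose Estimation (人体姿态估计的表示学习研究) (Thesis)
Institute of Automation, Chinese Academy of Sciences, 2022
Author:  Wu Wenzhu (吴文竹)
Adobe PDF (7762 KB)  |  Views/Downloads: 215/8  |  Submitted: 2022/06/14
Human pose estimation  Keypoint context  Contrastive learning  Hierarchical loss  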
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 0
Authors:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF (3550 KB)  |  Views/Downloads: 328/80  |  Submitted: 2022/01/27
video question answering  attention mechanism  metric learning  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3518-3529
Authors:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF (2891 KB)  |  Views/Downloads: 295/61  |  Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
MSCap: Multi-Style Image Captioning with Unpaired Stylized Text (Conference paper)
Long Beach, USA, 2019.06.16
Authors:  Longteng, Guo;  Jing, Liu;  Peng, Yao;  Jiangwei, Li;  Hanqing, Lu
Adobe PDF (914 KB)  |  Views/Downloads: 121/24  |  Submitted: 2021/06/25