CASIA OpenIR

浏览/检索结果: 共54条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks 会议论文
, New Orleans, Louisiana & Online, 2022-11-28
作者:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Li, Wei;  Wang, Haixin;  Zhao, Chaoyang;  Wu, Liwei;  Zhao, Rui;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(1289Kb)  |  收藏  |  浏览/下载:4/1  |  提交时间:2024/05/28
transformer  general visual framework  sequence prediction  multi-task  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:126/15  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Learning Semantics-Consistent Stripes With Self-Refinement for Person Re-Identification 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-12
作者:  Zhu Kuan;  Guo Haiyun;  Liu Songyan;  Wang Jinqiao;  Tang Ming
Adobe PDF(4384Kb)  |  收藏  |  浏览/下载:122/33  |  提交时间:2023/06/08
ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation 会议论文
, Queensland, Australia, 2023-6-18
作者:  Liu, Jiawei;  Wang, Weining;  Liu, Wei;  He, Qian;  Liu, Jing
Adobe PDF(4537Kb)  |  收藏  |  浏览/下载:183/38  |  提交时间:2023/05/04
Semi-supervised Temporal Action Proposal Generation via Exploiting 2-d Proposal Map 期刊论文
IEEE Transactions on Multimedia, 2021, 页码: 3624 - 3635
作者:  Wang, Weining;  Lin, Tianwei;  He, Dongliang;  Li, Fu;  Wen, Shilei;  Wang, Liang;  Liu, Jing
Adobe PDF(4851Kb)  |  收藏  |  浏览/下载:139/22  |  提交时间:2023/05/03
Semi-supervised learning  proposal map oriented mean-teacher  pseudo label  
Anchor-free temporal action localization via Progressive Boundary-aware Boosting 期刊论文
Information Processing & Management, 2022, 卷号: 60, 期号: 1, 页码: 103141
作者:  Tang, Yepeng;  Wang, Weining;  Yang, Yanwu;  Zhang, Chunjie;  Liu, Jing
Adobe PDF(1559Kb)  |  收藏  |  浏览/下载:126/44  |  提交时间:2023/05/03
Temporal action localization  Anchor-free  Video understanding  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:136/24  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering 会议论文
, 线上, 2021-10
作者:  Liu, Fei;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(1174Kb)  |  收藏  |  浏览/下载:182/38  |  提交时间:2022/06/15
Erasing-based Attention Learning for Visual Question Answering 会议论文
, Nice, France, 2019-10
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2319Kb)  |  收藏  |  浏览/下载:162/50  |  提交时间:2022/06/15
Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering 会议论文
, 线上, 2020-10
作者:  Liu, Fei;  Liu, Jing;  Zhu, Xinxin;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2797Kb)  |  收藏  |  浏览/下载:336/175  |  提交时间:2022/06/15