CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:143/17  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 5009-5021
作者:  Ji, Ruyi;  Li, Jiaying;  Zhang, Libo;  Liu, Jing;  Wu, Yanjun
Adobe PDF(4636Kb)  |  收藏  |  浏览/下载:133/4  |  提交时间:2023/11/16
Transformer  multi-grained assembly  fine-grained visual classification  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:143/26  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
MSCap: Multi-Style Image Captioning with Unpaired Stylized Text 会议论文
, 美国长滩, 2019.06.16
作者:  Longteng, Guo;  Jing, Liu;  Peng, Yao;  Jiangwei, Li;  Hanqing, Lu
Adobe PDF(914Kb)  |  收藏  |  浏览/下载:124/25  |  提交时间:2021/06/25
Semantic-spatial fusion network for human parsing 期刊论文
NEUROCOMPUTING, 2020, 卷号: 91, 期号: 402, 页码: 375-383
作者:  Zhang, Xiaomei;  Chen, Yingying;  Zhu, Bingke;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(2060Kb)  |  收藏  |  浏览/下载:318/47  |  提交时间:2020/07/20
SSFNet  Semantic modulation model  Resolution-aware model  Human parsing