CASIA OpenIR

浏览/检索结果: 共97条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:39/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
TENET: Beyond Pseudo-Labeling for Semi-supervised Few-shot Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 0
作者:  Ma CC(马成丞);  Dong WM(董未名);  Xu CS(徐常胜)
Adobe PDF(741Kb)  |  收藏  |  浏览/下载:102/24  |  提交时间:2024/01/29
Semi-supervised few-shot learning  few-shot learning  pseudo-labeling  linear regression  low-rank reconstruction  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:113/12  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:48/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Audio-driven Dubbing for User Generated Contents via Style-aware Semi-parametric Synthesis 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 卷号: 33, 期号: 3, 页码: 1247 - 1261
作者:  Song LS(宋林森);  Wu WY(吴文岩);  Fu CY(傅朝友);  Loy, Chen Change;  He R(赫然)
Adobe PDF(8629Kb)  |  收藏  |  浏览/下载:110/47  |  提交时间:2023/06/29
Talking Face Generation  Video Generation  GAN  Thin-plate Spline  
Everybody’s Talkin’: Let Me Talk as You Want 期刊论文
IEEE Transactions on Information Forensics and Security, 2022, 卷号: 17, 期号: 1, 页码: 585 - 598
作者:  宋林森;  吴文岩;  钱晨;  赫然;  Loy, Chen Change
Adobe PDF(15432Kb)  |  收藏  |  浏览/下载:71/11  |  提交时间:2023/06/29
Talking face generation  Video generation  GAN  Audio dubbing  
Semi-supervised cross-modal image generation with generative adversarial networks 期刊论文
Pattern Recognition, 2020, 卷号: 100, 页码: 107085
作者:  Li D(李丹);  Du CD(杜长德);  He HG(何晖光)
Adobe PDF(4031Kb)  |  收藏  |  浏览/下载:106/32  |  提交时间:2023/05/05
Graph-Enhanced Emotion Neural Decoding 期刊论文
IEEE Transactions on Medical Imaging, 2023, 页码: 1-1
作者:  Huang ZY(黄中昱);  Du CD(杜长德);  Wang YH;  Fu KC(付铠成);  He HG(何晖光)
Adobe PDF(6049Kb)  |  收藏  |  浏览/下载:245/55  |  提交时间:2023/05/05
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:126/23  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: Early Access, 期号: Early Access, 页码: Early Access
作者:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  收藏  |  浏览/下载:89/15  |  提交时间:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation