CASIA OpenIR  > 智能感知与计算研究中心
AnyFace: Free-style Text-to-Face Synthesis and Manipulation
Sun, Jianxin1,2; Deng, Qiyao1,2; Li, Qi1,2; Sun, Muyi1; Ren, Min1,2; Sun, Zhenan1,2
2022
会议名称IEEE/CVF Conference on Computer Vision and Pattern Recognition
页码18687-18696
会议日期2022-6-19至2022-6-24
会议地点美国新奥尔良/线上会议
摘要

Existing text-to-image synthesis methods generally are only applicable to words in the training dataset. However, human faces are so variable to be described with limited words. So this paper proposes the first free-style text-to-face method namely AnyFace enabling much wider open world applications such as metaverse, social media, cosmetics, forensics, etc. AnyFace has a novel two-stream framework for face image synthesis and manipulation given arbitrary descriptions of the human face. Specifically, one stream performs text-to-face generation and the other conducts face image reconstruction. Facial text and image features are extracted using the CLIP (Contrastive LanguageImage Pre-training) encoders. And a collaborative Cross Modal Distillation (CMD) module is designed to align the linguistic and visual features across these two streams. Furthermore, a Diverse Triplet Loss (DT loss) is developed to model fine-grained features and improve facial diversity. Extensive experiments on Multi-modal CelebA-HQ and CelebAText-HQ demonstrate significant advantages of AnyFace over state-of-the-art methods. AnyFace can achieve high-quality, high-resolution, and high-diversity face synthesis and manipulation results without any constraints on the number and content of input captions.

关键词人脸生成 文本到图像生成
收录类别EI
文献类型会议论文
条目标识符http://ir.ia.ac.cn/handle/173211/48943
专题智能感知与计算研究中心
通讯作者Li, Qi
作者单位1.Center for Research on Intelligent Perception and Computing, NLPR, CASIA
2.School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS)
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Sun, Jianxin,Deng, Qiyao,Li, Qi,et al. AnyFace: Free-style Text-to-Face Synthesis and Manipulation[C],2022:18687-18696.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
AnyFace Free-style T(2409KB)会议论文 开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Sun, Jianxin]的文章
[Deng, Qiyao]的文章
[Li, Qi]的文章
百度学术
百度学术中相似的文章
[Sun, Jianxin]的文章
[Deng, Qiyao]的文章
[Li, Qi]的文章
必应学术
必应学术中相似的文章
[Sun, Jianxin]的文章
[Deng, Qiyao]的文章
[Li, Qi]的文章
相关权益政策
暂无数据
收藏/分享
文件名: AnyFace Free-style Text-to-Face Synthesis and Manipulation.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。