CASIA OpenIR

Browse/Search Results: 358 items in total, showing items 1-10

AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Pages: 1-15
Authors: Sun, Jianxin; Deng, Qiyao; Li, Qi; Sun, Muyi; Liu, Yunfan; Sun, Zhenan
Adobe PDF (16839Kb) | Views/Downloads: 41/8 | Submitted: 2024/02/23
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 14
Authors: Qi, Xingqun; Sun, Muyi; Wang, Zijian; Liu, Jiaming; Li, Qi; Zhao, Fang; Zhang, Shanghang; Shan, Caifeng
Adobe PDF (6718Kb) | Views/Downloads: 78/30 | Submitted: 2024/02/22
Keywords: Face photo-sketch synthesis; generative adversarial network; graph representation learning; intraclass and interclass; iterative cycle training (ICT)
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF (15297Kb) | Views/Downloads: 36/11 | Submitted: 2024/02/22
Keywords: Generative adversarial networks; image translation; facial attribute manipulation
PyMAF-X: Towards Well-Aligned Full-Body Model Regression From Monocular Images (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 10, Pages: 12287-12303
Authors: Zhang, Hongwen; Tian, Yating; Zhang, Yuxiang; Li, Mengcheng; An, Liang; Sun, Zhenan; Liu, Yebin
Views/Downloads: 68/0 | Submitted: 2023/11/16
Keywords: Expressive human mesh recovery; full-body motion capture; mesh alignment feedback; monocular 3D reconstruction
Motion Forecasting Network (MoFCNet): IMU-Based Human Motion Forecasting for Hip Assistive Exoskeleton (Journal article)
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, Volume: 8, Issue: 9, Pages: 5783-5790
Authors: Zhang, Xingxuan; Zhang, Haojian; Hu, Jianhua; Deng, Jieren; Wang, Yunkuan
Adobe PDF (1895Kb) | Views/Downloads: 158/70 | Submitted: 2023/11/17
Keywords: Intention recognition; prosthetics and exoskeletons; human motion forecasting
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors: Ma, Chengcheng; Liu, Yang; Deng, Jiankang; Xie, Lingxi; Dong, Weiming; Xu, Changsheng
Adobe PDF (1644Kb) | Views/Downloads: 95/12 | Submitted: 2023/11/16
Keywords: Vision-language model; prompt tuning; over-fitting; subspace learning; gradient projection
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors: Song, Yaguang; Yang, Xiaoshan; Wang, Yaowei; Xu, Changsheng
Adobe PDF (2397Kb) | Views/Downloads: 162/43 | Submitted: 2023/06/12
Keywords: Multi-modal Foundation Model; Out-of-Distribution Generalization; Visual Question Answering; Knowledge Distillation
3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Pages: early-access
Authors: Mengqi Rong; Shuhan Shen
Adobe PDF (5811Kb) | Views/Downloads: 107/36 | Submitted: 2023/09/25
BViT: Broad Attention-Based Vision Transformer (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-12
Authors: Nannan Li; Yaran Chen; Weifan Li; Zixiang Ding; Dongbin Zhao; Shuai Nie
Adobe PDF (2171Kb) | Views/Downloads: 176/48 | Submitted: 2023/06/27
Keywords: Broad attention; broad connection; image classification; parameter-free attention; vision transformer
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF (7741Kb) | Views/Downloads: 118/20 | Submitted: 2023/05/03
Keywords: Text-guided sounding-video generation; Videoaudio representation; Contrastive learning; Transformer