CASIA OpenIR

浏览/检索结果: 共678条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:94/8  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  收藏  |  浏览/下载:39/7  |  提交时间:2024/02/23
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 14
作者:  Qi, Xingqun;  Sun, Muyi;  Wang, Zijian;  Liu, Jiaming;  Li, Qi;  Zhao, Fang;  Zhang, Shanghang;  Shan, Caifeng
Adobe PDF(6718Kb)  |  收藏  |  浏览/下载:74/29  |  提交时间:2024/02/22
Face photo-sketch synthesis  generative adversarial network  graph representation learning  intraclass and interclass  iterative cycle training (ICT)  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:90/12  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: early-access
作者:  Mengqi Rong;  Shuhan Shen
Adobe PDF(5811Kb)  |  收藏  |  浏览/下载:104/36  |  提交时间:2023/09/25
Bag of Tricks for Training Data Extraction from Language Models 会议论文
, Hawaii, US, 2023-7
作者:  Yu, Weichen;  Pang, Tianyu;  Liu, Qian;  Du, Chao;  Kang, Bingyi;  Huang, Yan;  Yan, Shuicheng
Adobe PDF(2549Kb)  |  收藏  |  浏览/下载:162/67  |  提交时间:2023/07/03
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:173/47  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:117/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
TBERT: Dynamic BERT Inference with Top-k Based Predictors 会议论文
, Antwerp, Belgium, 2023-4-17
作者:  Liu, Zejian;  Zhao, Kun;  Cheng, Jian
Adobe PDF(3426Kb)  |  收藏  |  浏览/下载:83/21  |  提交时间:2023/06/19
Transformer  Dynamic Inference  Pruning  
Multimodal motor imagery decoding method based on temporal spatial feature alignment and fusion 期刊论文
Journal of Neural Engineering, 2023, 卷号: 20, 期号: 2, 页码: 026009
作者:  Zhang,Yukun;  Qiu,Shuang;  He,Huiguang
Adobe PDF(16882Kb)  |  收藏  |  浏览/下载:118/20  |  提交时间:2023/05/31
brain-computer interface  motor imagery  multimodal  EEG-fNIRS  center loss