CASIA OpenIR

浏览/检索结果: 共1798条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:92/8  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  收藏  |  浏览/下载:39/7  |  提交时间:2024/02/23
Efficient Remote Sensing Image Super-Resolution via Lightweight Diffusion Models 期刊论文
IEEE Geoscience and Remote Sensing Letters, 2024, 卷号: 21, 页码: 1-5
作者:  An T(安泰);  Xue B(薛斌);  Huo CL(霍春雷);  Xiang SM(向世明);  Pan CH(潘春洪)
Adobe PDF(30422Kb)  |  收藏  |  浏览/下载:98/22  |  提交时间:2024/01/17
Remote sensing super-resolution  lightweight diffusion models  cross-attention mechanism  satellite imagery  
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 234-244
作者:  Wang, Jinguang;  Qian, Shengsheng;  Hu, Jun;  Hong, Richang
收藏  |  浏览/下载:6/0  |  提交时间:2024/03/26
Fake news detection  multi-modal learning  social media  
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:34/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
A new super-predefined-time convergence and noise-tolerant RNN for solving time-variant linear matrix-vector inequality in noisy environment and its application to robot arm 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2023, 页码: 17
作者:  Zheng, Boyu;  Yue, Chong;  Wang, Qianqian;  Li, Chunquan;  Zhang, Zhijun;  Yu, Junzhi;  Liu, Peter X.
收藏  |  浏览/下载:33/0  |  提交时间:2024/02/22
Linear matrix-vector inequality  Recurrent neural network  Robustness  Time-variant problem  Convergence  
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 14
作者:  Qi, Xingqun;  Sun, Muyi;  Wang, Zijian;  Liu, Jiaming;  Li, Qi;  Zhao, Fang;  Zhang, Shanghang;  Shan, Caifeng
Adobe PDF(6718Kb)  |  收藏  |  浏览/下载:73/28  |  提交时间:2024/02/22
Face photo-sketch synthesis  generative adversarial network  graph representation learning  intraclass and interclass  iterative cycle training (ICT)  
Vectorized Evidential Learning for Weakly-Supervised Temporal Action Localization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 15949-15963
作者:  Gao, Junyu;  Chen, Mengyuan;  Xu, Changsheng
收藏  |  浏览/下载:9/0  |  提交时间:2024/03/26
Uncertainty  Location awareness  Reliability  Videos  Noise measurement  Estimation  Deep learning  Weakly-supervised learning  temporal action localization  evidential deep learning  uncertainty estimation  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Enhancing Dimensional Emotion Recognition from Speech through Modulation-Filtered Cochleagram and Parallel Attention Recurrent Network 期刊论文
ELECTRONICS, 2023, 卷号: 12, 期号: 22, 页码: 15
作者:  Peng, Zhichao;  Zeng, Hua;  Li, Yongwei;  Du, Yegang;  Dang, Jianwu
收藏  |  浏览/下载:42/0  |  提交时间:2024/02/22
modulation-filtered cochleagram  parallel attention recurrent neural network  dimensional emotion recognition  auditory signal processing  noise-robust