CASIA OpenIR

浏览/检索结果: 共191条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:82/7  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文
, Washington D.C., USA, 2023-2-9
作者:  Qingyu Wang;  Tielin Zhang;  Minglun Han;  Yi Wang;  Duzhen Zhang;  Bo Xu
Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:127/41  |  提交时间:2023/06/20
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:133/50  |  提交时间:2023/06/20
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:111/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Integrating Relational Knowledge With Text Sequences for Script Event Prediction 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: early access
作者:  Zikang Wang;  Linjing Li;  Daniel Zeng
Adobe PDF(3215Kb)  |  收藏  |  浏览/下载:256/83  |  提交时间:2023/03/20
Graph-Enhanced Emotion Neural Decoding 期刊论文
IEEE Transactions on Medical Imaging, 2023, 页码: 1-1
作者:  Huang ZY(黄中昱);  Du CD(杜长德);  Wang YH;  Fu KC(付铠成);  He HG(何晖光)
Adobe PDF(6049Kb)  |  收藏  |  浏览/下载:225/51  |  提交时间:2023/05/05
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1-17
作者:  Du CD(杜长德);  Fu KC(付铠成);  Li JP(李劲鹏);  He HG(何晖光)
Adobe PDF(4669Kb)  |  收藏  |  浏览/下载:373/64  |  提交时间:2023/05/05
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:113/27  |  提交时间:2023/06/21
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2534-2547
作者:  Li, Xingfeng;  Shi, Xiaohan;  Hu, Desheng;  Li, Yongwei;  Zhang, Qingchen;  Wang, Zhengxia;  Unoki, Masashi;  Akagi, Masato
收藏  |  浏览/下载:49/0  |  提交时间:2023/11/17
Affective computing  speech emotion recognition  acoustic representation  music theory and speech analysis