CASIA OpenIR

浏览/检索结果: 共27条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:12/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:44/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:125/23  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:259/66  |  提交时间:2022/12/27
Decoding  Brain modeling  Functional magnetic resonance imaging  Predictive models  Emotion recognition  Dimensionality reduction  Pattern recognition  Fine-grained emotion decoding  multi-label learning  multi-view learning  product of experts (PoEs)  variational autoencoder  
ASCL: Adversarial supervised contrastive learning for defense against word substitution attacks 期刊论文
NEUROCOMPUTING, 2022, 卷号: 510, 页码: 59-68
作者:  Shi, Jiahui;  Li, Linjing;  Zeng, Daniel
Adobe PDF(1054Kb)  |  收藏  |  浏览/下载:221/25  |  提交时间:2022/11/14
Adversarial example  Adversarial training  Model robustness  Contrastive learning  Natural language processing  
Narrowing the Gap: Improved Detector Training With Noisy Location Annotations 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 6369-6380
作者:  Wang, Shaoru;  Gao, Jin;  Li, Bing;  Hu, Weiming
Adobe PDF(1489Kb)  |  收藏  |  浏览/下载:213/26  |  提交时间:2022/11/14
Annotations  Noise measurement  Detectors  Task analysis  Training  Object detection  Degradation  Object detection  noisy label  Bayesian estimation  teacher-student learning  
AHDet: A dynamic coarse-to-fine gaze strategy for active object detection 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 522-532
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Pan, Chunhong
Adobe PDF(2664Kb)  |  收藏  |  浏览/下载:294/57  |  提交时间:2022/09/19
Object detection  Active object detection  Deep reinforcement learning  Convolutional neural networks  
Meta Graph Transformer: A Novel Framework for Spatial-Temporal Traffic Prediction 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 544-563
作者:  Ye, Xue;  Fang, Shen;  Sun, Fang;  Zhang, Chunxia;  Xiang, Shiming
Adobe PDF(3491Kb)  |  收藏  |  浏览/下载:218/25  |  提交时间:2022/09/19
Traffic prediction  Spatial-temporal modeling  Meta-learning  Attention mechanism  Deep learning  
SpecMNet: Spectrum mend network for monaural speech enhancement 期刊论文
APPLIED ACOUSTICS, 2022, 卷号: 194, 页码: 9
作者:  Fan, Cunhang;  Zhang, Hongmei;  Yi, Jiangyan;  Lv, Zhao;  Tao, Jianhua;  Li, Taihao;  Pei, Guanxiong;  Wu, Xiaopei;  Li, Sheng
收藏  |  浏览/下载:220/0  |  提交时间:2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM