CASIA OpenIR

浏览/检索结果: 共28条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:22/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:53/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:43/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:135/24  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Decentralized Autonomous Operations and Organizations in TransVerse: Federated Intelligence for Smart Mobility 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 卷号: 53, 期号: 4, 页码: 2062-2072
作者:  Zhao, Chen;  Dai, Xingyuan;  Lv, Yisheng;  Niu, Jinglong;  Lin, Yilun
Adobe PDF(1921Kb)  |  收藏  |  浏览/下载:201/1  |  提交时间:2023/02/22
Intelligent Transportation Systems (ITS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Cyber–Physical–Social Systems (CPSS)  
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:277/66  |  提交时间:2022/12/27
Fine-grained Emotion Decoding  Multi-view Learning  Multi-label Learning  Variational Autoencoder  Product of Experts  
ASCL: Adversarial supervised contrastive learning for defense against word substitution attacks 期刊论文
NEUROCOMPUTING, 2022, 卷号: 510, 页码: 59-68
作者:  Shi, Jiahui;  Li, Linjing;  Zeng, Daniel
Adobe PDF(1054Kb)  |  收藏  |  浏览/下载:230/27  |  提交时间:2022/11/14
Adversarial example  Adversarial training  Model robustness  Contrastive learning  Natural language processing  
Narrowing the Gap: Improved Detector Training With Noisy Location Annotations 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 6369-6380
作者:  Wang, Shaoru;  Gao, Jin;  Li, Bing;  Hu, Weiming
Adobe PDF(1489Kb)  |  收藏  |  浏览/下载:218/26  |  提交时间:2022/11/14
Annotations  Noise measurement  Detectors  Task analysis  Training  Object detection  Degradation  Object detection  noisy label  Bayesian estimation  teacher-student learning  
AHDet: A dynamic coarse-to-fine gaze strategy for active object detection 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 522-532
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Pan, Chunhong
Adobe PDF(2664Kb)  |  收藏  |  浏览/下载:301/58  |  提交时间:2022/09/19
Object detection  Active object detection  Deep reinforcement learning  Convolutional neural networks  
Meta Graph Transformer: A Novel Framework for Spatial-Temporal Traffic Prediction 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 544-563
作者:  Ye, Xue;  Fang, Shen;  Sun, Fang;  Zhang, Chunxia;  Xiang, Shiming
Adobe PDF(3491Kb)  |  收藏  |  浏览/下载:227/27  |  提交时间:2022/09/19
Traffic prediction  Spatial-temporal modeling  Meta-learning  Attention mechanism  Deep learning