CASIA OpenIR

Browse/Search results: 76 items in total, showing items 1-10

GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios Conference paper
Rhodes Island, Greece, June 2023
Authors: Li GJ(李冠君); Liu WJ(刘文举); Yi JY(易江燕); Tao JH(陶建华)
Adobe PDF(3463Kb)  |  Views/Downloads: 21/7  |  Submitted: 2024/06/06
Triple Robustness Augmentation Local Features for multi-source image registration Journal article
ISPRS Journal of Photogrammetry and Remote Sensing, 2023, Volume: 199, Issue: 0, Pages: 1-14
Authors: Changwei Wang; Lele Xu; Rongtao Xu; Shibiao Xu; Weiliang Meng; Ruisheng Wang; Xiaopeng Zhang
Adobe PDF(6581Kb)  |  Views/Downloads: 29/10  |  Submitted: 2024/05/29
GAN-Based Facial Attribute Manipulation Journal article
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 77/21  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Patch Loss: A generic multi-scale perceptual loss for single image super-resolution Journal article
Pattern Recognition, 2023, Volume: 139, Pages: 109510
Authors: An T(安泰); Mao BJ(毛彬杰); Xue B(薛斌); Huo CL(霍春雷); Xiang SM(向世明); Pan CH(潘春洪)
Adobe PDF(5876Kb)  |  Views/Downloads: 123/19  |  Submitted: 2024/01/17
Single-image super-resolution  Multi-scale loss functions  Image visual perception  Perceptual metrics  
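Research on Image Super-Resolution Algorithms for Remote Sensing Scenes (面向遥感场景的图像超分辨率算法研究) Thesis
2023
Authors: An T(安泰)
Adobe PDF(13189Kb)  |  Views/Downloads: 198/8  |  Submitted: 2024/01/17
Remote sensing image super-resolution  Deep learning  Feature fusion  Attention mechanism  Diffusion model  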
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition Journal article
APPLIED ACOUSTICS, 2023, Volume: 212, Pages: 10
Authors: Fan, Cunhang; Ding, Mingming; Yi, Jiangyan; Li, Jinpeng; Lv, Zhao
Views/Downloads: 45/0  |  Submitted: 2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
Towards Better Word Importance Ranking in Textual Adversarial Attacks Conference paper
Gold Coast, Australia, June 18-23, 2023
Authors: Shi, Jiahui; Li, Linjing; Zeng, Daniel Dajun
Adobe PDF(932Kb)  |  Views/Downloads: 248/103  |  Submitted: 2023/09/27
CONTEXT-AWARE MASK PREDICTION NETWORK FOR END-TO-END TEXT-BASED SPEECH EDITING Conference paper
Online, 2022
Authors: Wang T(汪涛)
Adobe PDF(2851Kb)  |  Views/Downloads: 85/41  |  Submitted: 2023/08/07
Singing-Tacotron: Global Duration Control Attention and Dynamic Filter for End-to-end Singing Voice Synthesis Conference paper
Online, 2022
Authors: Wang T(汪涛)
Adobe PDF(2873Kb)  |  Views/Downloads: 58/22  |  Submitted: 2023/08/07
SpecMNet: Spectrum mend network for monaural speech enhancement Journal article
APPLIED ACOUSTICS, 2022, Volume: 194, Pages: 9
Authors: Fan, Cunhang; Zhang, Hongmei; Yi, Jiangyan; Lv, Zhao; Tao, Jianhua; Li, Taihao; Pei, Guanxiong; Wu, Xiaopei; Li, Sheng
Views/Downloads: 252/0  |  Submitted: 2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM