CASIA OpenIR

浏览/检索结果: 共124条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
文本无关说话人识别中句级特征提取方法研究综述 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 3, 页码: 664-688
作者:  陈晨;  韩纪庆;  陈德运;  何勇军
Adobe PDF(2278Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/05/20
说话人识别  句级特征提取  任务分段式策略  任务驱动式策略  联合学习  
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 153-168
作者:  Zefa Hu;  Ziyi Ni;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1525Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/04/23
Medical dialogue understanding, information extraction, text generation, knowledge-enhanced prompt, low-resource setting, data augmentation  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:20/1  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 595-604
作者:  Yang Liu;  Haoqin Sun;  Wenbo Guan;  Yuqi Xia;   Zhen Zhao
Adobe PDF(1966Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/04/23
Speech emotion recognition (SER), 3-dimensional (3D) feature, cascaded attention network (CAN), triplet loss, joint loss  
Opportunities and challenges for biometrics 专著
Switzerland:Springer, 2020
作者:  Sun, Zhenan;  Li, Qi;  Liu, Yunfan;  Zhu, Yuhao
Adobe PDF(590Kb)  |  收藏  |  浏览/下载:76/32  |  提交时间:2024/02/23
End-to-End Paired Ambisonic-Binaural Audio Rendering 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 502-513
作者:  Yin Zhu;  Qiuqiang Kong;  Junjie Shi;  Shilei Liu;  Xuzhou Ye;  Ju-Chiang Wang;  Hongming Shan;  Junping Zhang
Adobe PDF(9612Kb)  |  收藏  |  浏览/下载:57/18  |  提交时间:2024/01/23
Ambisonic  attention  binaural rendering  neural network  
CONTEXT-AWARE MASK PREDICTION NETWORK FOR END-TO-END TEXT-BASED SPEECH EDITING 会议论文
, Online, 2022
作者:  Wang T(汪涛)
Adobe PDF(2851Kb)  |  收藏  |  浏览/下载:79/38  |  提交时间:2023/08/07
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:187/69  |  提交时间:2023/07/06
Audio-driven Dubbing for User Generated Contents via Style-aware Semi-parametric Synthesis 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 卷号: 33, 期号: 3, 页码: 1247 - 1261
作者:  Song LS(宋林森);  Wu WY(吴文岩);  Fu CY(傅朝友);  Loy, Chen Change;  He R(赫然)
Adobe PDF(8629Kb)  |  收藏  |  浏览/下载:113/47  |  提交时间:2023/06/29
Talking Face Generation  Video Generation  GAN  Thin-plate Spline  
Everybody’s Talkin’: Let Me Talk as You Want 期刊论文
IEEE Transactions on Information Forensics and Security, 2022, 卷号: 17, 期号: 1, 页码: 585 - 598
作者:  宋林森;  吴文岩;  钱晨;  赫然;  Loy, Chen Change
Adobe PDF(15432Kb)  |  收藏  |  浏览/下载:79/11  |  提交时间:2023/06/29
Talking face generation  Video generation  GAN  Audio dubbing