CASIA OpenIR

浏览/检索结果: 共81条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 595-604
作者:  Yang Liu;  Haoqin Sun;  Wenbo Guan;  Yuqi Xia;   Zhen Zhao
Adobe PDF(1966Kb)  |  收藏  |  浏览/下载:1/0  |  提交时间:2024/04/23
Speech emotion recognition (SER), 3-dimensional (3D) feature, cascaded attention network (CAN), triplet loss, joint loss  
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2534-2547
作者:  Li, Xingfeng;  Shi, Xiaohan;  Hu, Desheng;  Li, Yongwei;  Zhang, Qingchen;  Wang, Zhengxia;  Unoki, Masashi;  Akagi, Masato
收藏  |  浏览/下载:55/0  |  提交时间:2023/11/17
Affective computing  speech emotion recognition  acoustic representation  music theory and speech analysis  
A Multitask Learning Approach Based on Cascaded Attention Network and Self-Adaption Loss for Speech Emotion Recognition 期刊论文
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2023, 卷号: E106A, 期号: 6, 页码: 876-885
作者:  Liu, Yang;  Xia, Yuqi;  Sun, Haoqin;  Meng, Xiaolei;  Bai, Jianxiong;  Guan, Wenbo;  Zhao, Zhen;  LI, Yongwei
收藏  |  浏览/下载:65/0  |  提交时间:2023/11/17
speech emotion recognition  non-personalized features  cascaded attention network  multitask learning  self-adaption loss  
面向情境化语音识别的建模方法研究 学位论文
, 2023
作者:  韩明伦
Adobe PDF(9191Kb)  |  收藏  |  浏览/下载:187/18  |  提交时间:2023/06/19
Automatic Speech Recognition  Contextualized Speech Recognition  Speech Recognition Customization  Multimodal Speech Recognition  Continuous Integrate-and-Fire Mechanism  
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文
, Singapore, Singapore, 2022.05
作者:  Minglun Han;  Linhao Dong;  Zhenlin Liang;  Meng Cai;  Shiyu Zhou;  Zejun Ma;  Bo Xu
Adobe PDF(463Kb)  |  收藏  |  浏览/下载:161/45  |  提交时间:2023/05/29
Automatic Speech Recognition  Context Biasing  Speech Recognition Customization  Continuous Integrate-and-Fire Mechanism  
CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition 会议论文
, Toronto, Canada, 2021-06-06
作者:  Minglun Han;  Linhao Dong;  Shiyu Zhou;  Bo Xu
Adobe PDF(469Kb)  |  收藏  |  浏览/下载:123/37  |  提交时间:2023/05/29
Contextual Speech Recognition  Automatic Speech Recognition  Context Biasing  
TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 会议论文
, 线上会议, 2021-7-18
作者:  Fan ZY(范志赟);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(230Kb)  |  收藏  |  浏览/下载:161/43  |  提交时间:2022/09/17
pre-training  speech recognition  encoder-decoder  sequence-to-sequence  
SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文
, 新加坡, 2019-12-14
作者:  Fan ZY(范志赟);  Li J(李杰);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(361Kb)  |  收藏  |  浏览/下载:151/50  |  提交时间:2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector  
Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:  Shi, Jing;  Chang, Xuankai;  Watanabe, Shinji;  Xu, Bo
收藏  |  浏览/下载:204/0  |  提交时间:2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:  Zheng, Aihua;  Hu, Menglan;  Jiang, Bo;  Huang, Yan;  Yan, Yan;  Luo, Bin
收藏  |  浏览/下载:224/0  |  提交时间:2022/03/17
Visualization  Task analysis  Measurement  Speech recognition  Videos  Location awareness  Image recognition  Adversarial learning  audio-visual matching  cross-modal learning  metric learning