×
验证码:
换一张
忘记密码?
记住我
切换中国科技网通行证登录
×
切换中国科技网通行证登录
登录
中文版
|
English
中国科学院自动化研究所机构知识库
Knowledge Commons of Institute of Automation,CAS
登录
注册
ALL
ORCID
题名
作者
导师
学科领域
关键词
资助项目
文献类型
出处
会议名称
收录类别
出版者
发表日期
存缴日期
学科门类
学习讨论厅
图片搜索
粘贴图片网址
首页
研究单元&专题
作者
文献类型
知识图谱
新闻&公告
在结果中检索
研究单元&专题
多模态人工智能系统全... [6]
毕业生 [6]
09年以前成果 [3]
复杂系统认知与决策实... [2]
模式识别实验室 [2]
学术期刊 [1]
更多...
作者
陶建华 [5]
李鹏 [4]
刘文举 [4]
温正棋 [4]
徐波 [3]
梁山 [2]
更多...
文献类型
期刊论文 [12]
学位论文 [6]
会议论文 [2]
发表日期
2023 [1]
2022 [2]
2021 [2]
2019 [1]
2016 [1]
2015 [1]
更多...
语种
英语 [12]
中文 [6]
出处
IEEE-ACM T... [3]
Annual Con... [2]
IEEE TRANS... [2]
CHINESE JO... [1]
COMPUTER S... [1]
IEEE TRANS... [1]
更多...
资助项目
Cooperativ... [1]
Huawei Noa... [1]
Inria-CAS ... [1]
Inria-CAS ... [1]
Key Resear... [1]
Key Resear... [1]
更多...
收录类别
SCI [11]
EI [2]
导师
刘文举 [3]
张树武 [1]
徐波 [1]
李成荣 [1]
资助机构
National N... [3]
Cooperativ... [1]
Huawei Noa... [1]
Inria-CAS ... [1]
Key Resear... [1]
Key Resear... [1]
更多...
×
知识图谱
CASIA OpenIR
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共20条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
发表日期升序
发表日期降序
期刊影响因子升序
期刊影响因子降序
作者升序
作者降序
提交时间升序
提交时间降序
WOS被引频次升序
WOS被引频次降序
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2534-2547
作者:
Li, Xingfeng
;
Shi, Xiaohan
;
Hu, Desheng
;
Li, Yongwei
;
Zhang, Qingchen
;
Wang, Zhengxia
;
Unoki, Masashi
;
Akagi, Masato
收藏
  |  
浏览/下载:98/0
  |  
提交时间:2023/11/17
Affective computing
speech emotion recognition
acoustic representation
music theory and speech analysis
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:
Wang, Tao
;
Yi, Jiangyan
;
Fu, Ruibo
;
Tao, Jianhua
;
Wen, Zhengqi
收藏
  |  
浏览/下载:245/0
  |  
提交时间:2022/09/19
Speech processing
Decoding
Predictive models
Acoustics
Transfer learning
Training
Task analysis
Coarse-to-fine decoding
mask prediction
one-shot learning
text-based speech editing
text-to-speech
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 338-351
作者:
Zheng, Aihua
;
Hu, Menglan
;
Jiang, Bo
;
Huang, Yan
;
Yan, Yan
;
Luo, Bin
收藏
  |  
浏览/下载:266/0
  |  
提交时间:2022/03/17
Visualization
Task analysis
Measurement
Speech recognition
Videos
Location awareness
Image recognition
Adversarial learning
audio-visual matching
cross-modal learning
metric learning
On Learning Semantic Representations for Large-Scale Abstract Sketches
期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 卷号: 31, 期号: 9, 页码: 3366-3379
作者:
Xu, Peng
;
Huang, Yongye
;
Yuan, Tongtong
;
Xiang, Tao
;
Hospedales, Timothy M.
;
Song, Yi-Zhe
;
Wang, Liang
收藏
  |  
浏览/下载:215/0
  |  
提交时间:2021/11/03
Semantics
Visualization
Task analysis
Games
Feature extraction
Quantization (signal)
Speech recognition
Practical sketch-based application
semantic representation
hashing
retrieval
zero-shot recognition
edge-map dataset
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition
期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:
Fan, Cunhang
;
Yi, Jiangyan
;
Tao, Jianhua
;
Tian, Zhengkun
;
Liu, Bin
;
Wen, Zhengqi
Adobe PDF(2534Kb)
  |  
收藏
  |  
浏览/下载:435/57
  |  
提交时间:2021/03/08
Speech enhancement
Speech recognition
Training
Noise measurement
Logic gates
Acoustic distortion
Task analysis
Gated recurrent fusion
robust end-to-end speech recognition
speech distortion
speech enhancement
speech transformer
Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform
期刊论文
International Journal of Automation and Computing, 2019, 卷号: 16, 期号: 4, 页码: 427-436
作者:
Simone Hantke
;
Tobias Olenyi
;
Christoph Hausner
;
Tobias Appel
;
Björn Schuller
浏览
  |  
Adobe PDF(1345Kb)
  |  
收藏
  |  
浏览/下载:142/49
  |  
提交时间:2021/02/22
Human computation
speech analysis
crowdsourcing
gamified data collection
survey.
Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis
会议论文
Annual Conference of the International Speech Communication Association (INTERSPEECH), 美国, 2012
作者:
Wen, Zhengqi
;
Kawahara, Hideki
;
Tao, Jianhua
;
Zhengqi Wen
收藏
  |  
浏览/下载:33/0
  |  
提交时间:2020/10/27
Speech Parametric Representation
Pitch-scaled Analysis
Voicing Cut-off Frequency
Principal Component Analysis
Monaural speech separation based on MAXVQ and CASA for robust speech recognition
期刊论文
COMPUTER SPEECH AND LANGUAGE, 2010, 卷号: 24, 期号: 1, 页码: 30-44
作者:
Li, Peng
;
Guan, Yong
;
Wang, Shijin
;
Xu, Bo
;
Liu, Wenju
收藏
  |  
浏览/下载:82/0
  |  
提交时间:2020/10/27
Monaural Speech Separation
Computational Auditory Scene Analysis (Casa)
Factorial-max Vector Quantization (Maxvq)
Automatic Speech Recognition (Asr)
Monaural voiced speech segregation based on elaborate harmonic grouping strategies
期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2011, 卷号: 54, 期号: 12, 页码: 2471-2480
作者:
Liu WenJu
;
Zhang XueLiang
;
Jiang Wei
;
Li Peng
;
Xu Bo
收藏
  |  
浏览/下载:87/0
  |  
提交时间:2020/10/27
Computational Auditory Scene Analysis
Voiced Speech Separation
Harmonistic Principle
Minimum Amplitude Principle
Elaborate Harmonic Grouping Strategies
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement
期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 卷号: 82, 期号: 2, 页码: 141-150
作者:
Liu, Bin
;
Tao, Jianhua
;
Wen, Zhengqi
;
Mo, Fuyuan
;
Bin Liu
收藏
  |  
浏览/下载:71/0
  |  
提交时间:2020/10/27
Analysis-synthesis Framework
Multi-band Summary Correlogram
Denoising Autoencoder
Speech Enhancement
Speech Coding