Knowledge Commons of Institute of Automation,CAS
Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network | |
Dong, Meng1,2; Huang, Xuhui2; Xu, Bo2,3,4 | |
发表期刊 | PLOS ONE |
ISSN | 1932-6203 |
2018-11-29 | |
卷号 | 13期号:11页码:19 |
摘要 | Speech recognition (SR) has been improved significantly by artificial neural networks (ANNs), but ANNs have the drawbacks of biologically implausibility and excessive power consumption because of the nonlocal transfer of real-valued errors and weights. While spiking neural networks (SNNs) have the potential to solve these drawbacks of ANNs due to their efficient spike communication and their natural way to utilize kinds of synaptic plasticity rules found in brain for weight modification. However, existing SNN models for SR either had bad performance, or were trained in biologically implausible ways. In this paper, we present a biologically inspired convolutional SNN model for SR. The network adopts the time-to-first-spike coding scheme for fast and efficient information processing. A biological learning rule, spike-timing-dependent plasticity (STDP), is used to adjust the synaptic weights of convolutional neurons to form receptive fields in an unsupervised way. In the convolutional structure, the strategy of local weight sharing is introduced and could lead to better feature extraction of speech signals than global weight sharing. We first evaluated the SNN model with a linear support vector machine (SVM) on the TIDIGITS dataset and it got the performance of 97.5%, comparable to the best results of ANNs. Deep analysis on network outputs showed that, not only are the output data more linearly separable, but they also have fewer dimensions and become sparse. To further confirm the validity of our model, we trained it on a more difficult recognition task based on the TIMIT dataset, and it got a high performance of 93.8%. Moreover, a linear spike-based classifier-tempotron-can also achieve high accuracies very close to that of SVM on both the two tasks. These demonstrate that an STDP-based convolutional SNN model equipped with local weight sharing and temporal coding is capable of solving the SR task accurately and efficiently. |
关键词 | 无 |
DOI | 10.1371/journal.pone.0204596 |
关键词[WOS] | STIMULUS LOCATION ; WORD RECOGNITION ; INFORMATION ; REPRESENTATIONS ; FEATURES ; NEURONS ; TIME ; CODE |
收录类别 | SCI |
语种 | 英语 |
资助项目 | Strategic Priority Research Program of the Chinese Academy of Sciences[XDBS01070000] ; Natural Science Foundation of China[11505283] ; Independent Deployment Project of CAS Center for Excellence in Brain Science and Intelligent Technology[CEBSIT2017-02] ; NVIDIA Corporation ; Strategic Priority Research Program of the Chinese Academy of Sciences[XDBS01070000] ; Natural Science Foundation of China[11505283] ; Independent Deployment Project of CAS Center for Excellence in Brain Science and Intelligent Technology[CEBSIT2017-02] ; NVIDIA Corporation |
WOS研究方向 | Science & Technology - Other Topics |
WOS类目 | Multidisciplinary Sciences |
WOS记录号 | WOS:000451763800010 |
出版者 | PUBLIC LIBRARY SCIENCE |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/25706 |
专题 | 脑图谱与类脑智能实验室_脑网络组研究 复杂系统认知与决策实验室_听觉模型与认知计算 |
通讯作者 | Huang, Xuhui; Xu, Bo |
作者单位 | 1.Harbin Univ Sci & Technol, Sch Automat, Harbin, Heilongjiang, Peoples R China 2.Chinese Acad Sci, Res Ctr Brain Inspired Intelligence, Inst Automat, Beijing, Peoples R China 3.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China 4.Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China |
推荐引用方式 GB/T 7714 | Dong, Meng,Huang, Xuhui,Xu, Bo. Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network[J]. PLOS ONE,2018,13(11):19. |
APA | Dong, Meng,Huang, Xuhui,&Xu, Bo.(2018).Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network.PLOS ONE,13(11),19. |
MLA | Dong, Meng,et al."Unsupervised speech recognition through spike-timing-dependent plasticity in a convolutional spiking neural network".PLOS ONE 13.11(2018):19. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论