Speaker Segmentation Based on Sparse Neural Network

MA Yong; BAO Zhang-chun

doi:10.11936/bjutxb2014050063

MA Yong, BAO Zhang-chun. Speaker Segmentation Based on Sparse Neural Network[J]. Journal of Beijing University of Technology, 2015, 41(5): 662-667. DOI: 10.11936/bjutxb2014050063

Abstract

A speaker segmentation method based on a sparse neural network is presented. A speaker factor feature is extracted from the super-vector feature of the speech signal using a sparse neural network with one hidden layer. K-means clustering then assigns a label to every speech frame, and these labels are used to segment the different speakers. Over-fitting is addressed by applying dropout during training of the sparse network. Evaluation on a multi-speaker audio stream corpus generated from the TIMIT database shows that segmentation performance improves as the number of hidden nodes in the sparse network increases, and that the proposed algorithm outperforms both the Bayesian information criterion (BIC) method and the sparse auto-encoder method.
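
The pipeline outlined in the abstract (one-hidden-layer feature extraction, K-means labelling of frames, segmentation at label changes) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no details of the network, its training objective, or the super-vector extraction, so the sigmoid activation, the inverted-dropout scaling, the farthest-point K-means initialisation, and the function names `hidden_features`, `kmeans_labels`, and `change_points` are all assumptions made for illustration.

```python
import numpy as np


def hidden_features(x, w, b, drop_rate=0.0, rng=None):
    """Single hidden layer (assumed sigmoid) with optional inverted dropout.

    x: (frames, input_dim) super-vector-like features (hypothetical input)
    w: (input_dim, hidden_dim) weights, b: (hidden_dim,) biases
    """
    h = 1.0 / (1.0 + np.exp(-(x @ w + b)))
    if drop_rate > 0.0:
        if rng is None:
            rng = np.random.default_rng(0)
        # Randomly zero hidden units during training and rescale the rest.
        mask = rng.random(h.shape) >= drop_rate
        h = h * mask / (1.0 - drop_rate)
    return h


def kmeans_labels(feats, k, iters=20):
    """Plain K-means with a deterministic farthest-point initialisation (assumed)."""
    feats = np.asarray(feats, dtype=float)
    centers = [feats[0]]
    for _ in range(1, k):
        # Pick the frame farthest from all centers chosen so far.
        d = np.min([np.sum((feats - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(feats[np.argmax(d)])
    centers = np.array(centers, dtype=float)
    labels = np.zeros(len(feats), dtype=int)
    for _ in range(iters):
        dist = ((feats[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = feats[labels == j].mean(axis=0)
    return labels


def change_points(labels):
    """Frame indices where the cluster label differs from the previous frame."""
    labels = np.asarray(labels)
    return (np.flatnonzero(labels[1:] != labels[:-1]) + 1).tolist()


# Toy example: ten frames, the first five from one "speaker", the rest from another.
toy_feats = np.vstack([np.zeros((5, 2)), np.full((5, 2), 5.0)])
toy_labels = kmeans_labels(toy_feats, k=2)
toy_boundaries = change_points(toy_labels)
```

In this toy setting the only boundary falls at the frame where the features switch from one group to the other. In practice the clustering would operate on the hidden-layer output of a trained network rather than on raw toy vectors, and dropout would be active only during training.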
