基于空间声学特征的说话人分类算法
Speaker Classification Algorithm Based on Spatial Acoustic Feature
-
摘要: 针对先验信息缺失情况下的说话人分类问题,可以采用提取基于多距离麦克风的空间声学特征的方法进行说话人分类.为了解决由于空间声学特征维数随麦克风个数的增加而迅速增长带来的计算代价问题,需要对其进行降维处理.用一种基于空间声学特征的优化鉴别式保局投影说话人分类方法,实现了在保留空间声学特征流型结构的同时降低计算代价的效果.实验在多距离麦克风语音会议数据集上进行验证,得到在大部分数据集上本方法的分类误差率(diarization error rate,DER)得分低于传统方法的结果.结果表明:本方法的说话人分类性能比传统方法有所提高.Abstract: For the speaker classification issue with missing priori information, it could be used to extract the spatial acoustic features based on multiple distance microphone to classify speakers. Its dimension needed to be reduced to solve the problem of computational cost caused by the rapid growth of space acoustic features dimension with the increasing microphone's number. A novel optimized discriminant locality preserving projections speaker classification method was proposed, which could preserve manifold structure of spatial acoustic feature and decrease computing cost. Experiments were validated with the speech conference data set collected by the multi-distance microphones. In most of the data set the diarization error rate (DER) score of this method was lower than traditional methods. Results show that this method has the improved speaker classification performance than the traditional methods.