Speaker Recognition Using Neural Tree Networks

Part of Advances in Neural Information Processing Systems 6 (NIPS 1993)

Bibtex Metadata Paper


Kevin Farrell, Richard Mammone


A new classifier is presented for text-independent speaker recognition. The new classifier is called the modified neural tree network (MNTN). The NTN is a hierarchical classifier that combines the properties of decision trees and feed-forward neural networks. The MNTN differs from the stan(cid:173) dard NTN in that a new learning rule based on discriminant learning is used, which minimizes the classification error as opposed to a norm of the approximation error. The MNTN also uses leaf probability mea(cid:173) sures in addition to the class labels. The MNTN is evaluated for several speaker identification experiments and is compared to multilayer percep(cid:173) trons (MLPs) , decision trees, and vector quantization (VQ) classifiers. The VQ classifier and MNTN demonstrate comparable performance and per(cid:173) form significantly better than the other classifiers for this task. Addition(cid:173) ally, the MNTN provides a logarithmic saving in retrieval time over that of the VQ classifier. The MNTN and VQ classifiers are also compared for several speaker verification experiments where the MNTN is found to outperform the VQ classifier.