Speaker identification based on adaptive discriminative vector quantisation

G. Zhou; W.B. Mikhael

DOI: 10.1049/ip-vis:20050074

IEE Proceedings - Vision, Image and Signal Processing

Author(s): G. Zhou¹ and W.B. Mikhael¹
Affiliation: ¹ Department of Electrical and Computer Engineering, University of Central Florida, Orlando, USA
Source: Volume 153, Issue 6, December 2006, pp. 754–760
DOI: 10.1049/ip-vis:20050074, Print ISSN 1350-245X, Online ISSN 1359-7108


A novel adaptive discriminative vector quantisation technique for speaker identification (ADVQSI) is introduced. In the training mode of ADVQSI, for each speaker, the speech feature vector space is divided into a number of subspaces. The feature space segmentation is based on the difference between the probability distribution of the speech feature vectors from each speaker and that from all speakers in the speaker identification (SI) group. Then, an optimal discriminative weight, which represents the subspace's role in SI, is calculated for each subspace of each speaker by employing adaptive techniques. Using the optimal discriminative weights maximises the template differences between speakers in the SI group. In the testing mode of ADVQSI, discriminative weighted average vector quantisation (VQ) distortions are used for SI decisions. The performance of ADVQSI is analysed and tested experimentally. The experimental results confirm that the proposed technique improves performance in comparison with existing VQ techniques for SI and recently reported discriminative VQ techniques for SI (DVQSI).
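As a rough illustration of the testing mode described in the abstract, the sketch below scores a test utterance against each speaker by a discriminative weighted average of VQ distortions and picks the speaker with the smallest score. The abstract gives no formulas, so everything specific here is an assumption rather than the authors' method: subspaces are modelled as nearest-centre cells, the weights are taken as already trained (the adaptive weight calculation is not shown), and the names `weighted_distortion`, `identify`, `subspace_centres` and the toy data are hypothetical.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def vq_distortion(x: Vector, codebook: List[Vector]) -> float:
    """Distortion of x: squared distance to its nearest codeword."""
    return min(sq_dist(x, c) for c in codebook)


def nearest_index(x: Vector, centres: List[Vector]) -> int:
    """Index of the centre closest to x (ASSUMED subspace assignment)."""
    return min(range(len(centres)), key=lambda i: sq_dist(x, centres[i]))


def weighted_distortion(frames: List[Vector], codebook: List[Vector],
                        subspace_centres: List[Vector],
                        weights: List[float]) -> float:
    """Weighted average VQ distortion of an utterance for one speaker.

    Each frame's distortion is scaled by the weight of the subspace it
    falls into; weights are assumed to come from a training stage.
    """
    num = 0.0
    den = 0.0
    for x in frames:
        k = nearest_index(x, subspace_centres)
        num += weights[k] * vq_distortion(x, codebook)
        den += weights[k]
    return num / den if den > 0 else float("inf")


def identify(frames: List[Vector], models: Dict[str, dict]) -> str:
    """Return the speaker whose model gives the smallest weighted distortion."""
    return min(models, key=lambda name: weighted_distortion(frames, **models[name]))


# Toy two-speaker example with made-up 2-D features and weights.
models = {
    "A": {"codebook": [(0.0, 0.0), (1.0, 1.0)],
          "subspace_centres": [(0.0, 0.0), (5.0, 5.0)],
          "weights": [1.0, 0.5]},
    "B": {"codebook": [(5.0, 5.0), (6.0, 6.0)],
          "subspace_centres": [(0.0, 0.0), (5.0, 5.0)],
          "weights": [1.0, 0.5]},
}
frames = [(0.1, 0.0), (0.9, 1.1)]
print(identify(frames, models))  # prints A
```

Setting every weight to the same value reduces the score to the plain average VQ distortion, which is one way to compare a weighted decision against an unweighted VQ baseline on the same codebooks.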

