Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

Speaker identification based on adaptive discriminative vector quantisation

Speaker identification based on adaptive discriminative vector quantisation

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IEE Proceedings - Vision, Image and Signal Processing — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A novel adaptive discriminative vector quantisation technique for speaker identification (ADVQSI) is introduced. In the training mode of ADVQSI, for each speaker, the speech feature vector space is divided into a number of subspaces. The feature space segmentation is based on the difference between the probability distribution of the speech feature vectors from each speaker and that from all speakers in the speaker identification (SI) group. Then, an optimal discriminative weight, which represents the subspace's role in SI, is calculated for each subspace of each speaker by employing adaptive techniques. The largest template differences between speakers in the SI group are achieved by using optimal discriminative weights. In the testing mode of ADVQSI, discriminative weighted average vector quantisation (VQ) distortions are used for SI decisions. The performance of ADVQSI is analysed and tested experimentally. The experimental results confirm the performance improvement employing the proposed technique in comparison with existing VQ techniques for SI and recently reported discriminative VQ techniques for SI (DVQSI).

References

    1. 1)
    2. 2)
      • A. Gray , R.M. Gray . (1991) Vector quantisation and signal compression.
    3. 3)
      • J.P. Campbell . Speaker recognition: a tutorial. Proc. IEEE , 1437 - 1462
    4. 4)
    5. 5)
    6. 6)
      • L. Rabiner , B. Juang . (1993) Fundamentals of speech recognition.
    7. 7)
      • Zhou, G., Mikheal, W.B.: `Speaker identification based on discriminative vector quantisation', 46thIEEE Int. Midwest Symp. on Circuits and Systems, December 2003, Cairo, Egypt.
    8. 8)
      • T. Matsui , S. Furui . Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM's. IEEE Trans. Speech Audio Process. , 3 , 456 - 459
    9. 9)
      • F.K. Soong , A.E. Rosenberg , L.R. Rabiner , B.-H. Juang . A vector quantisation approach to speaker recognition. ICASSP-85 , 387 - 390
    10. 10)
    11. 11)
      • S. Furui . Recent advance in speaker recognition. Pattern Recognit. Lett. , 859 - 872
    12. 12)
      • A. Higgins , L. Bhaler , J. Porter . Voice identification using nearest neighbour distance measure. ICASSP-93 , 375 - 378
    13. 13)
      • H. Gish , M. Schmidt . Text-independent speaker identification. IEEE Signal Process. Mag. , 18 - 32
    14. 14)
      • H. Sakoe , S. Chiba . Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. , 43 - 49
    15. 15)
      • G. Zhou , W.B. Mikhael , B. Myers . A novel discriminative vector quantisation approach for speaker identification. J. Circuits, Syst. Comput. , 3 , 581 - 596
    16. 16)
      • J. Oglesby , J.S. Mason . Optimisation of neural models for speaker identification. ICASSP-90 , 261 - 264
    17. 17)
    18. 18)
      • N.Z. Tishby . On the application of mixture AR hidden Markov models to text independent speaker recognition. IEEE Trans. Acoust. Speech Signal Process. , 563 - 570
    19. 19)
      • R. Vergin , D. O'Shaughnessy , A. Farhat . Generalised mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition. IEEE Trans. Speech Audio Process. , 525 - 532
    20. 20)
      • Zhou, G., Mikhael, W.B.: `Analysis of discriminative vector quantization approach for speaker identification', 8thWorld Multi-Conf. on Systemic, Cybernetics and Information, 2004, Orlando, FL, USA, IV, p. 479–483.
    21. 21)
    22. 22)
      • W.M. Campell , K.T. Assaleh , C.C. Broun . Speaker recognition with polynomial classifiers. IEEE Trans. Speech Audio Process. , 4 , 205 - 211
    23. 23)
      • R.O. Duda , P.E. Hart , D.G. Stork . Pattern classification.
    24. 24)
      • B. Gold , N. Morgan . (2000) Speech and audio signal processing: processing and perception of speech and music.
    25. 25)
      • Y. Linde , A. Buzo , R.M. Gray . An algorithm for vector quantizer design. IEEE Trans. Commun. , 702 - 710
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-vis_20050074
Loading

Related content

content/journals/10.1049/ip-vis_20050074
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address