http://iet.metastore.ingenta.com
1887

Speaker identification based on adaptive discriminative vector quantisation

Speaker identification based on adaptive discriminative vector quantisation

For access to this article, please select a purchase option:

Buy article PDF
$19.95
(plus tax if applicable)
Buy Knowledge Pack
10 articles for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IEE Proceedings - Vision, Image and Signal Processing — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A novel adaptive discriminative vector quantisation technique for speaker identification (ADVQSI) is introduced. In the training mode of ADVQSI, for each speaker, the speech feature vector space is divided into a number of subspaces. The feature space segmentation is based on the difference between the probability distribution of the speech feature vectors from each speaker and that from all speakers in the speaker identification (SI) group. Then, an optimal discriminative weight, which represents the subspace's role in SI, is calculated for each subspace of each speaker by employing adaptive techniques. The largest template differences between speakers in the SI group are achieved by using optimal discriminative weights. In the testing mode of ADVQSI, discriminative weighted average vector quantisation (VQ) distortions are used for SI decisions. The performance of ADVQSI is analysed and tested experimentally. The experimental results confirm the performance improvement employing the proposed technique in comparison with existing VQ techniques for SI and recently reported discriminative VQ techniques for SI (DVQSI).

References

    1. 1)
      • W.M. Campell , K.T. Assaleh , C.C. Broun . Speaker recognition with polynomial classifiers. IEEE Trans. Speech Audio Process. , 4 , 205 - 211
    2. 2)
    3. 3)
      • S. Furui . Recent advance in speaker recognition. Pattern Recognit. Lett. , 859 - 872
    4. 4)
      • H. Gish , M. Schmidt . Text-independent speaker identification. IEEE Signal Process. Mag. , 18 - 32
    5. 5)
      • B. Gold , N. Morgan . (2000) Speech and audio signal processing: processing and perception of speech and music.
    6. 6)
      • A. Higgins , L. Bhaler , J. Porter . Voice identification using nearest neighbour distance measure. ICASSP-93 , 375 - 378
    7. 7)
    8. 8)
      • J.P. Campbell . Speaker recognition: a tutorial. Proc. IEEE , 1437 - 1462
    9. 9)
    10. 10)
      • T. Matsui , S. Furui . Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM's. IEEE Trans. Speech Audio Process. , 3 , 456 - 459
    11. 11)
      • F.K. Soong , A.E. Rosenberg , L.R. Rabiner , B.-H. Juang . A vector quantisation approach to speaker recognition. ICASSP-85 , 387 - 390
    12. 12)
      • H. Sakoe , S. Chiba . Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. , 43 - 49
    13. 13)
      • L. Rabiner , B. Juang . (1993) Fundamentals of speech recognition.
    14. 14)
      • N.Z. Tishby . On the application of mixture AR hidden Markov models to text independent speaker recognition. IEEE Trans. Acoust. Speech Signal Process. , 563 - 570
    15. 15)
      • J. Oglesby , J.S. Mason . Optimisation of neural models for speaker identification. ICASSP-90 , 261 - 264
    16. 16)
    17. 17)
    18. 18)
    19. 19)
      • Zhou, G., Mikheal, W.B.: `Speaker identification based on discriminative vector quantisation', 46thIEEE Int. Midwest Symp. on Circuits and Systems, December 2003, Cairo, Egypt.
    20. 20)
      • Zhou, G., Mikhael, W.B.: `Analysis of discriminative vector quantization approach for speaker identification', 8thWorld Multi-Conf. on Systemic, Cybernetics and Information, 2004, Orlando, FL, USA, IV, p. 479–483.
    21. 21)
      • G. Zhou , W.B. Mikhael , B. Myers . A novel discriminative vector quantisation approach for speaker identification. J. Circuits, Syst. Comput. , 3 , 581 - 596
    22. 22)
      • A. Gray , R.M. Gray . (1991) Vector quantisation and signal compression.
    23. 23)
      • Y. Linde , A. Buzo , R.M. Gray . An algorithm for vector quantizer design. IEEE Trans. Commun. , 702 - 710
    24. 24)
      • R.O. Duda , P.E. Hart , D.G. Stork . Pattern classification.
    25. 25)
      • R. Vergin , D. O'Shaughnessy , A. Farhat . Generalised mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition. IEEE Trans. Speech Audio Process. , 525 - 532
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-vis_20050074
Loading

Related content

content/journals/10.1049/ip-vis_20050074
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address