Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

Score bi-Gaussian equalisation for multimodal person verification

Score bi-Gaussian equalisation for multimodal person verification

For access to this article, please select a purchase option:

Buy article PDF
$19.95
(plus tax if applicable)
Buy Knowledge Pack
10 articles for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Signal Processing — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Multimodal biometric fusion at score level can be performed by means of combinatory or classificatory techniques. In the first case, it is straightforward that the normalisation of the scores is a very important issue for the success of the fusion process. In the classificatory approach as, for instance, in support vector machine (SVM)-based systems, only simple normalisation methods are usually applied. In this work, histogram equalisation of biometric score distribution is successfully applied in a multimodal person verification system composed by prosodic, speech spectrum and face information. Furthermore, a new bi-Gaussian equalisation (BGEQ) is introduced, which takes into account the separate statistics of the genuine and impostor scores by using as a reference a sum of two Gaussian functions, whose standard deviations model the overlap between the genuine and impostor lobes of the original distributions. Multimodal verification experiments are shown, where prosodic and speech spectrum scores are provided by speech experts using the Switchboard-I database, and face scores are obtained by a face recognition expert using XM2VTS database. BGEQ in combination with an SVM fusion system with a polynomial kernel has obtained the best results and has outperformed in more than a 21.29% the results obtained by min–max normalisation.

References

    1. 1)
      • Stolcke, A., Ferrer, L., Kajarekar, S., Shrigerg, E., Venkataraman, A.: `MLLR transforms as features in speaker recognition', Proc. Eurospeech, 2005, Lisbon, p. 2425–2428.
    2. 2)
      • A. Jain . (1986) Fundamentals of digital image processing.
    3. 3)
      • Pelenacos, J., Sridharan, S.: `Feature warping for robust speaker verification', Proc. ISCA Workshop on Speaker Recognition – 2001: A Speaker Oddyssey, June 2001, p. 213–218.
    4. 4)
      • Indovina, M., Uludag, U., Snelick, R., Mink, A., Jain, A.: `Multimodal biometric authentication methods: a COTS approach', Proc. MMUA 2003, Workshop on Multimodal User Authentication, 11–12 December 2003, Santa Barbara, CA, p. 99–106.
    5. 5)
      • P.A. Devijver , J. Kittler . (1982) Pattern recognition: a statistical approach.
    6. 6)
      • R. Auckenthaler , M. Carey , H. Lloyd-Thomas . Score normalization for text-independent speaker verification systems. Digit. Signal Process. , 42 - 54
    7. 7)
      • Lucey, S., Chen, T.: `Improved audio-visual speaker recognition via the use of a hybrid combination strategy', Presented at the Fourth Int. Conf. Audio- and Video-Based Biometric Person Authentication, 2003, Guildford, UK.
    8. 8)
      • Tefas, A., Zafeiriou, S., Pitas, I.: `Discriminant NMFfaces for frontal face verification', Proc. of IEEE Int. Workshop on Machine Learning for Signal Processing (MLSP 2005), 28–30 September 2005, Mystic, Connecticut.
    9. 9)
      • R.M. Bolle , J.H. Connell , S. Pankanti , N.K. Ratha , A.W. Senior . (2004) Guide to biometrics.
    10. 10)
      • N. Poh , J. Kittler . Incorporating model-specific score distribution in speaker verification systems. IEEE Trans. Audio Speech Lang. , 3 , 594 - 606
    11. 11)
      • J.J. Wolf . Efficient acoustic parameters for speaker recognition. J. Acoust. Soc. Am. , 2044 - 2056
    12. 12)
      • C. Sanderson . (2008) Biometric person recognition: face, speech and fusion.
    13. 13)
      • Fox, N.A., Gross, R., Chazal, P., Cohn, J.F., Reilly, R.B.: `Person identification using automatic integration of speech, lip and face experts', Presented at ACM SIGMM 2003 Multimedia Biometrics Methods and Applications Workshop, 2003, Berkeley, CA.
    14. 14)
      • F.R. Hampel , E.M. Ronchetti , P.J. Rousseeuw , W.A. Stahel . (1986) Robust statistics: the approach based on influence functions.
    15. 15)
      • A.K. Jain , K. Nandakumar , A. Ross . Score normalization in multimodal biometric systems. Pattern Recognit. , 12 , 2270 - 2285
    16. 16)
      • P. Ejarque , A. Garde , J. Anguita , J. Hernando . On the use of genuine-impostor statistical information for score fusion in multimodal biometrics. Multimodal Biometrics in Annals of Telecommunication , 109 - 129
    17. 17)
      • Wang, Y., Wang, Y., Tan, T.: `Combining fingerprint and voiceprint biometrics for identity verification: and experimental comparison', Presented at ICBA 2004, 2004, Hong Kong, China.
    18. 18)
      • A. Ross , K. Nandakumar , A. Jain . (2006) Handbook of multibiometrics’, International Series on Biometrics.
    19. 19)
      • Farrús, M., Garde, A., Ejarque, P., Luque, J., Hernando, J.: `On the fusion of prosody, voice spectrum and face features for multimodal person verification', Proc. Interspeech 2006, September 2006, Pittsburgh, USA.
    20. 20)
      • N. Cristianini , J. Shawe-Taylor . (2000) An introduction to support vector machines (and other kernel-based learning methods).
    21. 21)
      • R.O. Duda , P.E. Hart , D.G. Stork . Pattern classification.
    22. 22)
      • Ejarque, P., Hernando, J.: `Variance reduction by using separate genuine-impostor statistics in multimodal biometrics', Proc. Interspeech 2005, September 2005, Lisbon, Portugal, p. 785–788.
    23. 23)
      • Nadeu, C., Hernando, J., Gorricho, M.: `On the decorrelation of filter bank energies in speech recognition', Presented at Eurospeech, 1995.
    24. 24)
      • Balchandran, R., Mammone, R.: `Non parametric estimation and correction of non-linear distortion in speech systems', Proc. IEEE Int. Conf. Acoust. Speech Signal Proc., 1998.
    25. 25)
      • J.R. Vacca . (2007) Biometric technologies and verification systems.
    26. 26)
      • Lüttin, J., Maître, G.: `Evaluation protocol for the extended M2VTS database (XM2VTSDB)', IDIAP Communication 98-05 (1998), Martigny, Switzerland.
    27. 27)
      • Godfrey, J.J., Holliman, E.C., McDaniel, J.: `Switchboard: telephone speech corpus for research and development', Presented at ICASSP, 1990.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-spr.2008.0159
Loading

Related content

content/journals/10.1049/iet-spr.2008.0159
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address