http://iet.metastore.ingenta.com
1887

On the use of quality measures in face and speaker identity verification based on video and audio streams

On the use of quality measures in face and speaker identity verification based on video and audio streams

For access to this article, please select a purchase option:

Buy article PDF
$19.95
(plus tax if applicable)
Buy Knowledge Pack
10 articles for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Signal Processing — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

This study addresses the advantage of adding quality information of the biometric signals into a multimedia-based (video and audio) identity verification system. The quality information of the biometric signals can be used in several ways and stages in the biometric system. In this study, the authors introduce quality-based decisions in two stages: score normalisation and frame selection. Quality-based score normalisation helps to handle quality dependent drifts in the scores distributions. We derive a necessary and sufficient condition for reducing error when introducing quality-based score normalisation and present a score normalising technique. Additionally, the number of frontal faces and speech vectors extracted from the video and audio streams allows quality-based selection of frames, both in training and test, to preserve quality in the statistical representation of the signals. For these two stages we defined some quality measures for speaker and frontal face signals and run experiments to show the reliability of the proposed techniques over the BANCA database.

References

    1. 1)
      • P.S. Aleksic , A.K. Katsaggelos . Audio-visual biometrics. Proc. IEEE , 11 , 2025 - 2043
    2. 2)
      • Phillips, P.J., Grother, P., Micheals, R.J., Blackburn, D.M., Tabassi, E., Bone, M.: `Face recognition vendor test', Evaluation Report. Technical Report, NISTIR, 2003.
    3. 3)
      • J.L. Alba-Castro , D. González-Jiménez , E. Argones-Rúa , E. González-Agulla , E. Otero-Muras . Pose-corrected face processing on video sequences for webcam-based remote biometric authentication. J. Electron. Imag. , 011004 - 011001
    4. 4)
      • D.A. Reynolds , T.F. Quatieri , R.B. Dunn . Speaker verification using adapted Gaussian mixture models. Digit. Signal Process. , 19 - 41
    5. 5)
      • J.L. Gauvain , C.H. Lee . Maximum a posteriori estimation for multivariate Gaussian mixture observationsof Markov chains. IEEE Trans. Speech Audio Process. , 2 , 291 - 298
    6. 6)
      • J. Fiérrez-Aguilar , J. Ortega-García , J. González-Rodríguez , J. Bigün . Discriminative multimodal biometric authentication based on quality measures. Pattern Recognit. , 5 , 777 - 779
    7. 7)
      • K. Kryszczuk , A. Drygajlo . (2007) Q – stack: uni- and multimodal classifier stacking with quality measures.
    8. 8)
      • K. Kryszczuk , A. Drygajlo . (2007) Improving classification with class-independent quality measures: Q-stack in face verification.
    9. 9)
      • K. Nandakumar , Y. Chen , S.C. Dass , A.K. Jain . Likelihood ratio-based biometric score fusion. IEEE Trans. Pattern Anal. Mach. Intell. , 2 , 342 - 347
    10. 10)
      • Argones-Rúa, E., Alba-Castro, J.L., García-Mateo, C.: `Quality-based score normalization and frame selection for video-based person authentication', Proc. First COST 2101 Workshop on Biometrics and Identity Management (BIOID 2008), May 2008.
    11. 11)
      • E. Argones-Rúa , J.L. Alba-Castro , C. García-Mateo . (2008) Quality-based score normalization for audiovisual person authentication.
    12. 12)
      • J. Fiérrez-Aguilar , Y. Chen , J. Ortega-García , A.K. Jain , D. Zhang , A.K. Jain . Incorporating image quality in multi-algorithm fingerprint verification.
    13. 13)
      • B.-B. Enrique , S. Bengio , F. Bimbot . (2003) The BANCA database and evaluation protocol.
    14. 14)
      • (2007) Speech processing, transmission and quality aspects (STQ); distributed speech recognition; front-end feature extraction algorithm; compression algorithm.
    15. 15)
      • Rainer, L., Maydt, J.: `An extended set of haar-like features for rapid object detection', Proceedings of the 2002 International Conference on Image Processing, 2002, 1, p. 900–903.
    16. 16)
      • L. Wiskott , J.-M. Fellous , N. Krüger , C. von der Malsburg . Face recognition by elastic bunch graph matching. IEEE Trans. Pattern Analysis Mach. Intell. , 7 , 775 - 779
    17. 17)
      • S. Bengio , L. Maríethoz . A statistical significance test for person authentication. ODYSSEY 2004 – The Speaker and Language Recognition Workshop , 237 - 244
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-spr.2008.0170
Loading

Related content

content/journals/10.1049/iet-spr.2008.0170
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address