http://iet.metastore.ingenta.com
1887

Age interval and gender prediction using PARAFAC2 and SVMs based on visual and aural features

Age interval and gender prediction using PARAFAC2 and SVMs based on visual and aural features

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Biometrics — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Parallel factor analysis 2 (PARAFAC2) is employed to reduce the dimensions of visual and aural features and provide ranking vectors. Subsequently, score level fusion is performed by applying a support vector machine (SVM) classifier to the ranking vectors derived by PARAFAC2 to make gender and age interval predictions. The aforementioned procedure is applied to the Trinity College Dublin Speaker Ageing database, which is supplemented with face images of the speakers and two single-modality benchmark datasets. Experimental results demonstrate the advantage of using combined aural and visual features for both prediction tasks.

References

    1. 1)
      • A. Lanitis .
        1. Lanitis, A.: ‘A survey of the effects of aging on biometric identity verification’, Int. J. Biometrics, 2010, 2, (1), pp. 3452.
        . Int. J. Biometrics , 1 , 34 - 52
    2. 2)
      • T. Kinnunen , H. Li .
        2. Kinnunen, T., Li, H.: ‘An overview of text-independent speaker recognition: from features to supervectors’, Speech Commun., 2010, 52, (1), pp. 1240.
        . Speech Commun. , 1 , 12 - 40
    3. 3)
      • R.A. Harshman .
        3. Harshman, R.A.: ‘PARAFAC2: mathematical and technical notes’. UCLA Working Papers in Phonetics, 1972, vol. 22, pp. 3047.
        . , 30 - 47
    4. 4)
      • E. Pantraki , C. Kotropoulos , A. Lanitis .
        4. Pantraki, E., Kotropoulos, C., Lanitis, A.: ‘Age interval and gender prediction using PARAFAC2 applied to speech utterances’. Proc. Int. Workshop Biometrics and Forensics, Limassol, Cyprus, March 2016, pp. 16.
        . Proc. Int. Workshop Biometrics and Forensics , 1 - 6
    5. 5)
      • M.C. Polastro , P.M.S. Eleuterio .
        5. Polastro, M.C., Eleuterio, P.M.S.: ‘Nudetective: A forensic tool to help combat child pornography through automatic nudity detection’. Proc. IEEE Int. Workshop Database and Expert Systems Applications, Bilbao, Spain, August 2010, pp. 349353.
        . Proc. IEEE Int. Workshop Database and Expert Systems Applications , 349 - 353
    6. 6)
      • F. Kelly , A. Drygajlo , N. Harte .
        6. Kelly, F., Drygajlo, A., Harte, N.: ‘Speaker verification with long-term ageing data’. Proc. IARP Int. Conf. Biometrics, New Delhi, India, March 2012, pp. 478483.
        . Proc. IARP Int. Conf. Biometrics , 478 - 483
    7. 7)
      • G. Panis , A. Lanitis , N. Tsapatsoulis .
        7. Panis, G., Lanitis, A., Tsapatsoulis, N., et al: ‘Overview of research on facial ageing using the FG-NET ageing database’, IET Biometrics, 2016, 5, (2), pp. 3746.
        . IET Biometrics , 2 , 37 - 46
    8. 8)
      • (2011)
        8. NIST Multimodal Information Group: ‘NIST 2008 speaker recognition evaluation test set’ (Linguistic Data Consortium, Philadelphia, US, 2011).
        .
    9. 9)
      • F. Kelly , N. Harte .
        9. Kelly, F., Harte, N.: ‘Effects of long-term ageing on speaker verification’. Biometrics and ID Management, 2011 (LNCS, 6583), pp. 113124.
        . Biometrics and ID Management , 113 - 124
    10. 10)
      • F. Kelly , R. Saeidi , N. Harte .
        10. Kelly, F., Saeidi, R., Harte, N., et al: ‘Effect of long-term ageing on i-vector speaker verification’. Proc. Interspeech, Singapore, September 2014, pp. 8690.
        . Proc. Interspeech , 86 - 90
    11. 11)
      • S.O. Sadjadi , S. Ganapathy , J.W. Pelecanos .
        11. Sadjadi, S.O., Ganapathy, S., Pelecanos, J.W.: ‘Speaker age estimation on conversational telephone speech using senone posterior based i-vectors’. Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Shanghai, China, March 2016, pp. 50405044.
        . Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , 5040 - 5044
    12. 12)
      • H. Liu , X. Sun .
        12. Liu, H., Sun, X.: ‘A partial least squares based ranker for fast and accurate age estimation’. Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Shanghai, China, March 2016, pp. 27922796.
        . Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , 2792 - 2796
    13. 13)
      • X. Geng , Z.H. Zhou , K. Smith-Miles .
        13. Geng, X., Zhou, Z.H., Smith-Miles, K.: ‘Automatic age estimation based on facial aging patterns’, IEEE Trans. Pattern Anal. Mach. Intell., 2007, 29, (12), pp. 22342240.
        . IEEE Trans. Pattern Anal. Mach. Intell. , 12 , 2234 - 2240
    14. 14)
      • M.S. Nixon , P.L. Correia , K. Nasrollahi .
        14. Nixon, M.S., Correia, P.L., Nasrollahi, K., et al: ‘On soft biometrics’, Pattern Recognit. Lett., 2015, 68, pp. 218230.
        . Pattern Recognit. Lett. , 218 - 230
    15. 15)
      • O.A. Arigbabu , S.M.S. Ahmad , W.A.W. Adnan .
        15. Arigbabu, O.A., Ahmad, S.M.S., Adnan, W.A.W., et al: ‘Recent advances in facial soft biometrics’, Vis. Comput., 2015, 31, (5), pp. 513525.
        . Vis. Comput. , 5 , 513 - 525
    16. 16)
      • L. Liu , J. Liu , J. Cheng .
        16. Liu, L., Liu, J., Cheng, J.: ‘Age-group classification of facial images’. Proc. IEEE Int. Conf. Machine Learning and Applications, Boca Raton, FL, USA, December 2012, pp. 693696.
        . Proc. IEEE Int. Conf. Machine Learning and Applications , 693 - 696
    17. 17)
      • G. Guo , G. Mu , Y. Fu .
        17. Guo, G., Mu, G., Fu, Y., et al: ‘Human age estimation using bio-inspired features’. Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition, Miami, FL, USA, June 2009, pp. 112119.
        . Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition , 112 - 119
    18. 18)
      • W.L. Chao , J.Z. Liu , J.J. Ding .
        18. Chao, W.L., Liu, J.Z., Ding, J.J.: ‘Facial age estimation based on label-sensitive learning and age-oriented regression’, Pattern Recognit., 2013, 46, (3), pp. 628641.
        . Pattern Recognit. , 3 , 628 - 641
    19. 19)
      • G. Levi , T. Hassner .
        19. Levi, G., Hassner, T.: ‘Age and gender classification using convolutional neural networks’. Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition Workshops, Boston, MA, USA, June 2015, pp. 3442.
        . Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition Workshops , 34 - 42
    20. 20)
      • S.E. Bekhouche , A. Ouafi , A. Benlamoudi .
        20. Bekhouche, S.E., Ouafi, A., Benlamoudi, A., et al: ‘Facial age estimation and gender classification using multi level local phase quantization’. Proc. IEEE Int. Conf. Control, Engineering & Information Technology, Tlemcen, Algeria, May 2015, pp. 14.
        . Proc. IEEE Int. Conf. Control, Engineering & Information Technology , 1 - 4
    21. 21)
      • N. Mesgarani , M. Slaney , S.A. Shamma .
        21. Mesgarani, N., Slaney, M., Shamma, S.A.: ‘Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations’, IEEE Trans. Audio Speech Lang. Process., 2006, 14, (3), pp. 920930.
        . IEEE Trans. Audio Speech Lang. Process. , 3 , 920 - 930
    22. 22)
      • Y. Panagakis , C.L. Kotropoulos , G.R. Arce .
        22. Panagakis, Y., Kotropoulos, C.L., Arce, G.R.: ‘Music genre classification via joint sparse low-rank representation of audio features’, IEEE/ACM Trans. Audio Speech Lang. Process., 2014, 22, (12), pp. 19051917.
        . IEEE/ACM Trans. Audio Speech Lang. Process. , 12 , 1905 - 1917
    23. 23)
      • P.A. Chew , B.W. Bader , T.G. Kolda .
        23. Chew, P.A., Bader, B.W., Kolda, T.G., et al: ‘Cross-language information retrieval using PARAFAC2’. Proc. ACM Int. Conf. Knowledge Discovery and Data Mining, San Jose, CA, USA, August 2007, pp. 143152.
        . Proc. ACM Int. Conf. Knowledge Discovery and Data Mining , 143 - 152
    24. 24)
      • C.C. Chang , C.J. Lin .
        24. Chang, C.C., Lin, C.J.: ‘LIBSVM: A library for support vector machines’, ACM Trans. Intell. Syst. Technol., 2011, 2, (3), pp. 27:127:27.
        . ACM Trans. Intell. Syst. Technol. , 3 , 27:1 - 27:27
    25. 25)
      • D. Turnbull , L. Barrington , D. Torres .
        25. Turnbull, D., Barrington, L., Torres, D., et al: ‘Semantic annotation and retrieval of music and sound effects’, IEEE Trans. Audio Speech Lang. Process., 2008, 16, (2), pp. 467476.
        . IEEE Trans. Audio Speech Lang. Process. , 2 , 467 - 476
    26. 26)
      • S. Yan , H. Wang , X. Tang .
        26. Yan, S., Wang, H., Tang, X., et al: ‘Learning auto-structured regressor from uncertain nonnegative labels’. Proc. IEEE Int. Conf. Computer Vision, Rio de Janeiro, Brazil, October 2007, pp. 18.
        . Proc. IEEE Int. Conf. Computer Vision , 1 - 8
    27. 27)
      • A. Lanitis , C. Draganova , C. Christodoulou .
        27. Lanitis, A., Draganova, C., Christodoulou, C.: ‘Comparing different classifiers for automatic age estimation’, IEEE Trans. Syst. Man Cybern. B, Cybern., 2004, 34, (1), pp. 621628.
        . IEEE Trans. Syst. Man Cybern. B, Cybern. , 1 , 621 - 628
    28. 28)
      • P.C. Loizou . (2007)
        28. Loizou, P.C.: ‘Speech enhancement: theory and practice’ (CRC Press, Boca Raton, FL, 2007, 2nd edn. 2013).
        .
    29. 29)
      • C. Goutte , E. Gaussier .
        29. Goutte, C., Gaussier, E.: ‘A probabilistic interpretation of precision, recall and F-score, with implication for evaluation’. Proc. Eur. Conf. Information Retrieval Research, Santiago de Compostela, Spain, March 2005, pp. 345359.
        . Proc. Eur. Conf. Information Retrieval Research , 345 - 359
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-bmt.2016.0122
Loading

Related content

content/journals/10.1049/iet-bmt.2016.0122
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address