access icon openaccess Identification of glottal instants using electroglottographic signal for vulnerable cases of voicing

Robust detection of glottal instants is essential for various speech and biomedical applications. Glottal closing and glottal opening are two crucial instants/epochs of a glottal cycle. The first-order derivative of the Electroglottographic (EGG) signal demonstrates important peaks at those locations for standard voicing, but the detection of glottal instants becomes erroneous when the peak to peak amplitude of the EGG signal is very low, irregular and unpredictable. In this work, a new efficient method is proposed for identification of glottal instants from the EGG signals including the segments of the signals where the signals are feeble with irregular periodicity. The overall accuracy of detection will be enhanced by identifying the glottal instants for the whole part of the signal including the vulnerable segments of signal. As the phase of a signal is uniform in nature, the phase information of the EGG signal has been explored to detect glottal instants accurately. Under low strength of the EGG signal, the proposed method remarkably has better performance compared to the existing instants detection methods and for pathological EGG signal, the detection accuracy of glottal instants is better than other existing methods.

Inspec keywords: speech recognition; medical signal processing; medical signal detection; feature extraction

Other keywords: Electroglottographic signal; biomedical applications; glottal instant identification; pathological EGG signal; phase information; instant detection methods; glottal opening; glottal closing

Subjects: Speech processing techniques; Signal detection; Biomedical engineering; Biology and medical computing; Biomedical measurement and imaging; Speech recognition and synthesis

References

    1. 1)
      • 23. Oppenheim, A.V., Schafer, R.W., Buck, J.R.: ‘Discrete-time signal processing’ (Prentice Hall, Upper Saddle River, NJ, 1999).
    2. 2)
      • 24. Kominek, J., Black, A.W.: ‘The CMU-Arctic speech databases’. ISCA Speech Synthesis Workshop, Pittsburgh, PA, USA, 2004, pp. 222224.
    3. 3)
      • 2. Bachhav, P.B., Patil, H.A., Patel, T.B.: ‘A novel filtering based approach for epoch extraction’. Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Brisbane, Australia, 2015, pp. 47844788.
    4. 4)
    5. 5)
    6. 6)
      • 10. Murty, K.S.R., Yegnanarayana, B.: ‘Epoch extraction from speech signals’, J. Speech Lang. Hear. Res., 1990, 16, (8), pp. 245254.
    7. 7)
      • 22. Mandal, T., Sreenivasa Rao, K.: ‘Robust detection of glottal activity using unwrapped phase electroglottographic signal’. Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Calgary, Canada, 2018, pp. 55845589.
    8. 8)
    9. 9)
    10. 10)
    11. 11)
    12. 12)
    13. 13)
      • 17. Brookes, M., Naylor, P., Gudnason, J.: ‘A quantitative assessment of group delay methods for identifying glottal closures in voiced speech’, IEEE Trans. Acoust., Speech, Signal Process., 2006, 14, (2), pp. 456466.
    14. 14)
      • 4. Thotappa, D., Prasanna, S.R.M.: ‘Reference and automatic marking of glottal opening instants using egg signal’. Proc. Intl. Conf. on Signal Processing and Communications (SPCOM), Bangalore, India, 2014, pp. 42604264.
    15. 15)
      • 25. Lindsey, G., Breen, A., Nevard, S.: ‘SPAR's archivable actual word databases’ (Tech. Rep. Univ. College London, London, UK, 1987).
    16. 16)
      • 9. Jyothish Lal, G., Gopalakrishnan, E.A., Govind, D.: ‘Accurate estimation of glottal closure instants and glottal opening instants from electroglottographic signal using variational mode decomposition’, Circuits Syst. Signal Process., 2017, 37, pp. 810830.
    17. 17)
    18. 18)
    19. 19)
      • 15. Henrich, N., Roubeau, B., Castellengo, M.: ‘On the use of electroglottography for characterisation of the laryngeal mechanisms’. Proc. of the Stockholm Music Acoustics Conf., Stockholm, Sweden, 2003, pp. 69.
    20. 20)
    21. 21)
      • 12. Mathur, A., Chaudhary, N., Upadhyay, A., et al: ‘Detection of glottal closure instants from voiced speech signals using the Fourier-bessel series expansion’. Proc. Intl. Conf. on Communications and Signal Processing, Melmaruvathur, India, 2015, pp. 474478.
    22. 22)
    23. 23)
    24. 24)
      • 5. Kadiri, S.R., Yegnanarayana, B.: ‘Analysis of singing voice for epoch extraction using zero frequency filtering method’. Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Brisbane, Australia, 2015, pp. 42604264.
    25. 25)
http://iet.metastore.ingenta.com/content/journals/10.1049/htl.2019.0085
Loading

Related content

content/journals/10.1049/htl.2019.0085
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading