This is an open access article published by the IET under the Creative Commons Attribution -NonCommercial License (http://creativecommons.org/licenses/by-nc/3.0/)
Robust detection of glottal instants is essential for various speech and biomedical applications. Glottal closing and glottal opening are two crucial instants/epochs of a glottal cycle. The first-order derivative of the Electroglottographic (EGG) signal demonstrates important peaks at those locations for standard voicing, but the detection of glottal instants becomes erroneous when the peak to peak amplitude of the EGG signal is very low, irregular and unpredictable. In this work, a new efficient method is proposed for identification of glottal instants from the EGG signals including the segments of the signals where the signals are feeble with irregular periodicity. The overall accuracy of detection will be enhanced by identifying the glottal instants for the whole part of the signal including the vulnerable segments of signal. As the phase of a signal is uniform in nature, the phase information of the EGG signal has been explored to detect glottal instants accurately. Under low strength of the EGG signal, the proposed method remarkably has better performance compared to the existing instants detection methods and for pathological EGG signal, the detection accuracy of glottal instants is better than other existing methods.
References
-
-
1)
-
23. Oppenheim, A.V., Schafer, R.W., Buck, J.R.: ‘Discrete-time signal processing’ (Prentice Hall, Upper Saddle River, NJ, 1999).
-
2)
-
24. Kominek, J., Black, A.W.: ‘The CMU-Arctic speech databases’. ISCA Speech Synthesis Workshop, Pittsburgh, PA, USA, 2004, pp. 222–224.
-
3)
-
2. Bachhav, P.B., Patil, H.A., Patel, T.B.: ‘A novel filtering based approach for epoch extraction’. Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Brisbane, Australia, 2015, pp. 4784–4788.
-
4)
-
3. Kadiri, S.R., Yegnanarayana, B.: ‘Epoch extraction from emotional speech using single frequency filtering appoach’, Speech Commun., 2017, 86, pp. 52–63 (doi: 10.1016/j.specom.2016.11.005).
-
5)
-
14. Henrich, N., dAlessandro, C., Doval, B., et al: ‘On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation’, J. Acoust. Soc. Am., 2004, 115, (3), pp. 1321–1332 (doi: 10.1121/1.1646401).
-
6)
-
10. Murty, K.S.R., Yegnanarayana, B.: ‘Epoch extraction from speech signals’, J. Speech Lang. Hear. Res., 1990, 16, (8), pp. 245–254.
-
7)
-
22. Mandal, T., Sreenivasa Rao, K.: ‘Robust detection of glottal activity using unwrapped phase electroglottographic signal’. Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Calgary, Canada, 2018, pp. 5584–5589.
-
8)
-
8. Childers, D.G., Hicks, D.M., Moore, G.P., et al: ‘Electroglottography and vocal fold physiology’, J. Speech, Lang. Hear. Res., 1990, 33, (2), pp. 245–254 (doi: 10.1044/jshr.3302.245).
-
9)
-
16. Adiga, N., Prasanna, S.: ‘Detection of glottal activity using different attributes of source information’, IEEE Signal Process. Lett., 2015, 22, (11), pp. 2107–2111 (doi: 10.1109/LSP.2015.2461008).
-
10)
-
6. Deshpande, P., Manikandan, M.S.: ‘Effective glottal instant detection and electroglottographic parameter extraction for automated voice pathology assessment’, IEEE. J. Biomed. Health. Inform., 2018, 22, (2), pp. 398–408 (doi: 10.1109/JBHI.2017.2654683).
-
11)
-
11. Koutrouvelis, A.I., Kafentzis, G.P., Gaubitch, N.D., et al: ‘A fast method for high-resolution voiced/unvoiced detection and glottal closure/opening instant estimation of speech’, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2016, 24, (2), pp. 316–328 (doi: 10.1109/TASLP.2015.2506263).
-
12)
-
13. Aneeja, G., Kadiri, S.R., Yegnanarayana, B.: ‘Detection of glottal closure instants in degraded speech using single frequency filtering analysis’, INTERSPEECH, 2018, pp. 2300–2304 (doi: 10.21437/Interspeech.2018-1018).
-
13)
-
17. Brookes, M., Naylor, P., Gudnason, J.: ‘A quantitative assessment of group delay methods for identifying glottal closures in voiced speech’, IEEE Trans. Acoust., Speech, Signal Process., 2006, 14, (2), pp. 456–466.
-
14)
-
4. Thotappa, D., Prasanna, S.R.M.: ‘Reference and automatic marking of glottal opening instants using egg signal’. Proc. Intl. Conf. on Signal Processing and Communications (SPCOM), Bangalore, India, 2014, pp. 4260–4264.
-
15)
-
25. Lindsey, G., Breen, A., Nevard, S.: ‘SPAR's archivable actual word databases’ (Tech. Rep. Univ. College London, London, UK, 1987).
-
16)
-
9. Jyothish Lal, G., Gopalakrishnan, E.A., Govind, D.: ‘Accurate estimation of glottal closure instants and glottal opening instants from electroglottographic signal using variational mode decomposition’, Circuits Syst. Signal Process., 2017, 37, pp. 810–830.
-
17)
-
21. Sunil Kumar, S.B., Mandal, T., Sreenivasa Rao, K.: ‘Robust glottal activity detection using the phase of an electroglottographic signal’, Biomed. Signal Process. Control (Elsevier), 2017, 36, pp. 27–38 (doi: 10.1016/j.bspc.2017.03.007).
-
18)
-
1. Aicha, B., Noureddine, E.: ‘Electroglottographic measures based on GCI and GOI detection using multiscale product’, Int. J. Comput. Commun. Control, 2008, 3, (1), pp. 21–32 (doi: 10.15837/ijccc.2008.1.2371).
-
19)
-
15. Henrich, N., Roubeau, B., Castellengo, M.: ‘On the use of electroglottography for characterisation of the laryngeal mechanisms’. Proc. of the Stockholm Music Acoustics Conf., Stockholm, Sweden, 2003, pp. 6–9.
-
20)
-
19. Roubeau, B., Henrich, N., Castellengo, M.: ‘Laryngeal vibratory mechanisms: the notion of vocal register revisited’, J. Voice, 2009, 23, (4), pp. 425–438 (doi: 10.1016/j.jvoice.2007.10.014).
-
21)
-
12. Mathur, A., Chaudhary, N., Upadhyay, A., et al: ‘Detection of glottal closure instants from voiced speech signals using the Fourier-bessel series expansion’. Proc. Intl. Conf. on Communications and Signal Processing, Melmaruvathur, India, 2015, pp. 474–478.
-
22)
-
18. Thomas, M.R.P., Naylor, P.: ‘The Sigma algorithm: a glottal activity detector for electroglottographic signals’, IEEE Trans. Audio, Speech and Lang. Process., 2009, 17, (8), pp. 155–1566 (doi: 10.1109/TASL.2009.2022430).
-
23)
-
7. Childers, D.G., Larar, J.: ‘Electroglottography for laryngeal function assessment and speech analysis’, IEEE Trans. Biomed. Eng., 1984, 31, (12), pp. 807–817 (doi: 10.1109/TBME.1984.325242).
-
24)
-
5. Kadiri, S.R., Yegnanarayana, B.: ‘Analysis of singing voice for epoch extraction using zero frequency filtering method’. Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Brisbane, Australia, 2015, pp. 4260–4264.
-
25)
-
20. Yamauchi, A., Yokonishi, H., Imagawa, H., et al: ‘Quantification of vocal fold vibration in various laryngeal disorders using high-speed digital imaging’, J. Voice, 2015, 30, (2), pp. 205–214 (doi: 10.1016/j.jvoice.2015.04.016).
http://iet.metastore.ingenta.com/content/journals/10.1049/htl.2019.0085
Related content
content/journals/10.1049/htl.2019.0085
pub_keyword,iet_inspecKeyword,pub_concept
6
6