© The Institution of Engineering and Technology
Jitter and shimmer appear as symptoms of disordered voices and they are defined as the cycle-to-cycle variation of the cycle length and cycle amplitude, respectively. The frequency modulation of the phonatory excitation, i.e. vocal jitter, causes amplitude modulation (shimmer) of the speech signal due to the action of the vocal tract transfer function. In this Letter, the authors investigate the extent to which the vocal tract filtering affects the accuracy of measurement of vocal cycle perturbations of the phonatory excitation. To achieve this goal, the direct digital synthesiser is used to generate a series of glottal excitation cycles with the desired amount of vocal perturbations that crosses the vocal tract model to produce a set of 231 stimuli of sustained vowels [a]. Praat is used to measure the amount of perturbations. Results show that the values of the input shimmer inserted in the glottal excitation are smaller than the values measured by Praat. An undesired amount of shimmer is generated in the speech signal when the glottal excitation signal perturbed with jitter crosses through the vocal tract.
References
-
-
1)
-
17. Rao, R.V., Savsani, V.J., Vakharia, D.P.: ‘Teaching–learning-based optimization: a novel method for constrained mechanical design optimization problems’, Comput.-Aided Des., 2011, 43, (3), pp. 303–315 (doi: 10.1016/j.cad.2010.12.015).
-
2)
-
5. Teixeira, J.P., Gonçalves, A.: ‘Accuracy of jitter and shimmer measurements’, Procedia Technol., 2014, 16, pp. 1190–1199, doi: 10.1016/j.protcy.2014.10.134.
-
3)
-
1. Dejonckere, P.H.: ‘Gender differences in the prevalence of occupational voice disorders’, in Dejonckere, P.H. (Ed): ‘Occupational voice: care and cure’ (Kugler Publications, The Hague, The Netherlands, 2001), pp. 11–20.
-
4)
-
16. Rruqja, N., Dejonckere, P., Cantarella, G., et al: ‘Testing software tools with synthesized deviant voices for medicolegal assessment of occupational dysphonia’, Biomed. Signal Process. Control, 2014, 13, pp. 71–78, doi: 10.1016/j.bspc.2014.03.011.
-
5)
-
7. Boersma, P., Weenink, D.: ‘Praat: doing phonetics by computer [computer program]. Version 6.0.43’, 2018. .
-
6)
-
10. Gerratt, B.R., Kreiman, J.: ‘Measuring vocal quality with speech synthesis’, J. Acoust. Soc. Am., 2001, 110, (5), pp. 2560–2566, doi: 10.1121/1.1409969.
-
7)
-
8. Fant, G., Liljencrants, J., Lin, Q.G.: ‘A four-parameter model of glottal flow’, STL-QPSR, 1985, 26, (4), pp. 1–13.
-
8)
-
2. Cantürk, I., Karabiber, F.: ‘A machine learning system for the diagnosis of Parkinson's disease from speech signals and its application to multiple speech signal types’, Arab. J. Sci. Eng., 2016, 41, (12), pp. 5049–5059, doi: 10.1007/s13369-016-2206-3.
-
9)
-
15. Schoentgen, J.: ‘Stochastic models of jitter’, J. Acoust. Soc. Am., 2001, 109, (4), pp. 1631–1650, doi: 10.1121/1.1350557.
-
10)
-
13. Rao, K.V.: ‘Power consumption optimization strategy in micro ball-end milling of D2 steel via TLBO coupled with 3D FEM simulation’, Measurement, 2019, 132, pp. 68–78, doi: 10.1016/j.measurement.2018.09.044.
-
11)
-
6. Bennane, Y., Kacha, A., Schoentgen, J., et al: ‘Synthesis of pathological voices and experiments on the effect of jitter and shimmer in voice quality perception’. 5th Int. Conf. on Electrical Engineering – Boumerdes (ICEE-B), Boumerdes, Algeria, Octobre 2017, pp. 1–6, doi 10.1109/ICEE-B.2017.8192026.
-
12)
-
3. Brockmann-Bauser, M., Bohlender, J.E., Mehta, D.D.: ‘Acoustic perturbation measures improve with increasing vocal intensity in individuals with and without voice disorders’, J. Voice., 2018, 32, (2), pp. 162–168, doi: 10.1016/j.jvoice.2017.04.008.
-
13)
-
14. Klatt, D.H.: ‘Software for a cascade/parallel formant synthesizer’, J. Acoust. Soc. Am., 1980, 67, (3), pp. 971–995, doi: 10.1121/1.383940.
-
14)
-
9. Eddins, D.A., Anand, A., Shrivastav, R.: ‘Interactions among perceived breathiness, roughness, and overall severity of dysphonia’, J. Acoust. Soc. Am., 2019, 145, (3), pp. 1797–1797, doi: 10.1121/1.5101578.
-
15)
-
12. Allam, M., Nandhini, M.: ‘Optimal feature selection using binary teaching learning based optimization algorithm’, J. King Saud Univ.-Comput. Inf. Sci., 2018, 10.1016/j.jksuci.2018.12.001.
-
16)
-
4. Schoentgen, J.: ‘Spectral models of additive and modulation noise in speech and phonatory excitation signals’, J. Acoust. Soc. Am., 2003, 113, (1), pp. 553–562, doi: 10.1121/1.1523384.
http://iet.metastore.ingenta.com/content/journals/10.1049/el.2020.0620
Related content
content/journals/10.1049/el.2020.0620
pub_keyword,iet_inspecKeyword,pub_concept
6
6