
Excitation synchronous formant analysis


IEE Proceedings I (Communications, Speech and Vision)


Speech signals can be efficiently parametrised by the resonant frequencies of the vocal tract, known as formants. Automatic analysis of the signal into a suitable set of formant parameters has, however, proved to be a difficult problem, particularly for female speech. Excitation synchronous formant analysis has been proposed as an improved method of formant analysis [5]. The paper considers the performance of this technique, particularly where the analysis interval lies within the closed phase of the larynx. The improved performance of closed-phase formant analysis is demonstrated by comparison with pitch-synchronous and fixed-frame formant analysis. The closed-phase region is determined in two ways: first from a laryngograph signal, and secondly from the acoustic waveform alone, using a modified form of the Gold-Rabiner fundamental-frequency estimator. Closed-phase analysis also follows the transient features of the signal more closely, with fewer missed or extra formants and better formant continuity. Its ability to follow formant transitions during glides (e.g. w, r, l) and in voiced segments following plosives is particularly apparent. These improvements are illustrated in various phonetic contexts. The technique has been tested for sensitivity to analysis position, which is important when the glottal closures are determined from the acoustic waveform. This method of formant analysis is currently being applied to the development of speech synthesis by rule and to the provision of a set of features for phonetic recognition.
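The idea of restricting the analysis interval to the closed phase can be illustrated with a minimal sketch. It is not the paper's implementation, and it rests on several assumptions: the closed-phase boundaries (`start`, `end`) are taken as already known; the predictor is fitted by covariance-method linear prediction, a common choice for short intervals because it uses no samples outside the interval; the polynomial roots are found with a simple Durand-Kerner iteration rather than the Jenkins-Traub algorithm listed in the references; and the test signal is a synthetic free decay of two resonances, not real speech. All function names are hypothetical.

```python
import cmath
import math


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def covariance_lpc(s, start, end, order):
    """Least-squares predictor s[n] ~ sum_k a[k-1] * s[n-k], fitted using
    only samples inside s[start:end] (assumed to be the closed phase)."""
    rows = range(start + order, end)
    phi = [[sum(s[n - i] * s[n - k] for n in rows)
            for k in range(1, order + 1)]
           for i in range(1, order + 1)]
    rhs = [sum(s[n] * s[n - i] for n in rows) for i in range(1, order + 1)]
    return solve_linear(phi, rhs)


def poly_roots(coeffs, iterations=500):
    """Durand-Kerner iteration for a monic polynomial, highest power first."""
    n = len(coeffs) - 1
    roots = [(0.4 + 0.9j) ** k for k in range(n)]
    for _ in range(iterations):
        for i in range(n):
            value = 0j
            for c in coeffs:  # Horner evaluation at the current estimate
                value = value * roots[i] + c
            denom = 1 + 0j
            for j in range(n):
                if j != i:
                    denom *= roots[i] - roots[j]
            roots[i] -= value / denom
    return roots


def closed_phase_formants(s, start, end, order, fs):
    """Return sorted (frequency_hz, bandwidth_hz) pairs from the predictor poles."""
    a = covariance_lpc(s, start, end, order)
    # Poles are the roots of z^p - a1 z^(p-1) - ... - ap.
    poles = poly_roots([1.0] + [-ak for ak in a])
    formants = []
    for z in poles:
        if z.imag > 1e-9:  # one pole per conjugate pair; real poles skipped
            freq = cmath.phase(z) * fs / (2 * math.pi)
            bandwidth = -math.log(abs(z)) * fs / math.pi
            formants.append((freq, bandwidth))
    return sorted(formants)


# Synthetic "closed phase": free decay of two resonances (assumed values).
fs = 10000.0
targets = [(500.0, 60.0), (1500.0, 90.0)]
signal = [sum(math.exp(-math.pi * bw * n / fs) * math.cos(2 * math.pi * f * n / fs)
              for f, bw in targets) for n in range(60)]
for freq, bw in closed_phase_formants(signal, 5, 35, 4, fs):
    print(f"formant {freq:7.1f} Hz, bandwidth {bw:5.1f} Hz")
```

The 30-sample interval corresponds to 3 ms at the assumed 10 kHz sampling rate. On real speech the interval would come from a laryngograph signal or an acoustic closure detector, and the predictor order would be higher.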

References

    1. A.V. Oppenheim, R.W. Schafer. Homomorphic analysis of speech. IEEE Trans., 2, 221-226.
    2. B.S. Atal. Influence of pitch on formant frequencies and bandwidths obtained by linear prediction analysis. J. Acoust. Soc. Am.
    3. D.E. Veeneman, S.L. Bement. Automatic glottal inverse filtering from speech and electroglottographic signals. IEEE Trans., 2, 369-377.
    4. D.Y. Wong, J.D. Markel, A.H. Gray. Least squares glottal inverse filtering from the acoustic waveform. IEEE Trans., 350-355.
    5. W.C. Gish. (1981) Speech analysis using acoustical and glottal sensors.
    6. M.J. Hunt, C.E. Harvenburg. Generation of controlled speech stimuli by pitch synchronous LPC analysis of natural utterances. Proceedings of 12th International Congress on Acoustics, July 1986, Toronto, Canada.
    7. J.D. Markel, A.H. Gray. (1976) Linear prediction of speech.
    8. J.N. Holmes, F. Fallside, W.A. Woods. (1985) A parallel-formant synthesiser for machine voice output. Computer speech and language.
    9. S.S. McCandless. An algorithm for automatic formant extraction using linear prediction spectra. IEEE Trans., 2, 134-141.
    10. A.K. Krishnamurthy. Two channel (speech and EGG) analysis for formant tracking and glottal inverse filtering. ICASSP 84, 1984, 36.6.1-36.6.4.
    11. M.A. Jenkins, J.F. Traub. A three stage algorithm for real polynomials using quadratic iteration. SIAM J. Numer. Anal., 4.
    12. A.J. Fourcin, E. Abberton. First applications of a new laryngograph. Medical and Biological Illustration, 172-182.
    13. W. Hess, H. Indefrey. Accurate pitch determination of speech signals by means of a laryngograph. IEE Int. Conf.
    14. L.R. Rabiner, R.W. Schafer. (1978) Digital processing of speech signals.
    15. G. Lindsay. (1987) Internal SPAR Report on Definition of ACAWD Database.
    16. Multilingual speech input/output assessment, methodology and standardisation. 1541, SAM Esprit Project Report, 1988, chap. 1.
    17. B.S. Atal, R. Reddy. (1975) Linear prediction of speech: recent advances with applications to speech analysis. Speech recognition.
    18. G. Lindsay, P. Davies, A. Fourcin. Laryngeal coarticulation effects in English VCV sequences. International Conference on Speech Input/Output; Techniques and Applications, March 1986, 99-103.
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-i-2.1989.0014