Data-driven, nonlinear, formant-to-acoustic mapping for ASR

Author(s): P.J.B. Jackson ; B.-H. Lo ; M.J. Russell
DOI: 10.1049/el:20020436

For access to this article, please select a purchase option:

Buy article PDF

Buy Knowledge Pack

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership

Recommend Title Publication to library

Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Author(s): P.J.B. Jackson ¹ ; B.-H. Lo ¹ ; M.J. Russell ¹
- Affiliations: 1: Department of Electronics, Electrical & Computer Engineering, University of Birmingham, Birmingham, United Kingdom
Source: Volume 38, Issue 13, 20 June 2002, p. 667 – 669
DOI: 10.1049/el:20020436 , Print ISSN 0013-5194, Online ISSN 1350-911X

With a view to using an articulatory representation in automatic recognition of conversational speech, two nonlinear methods for mapping from formants to short-term spectra were investigated: multilayered perceptrons (MLPs), and radial basis function (RBF) networks. Five schemes for dividing the TIMIT data according to their phone class were tested. The r.m.s. error of the RBF networks was 10%, less than that of the MLP, and the scheme based on discrete articulatory regions gave the greatest improvements over a single network.

Data-driven, nonlinear, formant-to-acoustic mapping for ASR

Data-driven, nonlinear, formant-to-acoustic mapping for ASR

Buy article PDF

Buy Knowledge Pack

Thank you

References

Related content