Data-driven, nonlinear, formant-to-acoustic mapping for ASR

Access Full Text

Data-driven, nonlinear, formant-to-acoustic mapping for ASR

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

With a view to using an articulatory representation in automatic recognition of conversational speech, two nonlinear methods for mapping from formants to short-term spectra were investigated: multilayered perceptrons (MLPs), and radial basis function (RBF) networks. Five schemes for dividing the TIMIT data according to their phone class were tested. The r.m.s. error of the RBF networks was 10%, less than that of the MLP, and the scheme based on discrete articulatory regions gave the greatest improvements over a single network.

Inspec keywords: speech recognition; radial basis function networks; multilayer perceptrons

Other keywords: TIMIT data; RBF networks; nonlinear methods; RMS error; formant-to-acoustic mapping; radial basis function networks; automatic speech recognition; short-term spectra; ASR; phone class; multilayered perceptrons; articulatory representation; MLP; conversational speech

Subjects: Speech recognition; Neural computing techniques; Speech recognition and synthesis

References

    1. 1)
      • C.M. Bishop . (1995) Neural networks for pattern recognition.
    2. 2)
      • Richards, H.B., Bridle, J.S.: `The HDM: a segmental hidden dynamic model of coarticulation', Proc. IEEE-ICASSP'99, 1999, p. 357–360.
    3. 3)
    4. 4)
    5. 5)
      • Holmes, J.N.J.: `Speech processing system using formant analysis', US patent, 6292775, September 2001.
    6. 6)
      • W.J. Holmes , M.J. Russell . Probabilistic-trajectory segmental HMMs. Comput. Speech Lang. , 1 , 3 - 37
    7. 7)
      • Holmes, W.J., Holmes, J.N., Garner, P.N.: `Using formant frequencies in speech recognition', Proc. Eurospeech'97, 1997, p. 2083–2086.
http://iet.metastore.ingenta.com/content/journals/10.1049/el_20020436
Loading

Related content

content/journals/10.1049/el_20020436
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading