Texture analysis approach for improving HMM speech recognition in presence of microinterruptions

Access Full Text

Texture analysis approach for improving HMM speech recognition in presence of microinterruptions

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A simple yet powerful algorithm for enhancing a signal corrupted by microinterruptions is outlined. The algorithm performs a texture analysis of the peak-based spectrogram image for correcting the damage caused by noise and has been used as a pre-processor in hidden Markov model (HMM) speech recognition. Improvements in accuracy as high as 16% have been obtained with the T120 database.

Inspec keywords: speech recognition; signal representation; hidden Markov models

Other keywords: microinterruptions; HMM speech recognition; texture analysis approach; hidden Markov model; T120 database; signal enhancement; peak-based spectrogram image

Subjects: Signal processing theory; Markov processes; Signal detection; Speech recognition and synthesis; Markov processes; Speech recognition

References

    1. 1)
      • R.M. Haralick . Statistical and structural approaches to texture. Proc. IEEE , 5 , 786 - 804
    2. 2)
      • A. Rosenfeld , R.A. Hummel , S.W. Zucker . Scene labeling by relaxation operators. IEEE Trans. , 6 , 420 - 433
    3. 3)
      • R. Steele . (1992) Mobile radio communications.
    4. 4)
      • R.J. McAulay , T.F. Quatieri . Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans. , 4 , 744 - 754
http://iet.metastore.ingenta.com/content/journals/10.1049/el_19990339
Loading

Related content

content/journals/10.1049/el_19990339
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading