http://iet.metastore.ingenta.com
1887

Mean compensation based on projection-based group delay scheme for noisy speech recognition

Mean compensation based on projection-based group delay scheme for noisy speech recognition

For access to this article, please select a purchase option:

Buy article PDF
$19.95
(plus tax if applicable)
Buy Knowledge Pack
10 articles for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A mean vector compensation technique based on the projection-based group delay scheme has been combined with a semi-continuous HMM to improve the recognition rate in noisy environments. The proposed approach compensates the mean vector using a projection-based scale factor and the bias estimated from the training and/or testing data to balance the mismatch between different environments. Experiments show that the significant improvement in speaker-dependent, isolated word recognition was achieved by adding the projection-based scale factor and mean vector bias.

References

    1. 1)
      • Itakura, F., Umezaki, T.: `Distance measure for speech recognition based onthe smoothed group delay spectrum', Proc. ICASSP, 1987, p. 1257–1260.
    2. 2)
      • D. Mansour , B.H. Juang . A family of distortion measure based uponprojection operation for robust speech recognition. IEEE Trans. , 1659 - 1671
    3. 3)
      • Calson, B.A., Clements, M.A.: `Application of a weighted projectionmeasure for robust hidden Markov model based speech recognition', Proc. Int. Conf. Acoust., Speech, Signal Processing, 1991, p. 921–924.
    4. 4)
      • S.L. Tung , I.S. Lei , Y.T. Juang . Projection-based group delay scheme for speech recognition. IEEE Trans. , 2 , 138 - 140
    5. 5)
      • Y. Gong . Speech recognition in noisy environments: A survey. Speech Commun. , 261 - 291
    6. 6)
      • M. Rahim , B.-H. Juang . Signal bias removal by maximum likelihoodestimation for robust telephone speech recognition. IEEE Trans. , 1 , 19 - 30
    7. 7)
      • A. Sanker , C.H. Lee . A maximum-likelihood approach to stochasticmatching for robust speech recognition. IEEE Trans. , 190 - 202
    8. 8)
      • M.G. Rahim , B.H. Juang , W. Chou , E. Buhrke . Signal conditioning techniques for robust speech recognition. IEEE Signal Process. Lett. , 4 , 107 - 109
http://iet.metastore.ingenta.com/content/journals/10.1049/el_19990982
Loading

Related content

content/journals/10.1049/el_19990982
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address