Nonlinear cepstral equalisation method for noisy speech recognition

Access Full Text

Nonlinear cepstral equalisation method for noisy speech recognition

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IEE Proceedings - Vision, Image and Signal Processing — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

The authors deal with the problem of automatic speech recognition in the presence of additive white noise. The effect of noise is modelled as an additive term to the power spectrum of the original clean speech. The cepstral coefficients of the noisy speech are then derived from this model. The reference cepstral vectors trained from clean speech are adapted to their appropriate noisy version to best fit the testing speech cepstral vector. The LPC coefficients, LPC derived cepstral coefficients, and the distance between test and reference, are all regarded as functions of the noise ratio (the spectral power ratio of noise to noisy speech). A gradient based algorithm is proposed to find the optimal noise ratio as well as the minimum distance between the test cepstral vector and the noise adapted reference. A recursive algorithm based on Levinson-Durbin recursion is proposed to simultaneously calculate the LPC coefficients and the derivatives of the LPC coefficients with respect to the noise ratio. The stability of the proposed adaptation algorithm is also addressed. Experiments on multispeaker (50 males and 50 females) isolated Mandarin digits recognition demonstrate remarkable performance improvements over noncompensated method under noisy environment. The results are also compared to the projection based approach, and experiments show that the proposed method is superior to the projection approach under a severe noisy environment.

Inspec keywords: speech processing; speech recognition; autoregressive processes; natural languages; equalisers; linear predictive coding; cepstral analysis; white noise; recursive functions

Other keywords: Levinson-Durbin recursion; power spectrum; nonlinear cepstral equalisation method; multispeaker isolated Mandarin digits recognition; LPC derived cepstral coefficients; spectral power ratio; noise ratio; optimal noise ratio; noisy speech recognition; additive white noise; recursive algorithm; minimum distance; LPC coefficients; automatic speech recognition; projection based approach; gradient based algorithm

Subjects: Human speech communication; Speech processing techniques; Speech and audio signal processing; Codes; Speech recognition

http://iet.metastore.ingenta.com/content/journals/10.1049/ip-vis_19941373
Loading

Related content

content/journals/10.1049/ip-vis_19941373
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading