Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

Improved line spectral frequency estimation through anti-aliasing filtering

Improved line spectral frequency estimation through anti-aliasing filtering

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IEE Proceedings - Vision, Image and Signal Processing — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A study of the classic line spectral frequency (LSF) extraction methods along with the assumptions made during their estimation is presented. LSF extraction is investigated from an over-sampling and decimation perspective and LSFs are shown to contain high-frequency variations that led to spectral overlapping problems. An anti-aliasing filter prior to decimation, with a cut-off frequency dependent on the final LSF vector transmission rate, is proposed to alleviate the aliasing problem of the classic extraction methods. The proposed method shows a clear advantage because it produces the same quantisation distortion as the classic methods at a lower bit requirement, with significant reduction in 2 and 4 dB outliers that greatly affect synthesised speech quality. This was also confirmed through a listening test.

References

    1. 1)
      • F. Tzeng . Analysis-by-synthesis linear predictive speech coding at 2.4 kbit/s. Proc. Globecom , 1253 - 1257
    2. 2)
      • Eriksson, T., Kang, H.-G., Hedelin, P.: `Low-rate quantisation of spectrum parameters', Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 2000, Istanbul, Turkey, 3, p. 1447–1450.
    3. 3)
    4. 4)
      • S. Villette . (2001) Sinusoidal speech coding for low and very low bit rate application.
    5. 5)
      • Juang, B.H., Gray, A.H.: `Multiple stage vector quantisation for speech coding', Proc. Int. Conf. on Acoustics, Speech, Signal Processing, 1982, Paris, France, p. 597–600.
    6. 6)
      • Ohmuro, H., Moriya, T., Mano, K., Miki, S.: `Coding of LSP parameters using interframe moving average prediction and multi-stage vector quantisation', Proc. IEEE Workshop on Speech Coding for Telecommunications, October 1993, Sainte-Adele, Canada, p. 63–64.
    7. 7)
      • W.P. LeBlanc , B. Bhattacharya , S.A. Mahmoud , V. Cuperman . Efficient search and design procedures for robust multi-stage VQ of LPCparameters for 4 kb/s speech coding. IEEE Trans. Speech Audio Process. , 4 , 373 - 385
    8. 8)
      • J.R. Deller , J.G. Proakis , J.H.L. Hansen . (1993) Discrete-time processing of speech signal.
    9. 9)
      • W.B. Kleijn , K.K. Paliwal . (1995) Speech coding and synthesis.
    10. 10)
      • J.G. Proakis , D.G. Manolakis . (1996) Digital signal processing: principles, algorithms and applications.
    11. 11)
      • McCree, A., Truong, K., George, E.B., Barnwell, T.P., Viswanathan, V.: `A 2.4 kbits/s MELP coder candidate for the new U.S. Federal Standard', Int. Conf. on Acoustics, Speech, and Signal Processing, 1996, 1, p. 200–203.
    12. 12)
      • (1998) GSM adaptive multi rate speech transcoding.
    13. 13)
      • F. Itakura . Line spectrum representation of linear predictive coefficients of speech signals. J. Acoust. Soc. Am.
    14. 14)
      • Knagenhjelm, H.P., Kleijn, W.B.: `Spectral dynamics is more important than spectral distortion', IEEE Proc. Int. Conf. on Acoustics, Speech, and Signal Processing, 1995, New York, NY, USA, 1, p. 732–735.
    15. 15)
      • Villette, S., Cho, Y.D., Kondoz, A.M.: `Efficient parameter quantisation for 2.4/1.2 kb/s split-band LPC coding', IEEE Workshop on Speech Coding, 17–20 September 2000, Dalavan, Wisconsin, USA.
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-vis_20045257
Loading

Related content

content/journals/10.1049/ip-vis_20045257
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address