http://iet.metastore.ingenta.com
1887

Harmonic postprocessing off speech synthesised by stochastic coders

Harmonic postprocessing off speech synthesised by stochastic coders

For access to this article, please select a purchase option:

Buy article PDF
$19.95
(plus tax if applicable)
Buy Knowledge Pack
10 articles for $120.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IEE Proceedings I (Communications, Speech and Vision) — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

High quality speech coding at medium-to-low bit rates is presently one of the major goals in speech research. Stochatic coding represents an important step towards this objective. Yet, the quality of synthetic speech is still not always good enough. A subjectively important part of the distortion may arise from imperfect reproduction of voiced regions, where the harmonic structure is not as well marked in the synthetic signal as it is in the original speech signal. Postprocessing of synthetic signals using harmonic modelling arises as a natural solution to reduce this distortion. The disadvantages of this method in terms of additional delay, complexity and dependency on high precision pitch detectors can be well counterbalanced by the higher quality of resynthesised speech signals in voiced regions.

References

    1. 1)
      • Adoul, J.-P., Mabilleau, P., Delprat, M., Morissete, S.: `Fast CELP coding based on algebraic codes', Proceedings of International Conference on Acoustics, speech and signal processing, April 1987, p. 1957–1960.
    2. 2)
      • L.B. Almeida , J.M. Tribolet . Nonstationary spectral modelling of voiced speech. IEEE Trans. , 664 - 678
    3. 3)
      • Almeida, L.B., Silva, F.M.: `Variable-frequency synthesis: an improved harmonic coding scheme', Proceedings of International Conference on Acoustics, speech and signal processing, March 1984, p. 27.5.1–27.5.4.
    4. 4)
      • B.S. Atal , M.R. Schroeder . Adaptive predictive coding of speech signals. Bell Syst. Tech. J. , 1973 - 1986
    5. 5)
      • Atal, B.S., Remde, J.R.: `A new model of LPC excitation for producing natural-sounding speech at low bit rates', Proceedings of International Conference on Acoustics, speech and signal processing, May 1982, p. 614–617.
    6. 6)
      • Copperi, M., Sereno, D.: `Vector quantisation and perceptual criteria for low-rate coding of speech', Proceedings of International Conference on Acoustics, speech and signal processing, March 1985, p. 252–255.
    7. 7)
      • Gersho, A., Ramstad, T., Versvik, I.: `Fully vectorquantised subband coding with adaptive codebook allocation', Proceedings of International Conference on Acoustics, speech and signal processing, March 1984, Paper 10.7.
    8. 8)
      • Kroon, P., Atal, B.S.: `Quantisation procedures for 4.8 kbps CELP coders', Proceedings of International Conference on Acoustics, speech and signal processing, April 1987, p. 1649–1652.
    9. 9)
      • Lin, D.: `New approaches to stochastic coding of speech sources at very low bit rates', Proceedings of EUSIPCO, September 1986, p. 445–448.
    10. 10)
      • J. Makhoul , S. Roucos , H. Gish . Vector quantisation in speech coding. Proc. IEEE , 1551 - 1588
    11. 11)
      • Schroeder, M.R., Atal, B.S.: `Rate-distortion theory and predictive coding', Proceedings of International Conference on Acoustics, speech and signal processing, March 1981, p. 201–204.
    12. 12)
      • Schroeder, M.R., Atal, B.S.: `Stochastic coding of speech signals at very low bit rates', Proceedings of International Conference on Communications, May 1984, p. 1610–1613.
    13. 13)
      • Schroeder, M.R., Atal, B.S.: `Code-excited linear prediction (CELP): high-quality speech at very low bit rates', Proceedings of International Conference on Acoustics, speech and signal processing, March 1985, p. 937–940.
    14. 14)
      • Singhal, S.: `On encoding filter parameters for stochastic coders', Proceedings of International Conference on Acoustics, speech and signal processing, April 1987, p. 1633–1636.
    15. 15)
      • Trancoso, I.M., Almeida, L.B., Tribolet, J.M.: `A study on the relationships between stochastic and harmonic coding', Proceedings of International Conference on Acoustics, speech and signal processing, April 1986, p. 1709–1712.
    16. 16)
      • Trancoso, I.M., Atal, B.S.: `Efficient procedures for finding the optimum innovation in stochastic coders', Proceedings of International Conference on Acoustics, speech and signal processing, April 1986, p. 2375–2378.
    17. 17)
      • Trancoso, I.M., Tribolet, J.M.: `Harmonic postprocessing of speech synthesised by stochastic coders', Proceedings of European Conference on Speech technology, Edinburgh, September 1987, 2, p. 181–184.
    18. 18)
      • Tribolet, J.M., Crochiere, R.E.: `A modified adaptive transform coding scheme with postprocessing-enhancement', Proceedings of International Conference on Acoustics, speech and signal processing, April 1980, p. 336–339.
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-i-2.1989.0018
Loading

Related content

content/journals/10.1049/ip-i-2.1989.0018
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address