Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free Analysis by synthesis spatial audio coding

This study presents a novel spatial audio coding (SAC) technique, called analysis by synthesis SAC (AbS-SAC), with a capability of minimising signal distortion introduced during the encoding processes. The reverse one-to-two (R-OTT), a module applied in the MPEG Surround to down-mix two channels as a single channel, is first configured as a closed-loop system. This closed-loop module offers a capability to reduce the quantisation errors of the spatial parameters, leading to an improved quality of the synthesised audio signals. Moreover, a sub-optimal AbS optimisation, based on the closed-loop R-OTT module, is proposed. This algorithm addresses a problem of practicality in implementing an optimal AbS optimisation while it is still capable of improving further the quality of the reconstructed audio signals. In terms of algorithm complexity, the proposed sub-optimal algorithm provides scalability. The results of objective and subjective tests are presented. It is shown that significant improvement of the objective performance, when compared to the conventional open-loop approach, is achieved. On the other hand, subjective test show that the proposed technique achieves higher subjective difference grade scores than the tested advanced audio coding multichannel.

References

    1. 1)
      • 1. Herre, J., Faller, C., Disch, S., et al: ‘Spatial audio coding: next-generation efficient and compatible coding of multi-channel audio’. Presented at the 117th Convention of the Audio Engineering Society, San Fransisco, CA, USA, October 2004.
    2. 2)
      • 9. Schuijers, E., Breebaart, J., Purnhagen, H., Engdegard, J.: ‘Low complexity parametric stereo coding’. Presented at the 116th Convention of the Audio Engineering Society, Berlin, Germany, May 2004.
    3. 3)
      • 8. Moon, H.: ‘A low-complexity design for an mp3 multichannel audio decoding system’, IEEE Trans. Audio, Speech, Lang. Proc., 2012, 20, (1), pp. 314321 (doi: 10.1109/TASL.2011.2161081).
    4. 4)
      • 6. Faller, C., Baumgarte, F.: ‘Binaural cue coding-Part II: schemes and applications’, IEEE Trans. Speech Audio Process., 2003, 11, (6), pp. 520531 (doi: 10.1109/TSA.2003.818108).
    5. 5)
      • 14. Hilpert, J., Disch, S.: ‘The MPEG Surround audio coding standard [Standards in a nutshell]’, IEEE Signal Process. Mag., 2009, 26, (1), pp. 148152 (doi: 10.1109/MSP.2008.930433).
    6. 6)
      • 4. Munkong, R., Juang, B.: ‘Auditory perception and cognition’, IEEE Signal Process. Mag., 2008, 25, (3), pp. 98117 (doi: 10.1109/MSP.2008.918418).
    7. 7)
      • 26. ITU-R BS.1116-1: ‘Method for subjective assessment of small impairments in audio systems including multichannel sound systems’, 1997.
    8. 8)
      • 17. Breebaart, J., Hotho, G., Koppens, J., Schuijers, E., Oomen, W., van de Par, S.: ‘Background, concepts, and architecture for the recent MPEG Surround standard on multichannel audio compression’, J. Audio Eng. Soc., 2007, 55, (5), pp. 331351.
    9. 9)
      • 3. Blauert, J.: ‘Spatial hearing: the psychophysics of human sound localization’ (MIT Press, 1983, Rev. ed. 1997).
    10. 10)
      • 21. ISO/IEC 14496-3: ‘Information technology – Coding of audio-visual objects – Part 3: Audio’, 2009.
    11. 11)
      • 2. Herre, J., Disch, S.: ‘New concepts in parametric coding of spatial audio: from SAC to SAOC’. Proc. IEEE Int. Conf. Multimedia and Expo, San Fransisco, CA, USA, October 2007, pp. 18941897.
    12. 12)
      • 24. Liebetrau, J., Sporer, T., Kampf, S., Schneider, S.: ‘Standardization of PEAQ-MC: Extension of ITU-R BS.1387.1 to multichannel audio’. Presented at AES 40th Int. Conf, Spatial Audio: Sense the Sound of Space, Tokyo, Japan, October 2010.
    13. 13)
      • 10. Breebaart, J., van de Par, S., Kohlrausch, A., Schuijers, E.: ‘Parametric coding of stereo audio’, EURASIP J. Appl. Signal Process., 2005, 9, pp. 13051322.
    14. 14)
      • 25. Kabal, P.: ‘An examination and interpretation of ITU-R BS.1387: perceptual evaluation of audio quality’, Telecommunication and Signal Processing Laboratory, Department of Electrical and Computer Engineering, McGill University (URL: http://www-mmsp.ece.mcgill.ca/documents/Software/).
    15. 15)
      • 7. Herre, J., Faller, C., Ertel, C., Hilpert, J., Hoelzer, A., Spenger, C.: ‘MP3 surround: efficient and compatible coding of multi-channel audio’. Presented at the 116th Convention of the Audio Engineering Society, Berlin, Germany, May 2004.
    16. 16)
      • 27. Marston, D., Kozamernik, F., Stoll, G., Spikofski, G.: ‘Further EBU test of multichannel audio codecs’. Presented at the 126th Convention of the Audio Engineering Society, Munich, Germany, May 2009.
    17. 17)
      • 22. http//xwww.audiocoding.com, accessed December 2012.
    18. 18)
      • 13. Herre, J., Kjorling, K., Breebaart, J., et al: ‘MPEG Surround – the ISO/MPEG standard for efficient and compatible multichannel audio coding’, J. Audio Eng. Soc., 2008, 56, (11), pp. 932955.
    19. 19)
      • 18. Elftri, I., Gunel, B., Kondoz, A.: ‘Multichannel audio coding based on analysis by synthesis’, Proc. IEEE, 2011, 99, (4), pp. 657670 (doi: 10.1109/JPROC.2010.2102310).
    20. 20)
      • 16. Herre, J., Dietz, M.: ‘MPEG-4 high-efficiency AAC coding [Standards in a nutshell]’, IEEE Signal Process. Mag., 2008, 25, (3), pp. 137142 (doi: 10.1109/MSP.2008.918684).
    21. 21)
      • 19. Elftri, I., Kondoz, A., Gunel, B.: ‘Spatial audio coding’. UK Patent 2485979, June 2012.
    22. 22)
      • 15. Wolters, M., Kjorling, K., Homm, D., Purnhagen, H.: ‘A closer look into MPEG-4 high efficiency AAC’. Presented at the 115th Convention of the Audio Engineering Society, New York, USA, October 2003.
    23. 23)
      • 23. ITU-R BS.1387-1: ‘Method for objective measurements of perceived audio quality’, 2001.
    24. 24)
      • 12. Hotho, G., Villemoes, L., Breebaart, J.: ‘A backward-compatible multichannel audio codec’, IEEE Trans. Audio, Speech, Lang. Process., 2008, 16, (1), pp. 8393 (doi: 10.1109/TASL.2007.910768).
    25. 25)
      • 5. Baumgarte, F., Faller, C.: ‘Binaural cue coding-Part I: psychoacoustic fundamentals and design principles’, IEEE Trans. Speech Audio Process., 2003, 11, (6), pp. 509519 (doi: 10.1109/TSA.2003.818109).
    26. 26)
      • 20. ISO/IEC 23003-1: ‘Information technology – MPEG audio technologies – Part 1: MPEG Surround’, 2007.
    27. 27)
      • 11. Roden, J., Breebart, J., Hilpert, J., et al: ‘A study of the MPEG Surround quality versus bit-rate curve’. Presented at the 123th Convention of the Audio Engineering Society, New York, USA, October 2007.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-spr.2013.0015
Loading

Related content

content/journals/10.1049/iet-spr.2013.0015
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address