Faster mode determination algorithm using mode correlation for multi-view video coding

Multi-view video coding with a hierarchical B picture structure uses intra-view and inter-view prediction to reduce redundant information. The optimal coding mode is normally determined by exhaustively searching all possible partition modes, which carries a high computational cost. In this study, the authors statistically analyse the coding mode distribution for inter-view and intra-view prediction and propose a fast mode decision algorithm that selects the optimal mode in terms of rate-distortion optimisation. For base view coding, the probability density function of the rate-distortion cost and the degree of motion homogeneity serve as multiple thresholds for determining the optimal mode. For multi-view coding, the correlation between the modes of similar regions in neighbouring views is used to select the coding mode from the inter-view or intra-view predictions. The experimental results show that the encoding time is reduced by up to 85% for the base view and up to 69% for the multi-view case, while the quality of the reconstructed video is nearly unchanged.
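
The base-view idea summarised above (stop the partition search early when the rate-distortion cost is already low, and test fewer partitions when motion is homogeneous) can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the mode list, the variance-based homogeneity test, the single fixed cost threshold (the paper derives its thresholds from the probability density function of the cost) and the `evaluate` callback are all hypothetical stand-ins chosen for illustration. Only the Lagrangian cost J = D + λR is standard.

```python
from statistics import pvariance

# Hypothetical candidate modes, ordered from cheapest to most expensive to test.
MODES = ["SKIP", "16x16", "16x8", "8x16", "8x8"]


def rd_cost(distortion, rate, lam):
    """Standard Lagrangian rate-distortion cost J = D + lambda * R."""
    return distortion + lam * rate


def motion_is_homogeneous(motion_vectors, max_variance):
    """Assumed homogeneity test: both motion-vector components vary little."""
    xs = [mv[0] for mv in motion_vectors]
    ys = [mv[1] for mv in motion_vectors]
    return pvariance(xs) <= max_variance and pvariance(ys) <= max_variance


def fast_mode_decision(evaluate, motion_vectors, lam, cost_threshold, max_variance):
    """Return (best_mode, best_cost, modes_tested).

    evaluate(mode) -> (distortion, rate) is a caller-supplied stand-in for the
    encoder; cost_threshold and max_variance are illustrative parameters.
    """
    candidates = MODES
    if motion_is_homogeneous(motion_vectors, max_variance):
        # Homogeneous motion: assume large partitions suffice.
        candidates = MODES[:2]

    best_mode, best_cost, tested = None, float("inf"), []
    for mode in candidates:
        distortion, rate = evaluate(mode)
        cost = rd_cost(distortion, rate, lam)
        tested.append(mode)
        if cost < best_cost:
            best_mode, best_cost = mode, cost
        if best_cost <= cost_threshold:
            # Early termination: the cost is already low enough.
            break
    return best_mode, best_cost, tested
```

In this sketch the saving comes from the length of `modes_tested`: an exhaustive search would always evaluate every entry in `MODES`, whereas the two checks let most macroblocks stop after one or two evaluations. The inter-view step for non-base views, which reuses modes from similar regions in neighbouring views, is not covered here.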

Inspec keywords: optimisation; computational complexity; video coding; image reconstruction; correlation methods

Other keywords: inter-view predictions; fast mode decision algorithm; multiview video coding; base view coding; rate-distortion optimisation; computational complexity; optimal coding mode; video reconstruction; intra-view predictions; multi-threshold mode; mode correlation; partition modes; mode determination algorithm; hierarchical B picture structure; homogeneity degree; probability density function

Subjects: Optimisation techniques; Computational complexity; Video signal processing; Image and video coding; Computer vision and image processing techniques

http://iet.metastore.ingenta.com/content/journals/10.1049/iet-spr.2012.0286