Faster mode determination algorithm using mode correlation for multi-view video coding
- Author(s): Pei-Jun Lee 1 ; Ho-Ju Lin 2 ; Kuei-Ting Kuo 1
Affiliations:
1: Electrical Engineering Department, National Chi Nan University, No.1, University Road, Puli, Nan Tou 545, Taiwan;
2: Hardware Graphic Department, VIA Technologies, Inc., 2F, 9, Li-Hsin Road V. Science-Based Industrial Park, Hsin-Chu 300, Taiwan
- Source: Volume 8, Issue 5, July 2014, p. 565–578
DOI: 10.1049/iet-spr.2012.0286, Print ISSN 1751-9675, Online ISSN 1751-9683
Multi-view video coding with a hierarchical B picture structure uses intra-view and inter-view prediction to reduce redundant information. The optimal coding mode is normally determined by exhaustively searching all possible partition modes, which incurs high computational complexity. In this study, the authors statistically analyse the coding mode distribution in inter-view and intra-view prediction and propose a fast mode decision algorithm that selects the optimal mode in terms of rate–distortion optimisation. For base view coding, the probability density function of the rate–distortion cost and the degree of motion homogeneity serve as multiple thresholds for determining the optimal mode. For multi-view coding, the correlation between the modes of similar regions in neighbouring views is used to select the coding mode from the inter-view or intra-view predictions. Experimental results show that the encoding time is reduced by up to 85% for the base view and up to 69% for the multi-view case, while the quality of the reconstructed video is nearly unchanged.
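The threshold-based early termination described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the mode list, the two threshold parameters, the homogeneity test and every function name below are assumptions made for illustration. In particular, the abstract says the thresholds come from the probability density function of the rate–distortion cost, whereas here they are simply passed in as numbers.

```python
from typing import Callable, Dict, Sequence, Tuple

# Hypothetical candidate partition modes, ordered from cheapest to evaluate
# to most expensive. The real encoder's mode set is not given in the abstract.
MODES = ("SKIP", "16x16", "16x8", "8x16", "8x8")


def rd_cost(distortion: float, rate_bits: float, lam: float) -> float:
    """Lagrangian rate-distortion cost J = D + lambda * R."""
    return distortion + lam * rate_bits


def motion_is_homogeneous(mvs: Sequence[Tuple[float, float]],
                          max_spread: float = 1.0) -> bool:
    """Assumed homogeneity test: neighbouring motion vectors differ by at
    most max_spread in each component. Returns False when no vectors exist."""
    if not mvs:
        return False
    xs = [mv[0] for mv in mvs]
    ys = [mv[1] for mv in mvs]
    return (max(xs) - min(xs) <= max_spread) and (max(ys) - min(ys) <= max_spread)


def fast_mode_decision(cost_of: Callable[[str], float],
                       neighbour_mvs: Sequence[Tuple[float, float]],
                       skip_threshold: float,
                       large_block_threshold: float) -> Tuple[str, Dict[str, float]]:
    """Return (chosen mode, costs actually evaluated).

    cost_of(mode) stands in for a full rate-distortion evaluation of one
    mode; the fewer modes we evaluate, the less encoding time is spent.
    """
    evaluated: Dict[str, float] = {}

    def cost(mode: str) -> float:
        # Evaluate each mode at most once and remember the result.
        if mode not in evaluated:
            evaluated[mode] = cost_of(mode)
        return evaluated[mode]

    # Stage 1: accept SKIP immediately when its cost is already low enough.
    if cost("SKIP") <= skip_threshold:
        return "SKIP", evaluated

    # Stage 2: with homogeneous motion, a cheap 16x16 partition ends the search.
    if motion_is_homogeneous(neighbour_mvs) and cost("16x16") <= large_block_threshold:
        return min(evaluated, key=evaluated.get), evaluated

    # Fallback: exhaustive search over all remaining modes.
    for mode in MODES:
        cost(mode)
    return min(evaluated, key=evaluated.get), evaluated
```

With a cost table such as `{"SKIP": 100, "16x16": 40, ...}` passed as `table.__getitem__`, homogeneous neighbouring motion vectors and a `large_block_threshold` of 80, the search stops after evaluating only two of the five modes. The saving comes entirely from how often the early exits fire, which is why the choice of thresholds matters.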
Inspec keywords: optimisation; computational complexity; video coding; image reconstruction; correlation methods
Other keywords: inter-view predictions; fast mode decision algorithm; multiview video coding; base view coding; rate-distortion optimisation; computational complexity; optimal coding mode; video reconstruction; intra-view predictions; multi-threshold mode; mode correlation; partition modes; mode determination algorithm; hierarchical B picture structure; homogeneity degree; probability density function
Subjects: Optimisation techniques; Computational complexity; Video signal processing; Image and video coding; Computer vision and image processing techniques