Effect of scaling on the area and performance of the H.264/AVC full-search fractional motion estimation algorithm on field-programmable gate arrays

Access Full Text

Effect of scaling on the area and performance of the H.264/AVC full-search fractional motion estimation algorithm on field-programmable gate arrays

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Computers & Digital Techniques — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Fractional motion estimation (FME) is an important part of the H.264/AVC video encoding standard. The algorithm can significantly increase the compression ratio of video encoders while preserving high video quality. The full-search FME algorithm, however, is computationally expensive and can consist of over 45% of the total motion estimation process. To maximise the performance and efficiency of FME implementations on field-programmable gate arrays (FPGAs), one needs to efficiently exploit the inherent parallelism in the algorithm. The authors investigate the scalability of the full-search FME algorithm on FPGAs and also implemented six scaled versions of the algorithm on Xilinx Virtex-5 FPGAs. The authors found that scaling the algorithm vertically within a 4×4 sub-block is more efficient than scaling horizontally across several sub-blocks. It is shown that, with four reference frames, the best vertically scaled design can achieve 96 frames-per-second (fps) performance while encoding full 1920×1088 progressive HDTV video, and the design only consumes 25.5 K LUTS and 28.7 K registers.

Inspec keywords: video coding; field programmable gate arrays; motion estimation

Other keywords: field-programmable gate arrays; full-search FME algorithm; high video quality; Xilinx Virtex-5 FPGA; HDTV video; fractional motion estimation; H.264/AVC video encoding standard

Subjects: Video signal processing; Computer vision and image processing techniques; Logic circuits; Logic and switching circuits; Image and video coding

References

    1. 1)
      • Chen, T., Lian, C., Chen, L.: `Hardware architecture design of an H.264/AVC video codec', Proc. Asia and South Pacific Design Automation Conf., January 2006, Yokohama, Japan, p. 750–757.
    2. 2)
    3. 3)
      • Joint Video Team Reference Software JM7.3, http://iphome.hhi.de/suehring/tml/download/, accessed November 2010.
    4. 4)
      • Chen, T., Huang, Y., Chen, L.: `Analysis and design of macroblock pipelining for H.264/AVC VLSI architecture', Proc. IEEE Int. Symp. on Circuits and Systems, May 2004, Vancouver, Canada, p. 273–276.
    5. 5)
      • Yang, L., Yu, K., Li, J., Li, S.: `Prediction-based directional fractional pixel motion estimation for H.264 video coding', Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, March 2005, Philadelphia, PA, p. II901–II904.
    6. 6)
      • Yang, C., Goto, S., Ikenaga, T.: `High performance VLSI architecture of fractional motion estimation in H.264 for HDTV', Proc. IEEE Int. Symp. on Circuits and Systems, May 2006, Island of Kos, Greece, p. 2605–2608.
    7. 7)
    8. 8)
    9. 9)
    10. 10)
    11. 11)
      • Zhang, L., Gao, W.: `Improved FFSBM algorithm and its VLSI architecture for variable block size motion estimation of H.264', Proc. Int. Symp. on Intelligent Signal Processing and Communication Systems, December 2005, Hong Kong, p. 445–448.
    12. 12)
      • Song, Y., Ma, Y., Liu, Z., Ikenaga, T., Goto, S.: `Hardware-oriented direction-based fast fractional motion estimation algorithm in H.264/AVC', Proc. IEEE Int. Conf. on Multimedia and Expo, June 2008, Hannover, Germany, p. 1009–1012.
    13. 13)
      • Wang, T., Huang, Y., Fang, H., Chen, L.: `Parallel 4′4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264', Proc. IEEE Int. Symp. on Circuits and Systems, May 2003, Bangkok, Thailand, p. 800–803.
    14. 14)
      • P. Kuhn . Algorithms, complexity analysis and VLSI architectures for MPEG-4 motion estimation.
    15. 15)
      • C. Chen , S. Chien , Y. Huang , T. Chen , T. Wang , L. Chen . Analysis and architecture design of variable block size motion estimation for H.264/AVC. IEEE Trans. Circuits Syst. , 2 , 578 - 593
    16. 16)
    17. 17)
      • Chao, W., Chen, T., Chang, Y., Hsu, C., Chen, L.: `Computationally controllable integer, half, and quarter-PEL motion estimation for MPEG-4 advanced simple profile', Proc. Int. Symp. on Circuits Systems, May 2003, Bangkok, Thailand, p. II788–II791.
    18. 18)
    19. 19)
    20. 20)
      • Maro-Campos, A., Ballester-Merelo, F., Martinez-Peiro, M., Canals-Esteve, J.: `High parallel-pipeline integer-PEL and fractional-PEL motion estimation VLSI architectures for H.264/AVC', Proc. SPIE: VLSI Circuits and Systems III, May 2007, Maspalomas, Spain, p. 659010-1–659010-11.
    21. 21)
      • Chen, T., Huang, Y., Chen, L.: `Fully utilized and reusable architecture for fractional motion estimation for H.264/AVC', Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Montreal, May 2004, Canada, p. 9–12.
    22. 22)
      • Chen, Y., Chuang, T., Tsai, C., Chen, Y., Chen, L.: `A cost-efficient residual prediction VLSI architecture for H.264/AVC scalable extension', Proc. Picture Coding Symp., November 2007, Lisboa, Portugal, p. 13–17.
    23. 23)
    24. 24)
      • Chang, J., Leou, J.: `A quadratic prediction based fractional-pixel motion estimation algorithm for H.264', Proc. IEEE Int. Symp. Multimedia, December 2005, Irvine, CA, p. 491–498.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cdt.2010.0167
Loading

Related content

content/journals/10.1049/iet-cdt.2010.0167
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading