Energy-delay efficient filter cache hierarchy using pattern prediction scheme

Access Full Text

Energy-delay efficient filter cache hierarchy using pattern prediction scheme

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IEE Proceedings - Computers and Digital Techniques — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Filter cache (FC) is an auxiliary cache much smaller than the main cache. The FC is closest in hierarchy to the instruction fetch unit and it must be small in size to achieve energy-efficient realisations. A pattern prediction scheme is adapted to maximise energy savings in the FC hierarchy. The pattern prediction mechanism proposed relies on the spatial hit or miss pattern of the instruction access stream over previous FC line accesses. Unlike existing techniques, which make predominantly incorrect hit predictions, the proposed approach aims to minimise this, thereby reducing the performance and power penalties associated with it. Simulation results on an extensive set of multimedia benchmarks are presented as proof of its efficacy. The prediction technique results in energy-delay savings of up to 6.8% over the NFPT predictor, which has been proposed in the past as the preferred prediction scheme for FC structures. Investigations conclusively demonstrate that the performance of the proposed prediction scheme is comparable with and in most cases better than that based on NFPT. Unlike NFPT, the new proposed prediction technique lends well for VLSI efficient implementation, making it the preferred choice for energy aware implementations.

Inspec keywords: memory architecture; cache storage; VLSI; instruction sets; prediction theory

Other keywords: instruction fetch unit; filter cache hierarchy; energy-delay; pattern prediction scheme; energy-efficient realisation; energy aware implementation; VLSI efficient implementation; NFPT predictor

Subjects: Semiconductor integrated circuits; Memory circuits; Storage system design; Semiconductor storage

References

    1. 1)
      • Ernst, D., Austin, T.M., Mudge, T., Brown, R.B.: `MiBench: a free commercially representative embedded benchmark suite', Proc. 4th IEEE Int. Workshop on Workload characterization, Dec. 2001, p. 3–14.
    2. 2)
      • , : `Power products reference manual V3.5', Synopsys, 1996.
    3. 3)
      • www.SystemC.org.
    4. 4)
      • Tang, W., Gupta, R., Nicolau, A.: `Design of a predictive filter cache for energy savings in high-performance processor architectures', Proc. Int. Conf. on Computer Design, 2001, p. 68–73.
    5. 5)
      • Anderson, T., Agarwala, S.: `Effective hardware-based two-way loop cache for high-performance low-power processors', Proc. Int. Conf. on Computer Design, 2000, p. 403–407.
    6. 6)
      • Lee, C., Potkonjak, M., Mangione-Smith, W.H.: `Media-bench: A tool for evaluating and synthesizing multimedia and communications systems', Proc. 30th Int. Symp on Microarchitecture, 1997, p. 330–335.
    7. 7)
    8. 8)
      • J. Montanaro , R.T. Witek , K. Anne , A.J. Black , E.M. Cooper , D.W. Dobberpuhl , P.M. Donahue , J. Eno , W. Hoeppner , D. Kruckemyer , T.H. Lee , P.C.M. Lin , L. Madden , D. Murray , M.H. Pearce , S. Santhanam , K.J. Snyder , R. Stehpany , S.C. Thierauf . A 160-MHz, 32-b, 0.5-W CMOS RISC microprocessor. IEEE J. Solid-State Circuits , 11 , 1703 - 1714
    9. 9)
      • D.A. Patterson , J.L. Hennessy . (2007) Computer architecture: a quantitative approach.
    10. 10)
      • Yeh, T.Y., Patt, Y.N.: `Alternative implementation of two-level adaptive branch prediction', Proc. 19th Int. Symp. on Computer Architecture, 1992, p. 124–124.
    11. 11)
      • www.simplescalar.com.
    12. 12)
      • Tang, W., Gupta, R., Nicolau, A.: `Power savings in embedded processors through decode filter cache', Proc. Int. Conf. on Design Automation & Test in Europe, March 2002, p. 443–448.
    13. 13)
      • Shivakumar, P., Jouppi, N.: `An integrated cache timing, power and area model', Tech. Report, Compaq Western Research Lab, Palo Alto, CA, 2001/2.
    14. 14)
      • Synopsys CoCentric Compiler Behavioral User and Modeling Guide, v2002.05 [http://www.synopsys.com].
    15. 15)
      • Yoaz, A., Erez, M., Ronen, R., Jourdan, S.: `Speculation techniques for improving load related instruction scheduling', Proc. 26th Int. Symp. on Computer Architecture, May 1999, p. 42–53.
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-cdt_20040032
Loading

Related content

content/journals/10.1049/ip-cdt_20040032
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading