Energy-delay efficient filter cache hierarchy using pattern prediction scheme

IEE Proceedings - Computers and Digital Techniques

A filter cache (FC) is an auxiliary cache much smaller than the main cache. The FC is the level of the hierarchy closest to the instruction fetch unit, and it must be small to achieve energy-efficient realisations. A pattern prediction scheme is adapted to maximise energy savings in the FC hierarchy. The proposed pattern prediction mechanism relies on the spatial hit or miss pattern of the instruction access stream over previous FC line accesses. Unlike existing techniques, which predominantly make incorrect hit predictions, the proposed approach aims to minimise such mispredictions, thereby reducing the performance and power penalties associated with them. Simulation results on an extensive set of multimedia benchmarks are presented to demonstrate its efficacy. The prediction technique results in energy-delay savings of up to 6.8% over the NFPT predictor, which has been proposed in the past as the preferred prediction scheme for FC structures. Investigations demonstrate that the performance of the proposed prediction scheme is comparable with, and in most cases better than, that of the NFPT-based scheme. Unlike NFPT, the proposed prediction technique lends itself well to efficient VLSI implementation, making it the preferred choice for energy-aware implementations.
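
The abstract does not describe the internal structure of the predictor. Purely as an illustration of what predicting from the hit or miss pattern of recent FC line accesses could look like, the Python sketch below assumes a short history register that indexes a table of 2-bit saturating counters, in the style of two-level adaptive branch predictors. The names `PatternPredictor`, `history_bits` and `simulate`, the counter width and the initial counter values are all hypothetical choices made for this sketch and are not taken from the article.

```python
class PatternPredictor:
    """Hypothetical hit/miss pattern predictor (illustrative sketch only)."""

    def __init__(self, history_bits=4):
        self.mask = (1 << history_bits) - 1
        # Outcomes of the most recent FC line accesses: 1 = hit, 0 = miss.
        self.history = 0
        # One 2-bit saturating counter per history pattern.
        # Assumption: start at 1 ("weakly miss") so that an untrained
        # predictor bypasses the FC rather than risking a wrong hit prediction.
        self.counters = [1] * (1 << history_bits)

    def predict_hit(self):
        """Return True if the next FC line access is predicted to hit."""
        return self.counters[self.history] >= 2

    def update(self, was_hit):
        """Train on the actual outcome, then shift it into the history."""
        c = self.counters[self.history]
        self.counters[self.history] = min(3, c + 1) if was_hit else max(0, c - 1)
        self.history = ((self.history << 1) | int(was_hit)) & self.mask


def simulate(outcomes, history_bits=4):
    """Replay a sequence of actual hit/miss outcomes; return (correct, total)."""
    predictor = PatternPredictor(history_bits)
    correct = 0
    for was_hit in outcomes:
        if predictor.predict_hit() == bool(was_hit):
            correct += 1
        predictor.update(was_hit)
    return correct, len(outcomes)
```

In a sketch like this, a predicted hit would send the fetch to the FC first, while a predicted miss would send it directly to the main cache. Biasing the initial state towards "miss" is one plausible way to limit incorrect hit predictions, which are the case the abstract identifies as costly in both performance and power.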

http://iet.metastore.ingenta.com/content/journals/10.1049/ip-cdt_20040032