A filter cache (FC) is an auxiliary cache much smaller than the main cache. It sits closest to the instruction fetch unit in the memory hierarchy and must be small to achieve energy-efficient realisations. A pattern prediction scheme is adapted to maximise energy savings in the FC hierarchy. The proposed mechanism relies on the spatial hit or miss pattern of the instruction access stream over previous FC line accesses. Unlike existing techniques, which make predominantly incorrect hit predictions, the proposed approach aims to minimise such mispredictions, thereby reducing the associated performance and power penalties. Simulation results on an extensive set of multimedia benchmarks are presented as evidence of its efficacy. The prediction technique yields energy-delay savings of up to 6.8% over the NFPT predictor, which has previously been proposed as the preferred prediction scheme for FC structures. The investigations demonstrate that the performance of the proposed scheme is comparable with, and in most cases better than, that of NFPT. Unlike NFPT, the proposed technique lends itself well to VLSI-efficient implementation, making it the preferred choice for energy-aware implementations.
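The abstract does not give the predictor's internals, so the following is only a minimal sketch of the general idea of predicting the next FC access from the recent hit or miss pattern. It assumes a short history register indexing a table of 2-bit saturating counters, in the style of two-level adaptive predictors; it is neither the authors' scheme nor NFPT. The class name `PatternPredictor`, the helper `simulate`, the history length and the counter width are all hypothetical choices made for illustration.

```python
class PatternPredictor:
    """Hypothetical hit/miss pattern predictor for a filter cache (FC).

    Assumption: the last `history_bits` FC outcomes (1 = hit, 0 = miss)
    index a table of 2-bit saturating counters. This is an illustrative
    sketch, not the scheme described in the paper.
    """

    def __init__(self, history_bits=4):
        self.mask = (1 << history_bits) - 1
        self.history = 0  # recent outcomes, newest in the lowest bit
        # Counters start at 1 ("weakly miss"), so the predictor only
        # predicts a hit after it has seen evidence for that pattern.
        self.counters = [1] * (1 << history_bits)

    def predict_hit(self):
        """Return True if the next access should probe the FC."""
        return self.counters[self.history] >= 2

    def update(self, was_hit):
        """Train on the real outcome, then shift it into the history."""
        count = self.counters[self.history]
        if was_hit:
            self.counters[self.history] = min(3, count + 1)
        else:
            self.counters[self.history] = max(0, count - 1)
        self.history = ((self.history << 1) | int(was_hit)) & self.mask


def simulate(outcomes, predictor):
    """Count the two kinds of misprediction over a stream of outcomes.

    Returns (incorrect_hit_predictions, incorrect_miss_predictions).
    """
    wrong_hit = 0
    wrong_miss = 0
    for was_hit in outcomes:
        predicted_hit = predictor.predict_hit()
        if predicted_hit and not was_hit:
            wrong_hit += 1   # FC probed needlessly, main cache access delayed
        elif not predicted_hit and was_hit:
            wrong_miss += 1  # FC bypassed although it held the line
        predictor.update(was_hit)
    return wrong_hit, wrong_miss


if __name__ == "__main__":
    # Hypothetical access stream: a run of hits interrupted by misses.
    stream = [True] * 8 + [False] * 2 + [True] * 8
    print(simulate(stream, PatternPredictor(history_bits=2)))
```

In this sketch a predicted hit means the fetch probes the FC first, while a predicted miss sends the fetch straight to the main cache. Starting the counters in the "weakly miss" state biases the sketch against incorrect hit predictions, which is the kind of misprediction the abstract singles out as costly.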
http://iet.metastore.ingenta.com/content/journals/10.1049/ip-cdt_20040032