Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free SkipCache: application aware cache management for chip multi-processors

With the advent of multiple cores on a single chip, it is common for the systems to have multi-level caches. Multiple levels of cache reduce the pressure on the memory bandwidth by allowing applications to store their frequently accessed data in them. The levels of cache nearer to the core filter the locality in the application access, which can result in high miss rates at farther levels. This piece of study revolves around one question: are all levels of cache needed by all applications during all phases of their execution? The study observes the effect of 2-level and 3-level cache hierarchies on the performance of different applications. On the basis of this study, this study proposes an application aware cache management policy called ‘SkipCache’, which allows an application to choose a 2-level or 3-level cache hierarchy during run-time. SkipCache dynamically tracks the applications at shared last-level cache (LLC) to identify the applications that do not obtain advantage by using the LLC. Such applications can completely skip the LLC so that other co-scheduled cache friendly applications can efficiently use it. Evaluation of SkipCache in a 4-core chip multi-processor with multi-programmed workloads shows significant performance improvement. SkipCache is orthogonal to other cache management techniques and can be used along with other optimisation techniques to improve the system performance.

References

    1. 1)
    2. 2)
    3. 3)
    4. 4)
    5. 5)
    6. 6)
    7. 7)
      • 2. Jaleel, A., Theobald, K.B., Steely, Jr.S.C., Emer, J.: ‘High performance cache replacement using re-reference interval prediction (RRIP)’. Proc. ISCA, 2010, pp. 6071.
    8. 8)
      • 21. ‘SPEC CPU benchmark suite’. Available at http://www.spec.org.
    9. 9)
      • 9. Xie, Y., Loh, G.H.: ‘Scalable shared-cache management by containing thrashing workloads’. Proc. HiPEAC, 2010, pp. 262276.
    10. 10)
    11. 11)
      • 30. Coleman, J.A., Srivastava, D.: ‘Controlling a processor cache using a real-time attribute’. WO Patent App. PCT/US2011/066,973,2013.
    12. 12)
      • 29. Qu, N., Gou, X., Cheng, X.: ‘Using uncacheable memory to improve unity Linux performance’. Proc. workshop on Interaction between Operating System and Computer Architecture, Austin, TX, October 2005. pp. 2732.
    13. 13)
    14. 14)
    15. 15)
      • 23. Muralimanohar, N., Balasubramonian, R., Jouppi, N.: ‘Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0’. Proc. MICRO, 2007, pp. 314.
    16. 16)
      • 28. Sim, J., Lee, J., Qureshi, M.K., Kim, H.: ‘Flexclusion: balancing cache capacity and on-chip bandwidth via flexible exclusion’. Proc. ISCA, 2012, pp. 321332.
    17. 17)
      • 11. Sanchez, D., Kozyrakis, C.: ‘Vantage: scalable and efficient fine-grain cache partitioning’. Proc. ISCA, 2011, pp. 5768.
    18. 18)
      • 10. Xie, Y., Loh, G.H.: ‘PIPP: promotion/insertion pseudo-partitioning of multi-core shared caches’. Proc. ISCA, 2009, pp. 174183.
    19. 19)
      • 27. Gupta, S., Gao, H., Zhou, H.: ‘Adaptive cache bypassing for inclusive last level caches’. Proc. IPDPS, 2013, pp. 12431253.
    20. 20)
      • 20. Sanchez, D., Yen, L., Hill, M.D., Sankaralingam, K.: ‘Implementing signatures for transactional memory’. Proc. of MICRO, 2007, pp. 123133.
    21. 21)
      • 8. Qureshi, M.K., Patt, Y.N.: ‘Utility-based cache partitioning: a low-overhead, high-performance, runtime mechanism to partition shared caches’. Proc. MICRO, 2006, pp. 423432.
    22. 22)
      • 16. Jaleel, A.: ‘Memory characterization of workloads using instrumentation-driven simulation – a pin-based memory characterization of the SPEC CPU2000 and SPEC CPU2006 benchmark suites’. Versatile Systems and Simulation Advanced Development Technical Report, 2007.
    23. 23)
      • 17. Qureshi, M.K., Jaleel, A., Patt, Y.N., Steely, S.C., Emer, J.: ‘Adaptive insertion policies for high-performance caching’. Proc. ISCA, 2007, pp. 381391.
    24. 24)
    25. 25)
    26. 26)
      • 13. Hayenga, M., Nere, A., Lipasti, M.: ‘MadCache: a PC-aware cache insertion policy’. JILP Workshop on Computer Architecture Competitions: Cache Replacement Championship, 2010.
    27. 27)
      • 4. Feng, M., Tian, C., Lin, C., Gupta, R.: ‘Dynamic access distance driven cache replacement’, ACM TACO, 2011, 8, (3), pp. 14:114:30.
    28. 28)
      • 5. Petoumenos, P., Keramidas, G., Kaxiras, S.: ‘Instruction-based reuse-distance prediction for effective cache management’. Proc. SAMOS, 2009, pp. 4958.
    29. 29)
      • 26. Manikantan, R., Rajan, K., Govindarajan, R.: ‘NUcache: an efficient multicore cache organization based on next-use distance’. Proc. HPCA, 2011, pp. 243253.
    30. 30)
      • 7. Duong, N., Zhao, D., Kim, T., Cammarota, R., Valero, M., Veidenbaum, A.V.: ‘Improving cache management policies using dynamic reuse distances’. Proc. MICRO, 2012, pp. 389400.
    31. 31)
      • 3. Wu, C.-J., Jaleel, A., Hasenplaugh, W., Martonosi, M., Steely, Jr.S.C., Emer, J.: ‘Ship: signature-based hit predictor for high performance caching’. Proc. MICRO, 2011, pp. 430441.
    32. 32)
    33. 33)
      • 12. Memik, G., Reinman, G., Mangione-Smith, W.-H.: ‘Just say no: benefits of early cache miss determination’. Proc. HPCA, 2003, pp. 307316.
    34. 34)
      • 25. Kaxiras, S., Hu, Z., Martonosi, M.: ‘Cache decay: exploiting generational behavior to reduce cache leakage power’. Proc. ISCA, 2001, pp. 240251.
    35. 35)
      • 6. Chaudhuri, M., Gaur, J., Bashyam, N., Subramoney, S., Nuzman, J.: ‘Introducing hierarchy-awareness in replacement and bypass algorithms for last-level caches’. Proc. of PACT, 2012, pp. 293304.
    36. 36)
      • 14. Li, L., Tong, D., Xie, Z., Lu, J., Cheng, X.: ‘Optimal bypass monitor for high performance last-level caches’. Proc. PACT, 2012, pp. 315324.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cdt.2014.0150
Loading

Related content

content/journals/10.1049/iet-cdt.2014.0150
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address