access icon openaccess Hierarchical resource allocation and consolidation framework in a multi-core server cluster using a Markov decision process model

This paper investigates a service level agreements (SLAs)-based resource allocation problem in a server cluster. The objective is to maximise the total profit, which is the total revenue minus the operational cost of the server cluster. The total revenue depends on the average request response time, whereas the operating cost depends on the total energy consumption of the server cluster. A joint optimisation framework is proposed, comprised of request dispatching, dynamic voltage and frequency scaling (DVFS) for individual cores of the servers, as well as server- and core-level consolidations. Each DVFS-enabled core in the server cluster is modelled by using a continuous-time Markov decision process (CTMDP). A near-optimal solution comprised of a central manager and distributed local agents is presented. Each local agent employs linear programming-based CTMDP solving method to solve the DVFS problem for the corresponding core. On the other hand, the central manager solves the request dispatch problem and finds the optimal number of ON cores and servers, thereby achieving a desirable tradeoff between service response time and power consumption. To reduce the computational overhead, a two-tier hierarchical solution is utilized. Experimental results demonstrate the outstanding performance of the proposed algorithm over the baseline algorithms.

Inspec keywords: cloud computing; power aware computing; resource allocation; Markov processes; contracts; multiprocessing systems; linear programming

Other keywords: joint optimisation framework; service level agreements; hierarchical resource allocation and consolidation framework; SLA-based resource allocation problem; multicore server cluster; dynamic voltage and frequency scaling; continuous-time Markov decision process; request dispatch problem; DVFS; linear programming-based CTMDP solving method; cloud computing; request dispatching; two-tier hierarchical solution

Subjects: Multiprocessing systems; Operating systems; Markov processes; Internet software; Optimisation techniques

References

    1. 1)
      • 17. Wang, Y., Chen, S., Pedram, M.: ‘Service level agreement-based joint application environment assignment and resource allocation in cloud computing systems’. Proc. of IEEE Green Technologies Conf. (GreenTech), 2013.
    2. 2)
      • 26. Srikantaiah, S., Kansal, A., Zhao, F.: ‘Energy aware consolidation for cloud computing’. Workshop on Power Aware Computing and Systems (HotPower’ 08), 2008.
    3. 3)
      • 29. Zhang, Z., Towsley, D., Kurose, J.: ‘Statistical analysis of generalized processor sharing scheduling discipline’. ACM SIGCOMM'94 Conf. on Communications Architectures, Protocols and Applications.
    4. 4)
      • 23. Krauter, K., Buyya, R., Maheswaran, M.: ‘A taxonomy and survey of grid resource management systems for distributed computing’, Softw. Practice Exp., 2002, 32, (2), pp. 135164.
    5. 5)
      • 27. Hwang, I., Kam, T., Pedram, M.: ‘A study of the effectiveness of CPU consolidation in a virtualized multi-core server system’. Proc. of Int. Symp. on Low Power Electronics and Design (ISLPED), 2012.
    6. 6)
      • 7. Wei, G., Vasilakos, A.V., Zheng, Y., et al: ‘A game-theoretic method of fair resource allocation for cloud computing services’, J. Supercomput., 2010, 54, (2), pp. 252269.
    7. 7)
      • 2. Mell, P.: ‘The NIST definition of cloud computing’, 2011, NIST Special Publication 800-145.
    8. 8)
      • 6. Armbrusk, M., Fox, A., Griffith, R., et al: ‘A view of cloud computing’, Commun. ACM, 2010, 53, (4), pp. 5058.
    9. 9)
      • 13. Polverini, M., Cianfrani, A., Ren, S., et al: ‘Thermal-aware scheduling of batch jobs in geographically distributed data centers’, IEEE Trans. Cloud Comput., 2014, 2, (1), pp. 7184.
    10. 10)
      • 32. Burd, T., Pering, T., Stratakos, A., et al: ‘A dynamic voltage-scaled microprocessor system’. IEEE Int. Solid-State Circuits Conf., Digest of Technical Papers, 2000.
    11. 11)
      • 10. Chen, K., Hu, C., Zhang, X., et al: ‘Survey on routing in data centers: insights and future directions’, IEEE Netw., 2011, 25, (4), pp. 610.
    12. 12)
      • 12. Katz, R.H.: ‘Tech titans building boom’, IEEE Spectr., 2009, 46, (2), pp. 4054.
    13. 13)
      • 36. Chen, D.Z., Daescu, O., Dai, Y., et al: ‘Efficient algorithms and implementations for optimizing the sum of linear fractional functions, with applications’, J. Comb. Optim., 2005, 9, (1), pp. 6990.
    14. 14)
      • 20. Zhang, L., Ardagna, D.: ‘SLA based profit optimization in autonomic computing systems’. 2nd Int. Conf. on Service Oriented Computing, 2004.
    15. 15)
      • 3. Wang, B., Qi, Z., Ma, R., et al: ‘A survey on data center networking for cloud computing’, Comput. Netw., 2015, 91, pp. 528547.
    16. 16)
      • 16. Goudarzi, H., Pedram, M.: ‘Multi-dimensional SLA-based resource allocation for multi-tier cloud computing systems’. Proc. of IEEE Int. Conf. on Cloud Computing (CLOUD), 2011.
    17. 17)
      • 33. Dubois, M., Annavaram, M., Stenström, P.: ‘Parallel computer organization and design’ (Cambridge University Press, MA, 2012).
    18. 18)
      • 9. Wang, L., Zhang, F., Vasilakos, A.V., et al: ‘Joint virtual machine assignment and traffic engineering for green data center networks’, ACM SIGMETRICS Perf. Eval. Rev., 2014, 41, (3), pp. 107112.
    19. 19)
      • 34. ‘AMD Processor Power and Thermal Data Sheet’. Available at http://support.amd.com/us/Processor_TechDocs/40036.pdf, accessed 12 March 2015.
    20. 20)
      • 14. Wang, L., Zhang, F., Aroca, J.A., et al: ‘GreenDCN: a general framework for achieving energy efficiency in data center networks’, IEEE J. Sel. Areas Commun., 2014, 32, (1), pp. 415.
    21. 21)
      • 37. Kernighan, B.W., Lin, S.: ‘An efficient heuristic procedure for partitioning graphs’, Bell Syst. Tech. J., 1970, 49, (2), pp. 291307.
    22. 22)
      • 28. Papoulis, A.: ‘Probability, random variables, and stochastic processes’ (McGraw-Hill, 1991, 3rd edn.).
    23. 23)
      • 25. Puterman, M.L.: ‘Markov decision processes: discrete stochastic dynamic programming’ (Wiley Publisher, New York, 1994).
    24. 24)
      • 24. Chandra, A., Gongt, W., Shenoy, P.: ‘Dynamic resource allocation for shared clusters using online measurements’. Int. Conf. on Measurement and Modeling of Computer Systems (SIGMETRICS), 2003, 31, (1), pp. 300301.
    25. 25)
      • 11. Barroso, L.A., Holzle, U.: ‘The case for energy-proportional computing’, IEEE Comput., 2007, 40, (12), pp. 3337.
    26. 26)
      • 19. Liu, Z., Squillante, M.S., Wolf, J.L.: ‘On maximizing service-level-agreement profits’. 3rd ACM Conf. on Electronic Commerce (EC), 2001.
    27. 27)
      • 4. Buyya, R.: ‘Market-oriented cloud computing: vision, hype, and reality of delivering computing as the 5th utility’. 9th IEEE/ACM Int. Symp. on Cluster Computing and the Grid (CCGrid), 2009.
    28. 28)
      • 30. Jung, H., Pedram, M.: ‘Stochastic dynamic thermal management: a Markovian decision-based approach’. Int. Conf. on Computer Design (ICCD), 2006.
    29. 29)
      • 18. Wei, G., Vasilakos, A.V., Zheng, Y., et al: ‘A game-theoretic method for fair resource allocation for cloud computing services’, J. Supercomput., 2010, 54, (2), pp. 252269.
    30. 30)
      • 31. Kleinrock, L.: ‘Queueing systems, volume I: theory’ (Wiley, New York, 1975).
    31. 31)
      • 21. Ardagna, D., Trubian, M., Zhang, L.: ‘SLA based resource allocation policies in autonomic environments’, J. Parallel Distrib. Comput., 2007, 67, (3), pp. 259270.
    32. 32)
      • 8. Mashayekhy, L., Nejad, M.M., Grosu, D., et al: ‘An online mechanism for resource allocation and pricing in clouds’, IEEE Trans. Comput., 2016, 65, (4), pp. 11721184.
    33. 33)
      • 35. Andersen, E.D., Andersen, K.D.: ‘The MOSEK interior point optimizer for linear programming: an implementation of the homogeneous algorithm’, in Frenk, H., Roos, K., Terlaky, T., Zhang, S. (Eds.): ‘High Performance Optimization. Applied Optimization’, (Springer, Boston, MA, 2000), vol. 33, pp. 197232.
    34. 34)
      • 22. Buyya, R., Murshed, M.: ‘Gridsim: a toolkit for the modelling and simulation of distributed resource management and scheduling for grid computing’, Concurrency Comput. Pract. Exp., 2002, 14, (13), pp. 11751220.
    35. 35)
      • 5. Pedram, M.: ‘Energy-efficient datacenters’, IEEE Trans. Comput.-Aided Des., 2012, 31, (10), pp. 14651484.
    36. 36)
      • 1. Hayes, B.: ‘Cloud computing’, Commun. ACM, 2008, 51, (7), pp. 911.
    37. 37)
      • 15. Wang, Y., Chen, S., Goudarzi, H., et al: ‘Resource allocation and consolidation in a multi-core server cluster using a Markov decision process model’. Proc. of Int. Symp. on Quality Electronic Design (ISQED), 2013.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cps.2017.0060
Loading

Related content

content/journals/10.1049/iet-cps.2017.0060
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading