Adaptive embedded control of cyber-physical systems using reinforcement learning (open access)

References

    1. Juan, D.C., Garg, S., Park, J., et al.: ‘Learning the optimal operating point for many-core systems with extended range voltage/frequency scaling’. 2013 Int. Conf. Hardware/Software Codesign and System Synthesis (CODES+ISSS), 2013, pp. 1–10.
    2. Buini, H.M., Peter, S., Givargis, T.: ‘Including variability of physical models into the design automation of cyber-physical systems’. 2015 52nd ACM/EDAC/IEEE Design Automation Conf. (DAC), 2015, pp. 1–6.
    3. Sala, A.: ‘Computer control under time-varying sampling period: an LMI gridding approach’, Automatica, 2005, 41, (12), pp. 2077–2082.
    4. Cervin, A., Velasco, M., Martí, P., et al.: ‘Optimal online sampling period assignment: theory and experiments’, IEEE Trans. Control Syst. Technol., 2011, 19, (4), pp. 902–910.
    5. El Tantawy, S., Abdulhai, B., Abdelgawad, H.: ‘Multiagent reinforcement learning for integrated network of adaptive traffic signal controllers (MARLIN-ATSC): methodology and large-scale application on downtown Toronto’, IEEE Trans. Intell. Transport. Syst., 2013, 14, (3), pp. 1140–1150.
    6. Kara, E.C., Berges, M., Krogh, B., et al.: ‘Using smart devices for system-level management and control in the smart grid: a reinforcement learning framework’. 2012 IEEE Third Int. Conf. Smart Grid Communications (SmartGridComm), 2012, pp. 85–90.
    7. Lillicrap, T.P., Hunt, J.J., Pritzel, A., et al.: ‘Continuous control with deep reinforcement learning’, arXiv preprint arXiv:1509.02971, 2015.
    8. Neema, H., Lattmann, Z., Meijer, P., et al.: ‘Design space exploration and manipulation for cyber physical systems’. IFIP First Int. Workshop on Design Space Exploration of Cyber-Physical Systems (IDEAL), 2014.
    9. Henriksson, D., Cervin, A.: ‘Optimal on-line sampling period assignment for real-time control tasks based on plant state information’. 44th IEEE Conf. Decision and Control, 2005 and 2005 European Control Conf. CDC-ECC'05, 2005, pp. 4469–4474.
    10. Simon, D., Robert, D., Sename, O.: ‘Robust control/scheduling co-design: application to robot control’. 11th IEEE Real Time and Embedded Technology and Applications Symp., 2005. RTAS 2005, 2005, pp. 118–127.
    11. Albertos, P., Salt, J.: ‘Non-uniform sampled-data control of MIMO systems’, Annu. Rev. Control, 2011, 35, (1), pp. 65–76.
    12. Balluchi, A., Murrieri, P., Sangiovanni Vincentelli, A.L.: ‘Controller synthesis on non-uniform and uncertain discrete-time domains’, in Morari, M. (Ed.), ‘Hybrid systems: computation and control’ (Springer, 2005), pp. 118–133.
    13. Khan, S., Goodall, R.M., Dixon, R.: ‘Non-uniform sampling strategies for digital control’, Int. J. Syst. Sci., 2013, 44, (12), pp. 2234–2254.
    14. Albertos, P., Crespo, A.: ‘Real-time control of non-uniformly sampled systems’, Control Eng. Pract., 1999, 7, (4), pp. 445–458.
    15. Khan, S.G., Herrmann, G., Lewis, F.L., et al.: ‘Reinforcement learning and optimal adaptive control: an overview and implementation examples’, Annu. Rev. Control, 2012, 36, (1), pp. 42–59.
    16. Marchand, N., Durand, S., Castellanos, J.F.G.: ‘A general formula for the stabilization of event-based controlled systems’. 2011 50th IEEE Conf. Decision and Control and European Control Conf., 2011, pp. 8199–8204.
    17. Watkins, C.J., Dayan, P.: ‘Q-learning’, Mach. Learn., 1992, 8, (3–4), pp. 279–292.
    18. Sutton, R.S., Barto, A.G.: ‘Reinforcement learning: an introduction’ (MIT Press, 1998).
    19. Ahnert, K., Mulansky, M.: ‘Odeint-solving ordinary differential equations in C++’, arXiv preprint arXiv:1110.3397, 2011.
    20. Durand, S., Castellanos, J.F.G., Marchand, N., et al.: ‘Event-based control of the inverted pendulum: swing up and stabilization’, J. Control Eng. Appl. Inform., 2013, 15, (3), pp. 96–104.
    21. Moore, A.W.: ‘Efficient memory-based learning for robot control’, 1990.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cps.2017.0048