In this chapter we present a continuous-time adaptive dynamic programming (ADP) procedure that uses the idea of integral reinforcement learning (IRL) to find online the Nash-equilibrium solution for the two-player zero-sum (ZS) differential game. We consider continuous-time (CT) linear dynamics of the form x= Ax + B1w + B2u, where u(t), w(t) are the control actions of the two players, and an infinite-horizon quadratic cost. This work is from Vrabie and Lewis (2010).
Integral reinforcement learning for zero-sum two-player games, Page 1 of 2
< Previous page Next page > /docserver/preview/fulltext/books/ce/pbce081e/PBCE081E_ch11-1.gif /docserver/preview/fulltext/books/ce/pbce081e/PBCE081E_ch11-2.gif