Integral reinforcement learning for zero-sum two-player games

Access Full Text

Integral reinforcement learning for zero-sum two-player games

For access to this article, please select a purchase option:

Buy chapter PDF
£10.00
(plus tax if applicable)
Buy Knowledge Pack
10 chapters for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Author(s): Draguna Vrabie ; Kyriakos G. Vamvoudakis ; Frank L. Lewis
Source: Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles,2012
Publication date January 2012

In this chapter we present a continuous-time adaptive dynamic programming (ADP) procedure that uses the idea of integral reinforcement learning (IRL) to find online the Nash-equilibrium solution for the two-player zero-sum (ZS) differential game. We consider continuous-time (CT) linear dynamics of the form x= Ax + B1w + B2u, where u(t), w(t) are the control actions of the two players, and an infinite-horizon quadratic cost. This work is from Vrabie and Lewis (2010).

Chapter Contents:

  • 11.1 Zero-sum games for linear systems
  • 11.1.1 Background
  • 11.1.2 Offline algorithm to solve the game algebraic Riccati equation
  • 11.1.3 Continuous-time HDP algorithm to solve Riccati equation
  • 11.2 Online algorithm to solve the zero-sum differential game
  • 11.3 Online load-frequency controller design for a power system
  • 11.4 Conclusion

Inspec keywords: learning (artificial intelligence); differential games; continuous time systems; dynamic programming

Other keywords: integral reinforcement learning; ADP procedure; infinite-horizon quadratic cost; continuous-time linear dynamics; IRL; continuous-time adaptive dynamic programming procedure; CT linear dynamics; Nash-equilibrium solution; ZS differential game; zero-sum two-player games

Subjects: Optimisation techniques; Game theory; Learning in AI (theory)

Preview this chapter:
Zoom in
Zoomout

Integral reinforcement learning for zero-sum two-player games, Page 1 of 2

| /docserver/preview/fulltext/books/ce/pbce081e/PBCE081E_ch11-1.gif /docserver/preview/fulltext/books/ce/pbce081e/PBCE081E_ch11-2.gif

Related content

content/books/10.1049/pbce081e_ch11
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading