Journal of Guangdong University of Technology ›› 2024, Vol. 41 ›› Issue (02): 129-138.doi: 10.12052/gdutxb.230010

• Comprehensive Studies • Previous Articles    

Nash Differential Game for Discrete-time Markov Jump System with Partially Unknown Transition Probabilities

Zhang Cheng-ke1, Xu Meng2, Yang Lu3   

  1. 1. School of Economics, Guangdong University of Technology, Guangzhou 510520, China;
    2. School of Management, Guangdong University of Technology, Guangzhou 510520, China;
    3. School of Management, Guangdong Polytechnic Normal University, Guangzhou 510180, China
  • Received:2023-01-28 Online:2024-03-25 Published:2024-04-23

Abstract: It is noted that transition probability matrix elements cannot be fully known. How to study Nash differential game for discrete-time Markov jump system (MJS) under the condition of unknown transition probability is one of the problems to be solved. This problem can provide theoretical support for the application of Nash differential game theory in Markov jump systems with partial unknown transition probability to management problems. Based on it, the case of one-player game, which is called the ε-suboptimal control problem, is firstly studied. By using the free-connection weighting matrix and “complete square” method, the sufficient conditions for the existence of the ε-suboptimal cntrol strategy are obtained, and an explicit expression of the upper bound of the cost function is given. Then, the conditions for the existence of ε-suboptimal Nash equilibrium strategy are equivalent to solving the optimization problem, which satisfied the bilinear matrix inequalities (BMIs) and matrix inequalities. The heuristic algorithm is used to solve the optimization problem to obtain the ε-suboptimal Nash equilibrium strategies. Finally, the numerical examples are provided to demonstrate the validity of the main conclusions.

Key words: discrete time Markov jump system, ε-suboptimal control, ε-suboptimal Nash equilibrium

CLC Number: 

  • F224
[1] CHENG J, WANG B, PARK J H, KANG W. Sampled-data reliable control for T–S fuzzy semi-Markovian jump system and its application to single-link robot arm model [J]. IET Control Theory and Applications, 1991, 11(2): 1904-1912.
[2] SHI P, LI F B, WU L G, et al. Neural network-based passive filtering for delayed neutral-type semi-Markovian jump systems [J]. IEEE Transactions on Neural Networks and Learning Systems, 2017, 28(9): 2101-2114.
[3] MOMENZADE M, MOMENZADE S, RABBANI H. Using hidden Markov model to predict recurrence of breast cancer based on sequential patterns in gene expression profiles [J]. Journal of Biomedical Informatics, 2020, 111: 103570.
[4] SAKTHIVEL R, KANAGARAJ R, WANG C. Non-fragile sampled-data guaranteed cost control for bio-economic fuzzy singular Markovian jump systems [J]. IET Control Theory and Applications, 2019, 13(2): 279-287.
[5] SIU T K, ELLIOTT R, BEHAVIORISM J. Hedging options in a doubly Markov-modulated financial market via stochastic flows [J]. International Journal of Theoretical and Applied Finance, 2019, 22(08): 1950047.
[6] CUI D Y, WAND H, SU Y, et al. Fuzzy-model-based tracking control of Markov jump nonlinear systems with incomplete mode information [J]. Journal of the Franklin Institute, 2021, 358(7): 3633-3650.
[7] YAO D Y, LIU M, LIU R Q. Adaptive sliding mode controller design of Markov jump systems with time-varying actuator faults and partly unknown transition probabilities [J]. Journal of the Franklin Institute, 2018, 28: 105-122.
[8] LI Y C, MA S P. Finite and infinite horizon indefinite linear quadratic optimal control for discrete-time singular Markov jump systems [J]. IEEE Transactions on Automatic Control, 2021, 358(17): 8993-9022.
[9] 王志刚. 应用随机过程[M]. 合肥: 中国科学技术大学出版社, 2009.
[10] ZHANG L X, BOUKAS E K. Stability and stabilization of Markovian jump linear systems with partially unknown transition probability [J]. Automatica, 2009, 45(2): 463-468.
[11] ZHANG Y, HE Y, WU M, et al. Stabilization for Markovian jump systems with partial information on transition probability based on free-connection weighting matrices [J]. Automatica, 2011, 47(1): 79-84.
[12] YAN Z G, ZHANG W H, PARK J H. Finite-time guaranteed cost control for Itô Stochastic Markovian jump systems with incomplete transition rates [J]. International Journal of Robust and Nonlinear Control, 2017, 27(1): 66-83.
[13] SONG Q S, YIN G G, ZHANG Z M. Numerical solutions for stochastic differential games with regime switching [J]. IEEE Trans on Automatic Control, 2008, 53(2): 509-521.
[14] ZHU H N, ZHANG C K, BIN N. Infinite horizon linear quadratic stochastic Nash differential games of Markov jump linear systems with its application [J]. International Journal of Systems Science, 2014, 45(5): 1196-1201.
[15] SUN H Y, JIANG L, ZHANG W H. Feedback control on Nash equilibrium for discrete-time stochastic systems with Markovian jumps: Finite-horizon case [J]. International Journal of Control Automation and Systems, 2012, 10(5): 940-946.
[16] MUKAIDANI H, XU H, ZHUANG W H. Robust static output feedback Nash strategy for uncertain Markov jump linear stochastic systems [J]. IET Control Theory and Applications, 2021, 15(11): 1559-1570.
[17] ZHANG C K, LI F C. Non-zero sum differential game for stochastic Markovian jump systems with partially unknown transition probabilities [J]. Journal of the Franklin Institute, 2021, 358(15): 7528-7558.
[18] LI D, LIU S Q, CUI J A. Threshold dynamics and ergodicity of an SIRS epidemic model with Markovian switching [J]. Journal of Differential Equations, 2017, 263(12): 8873-8915.
[19] SUN H Y, Jiang L Y, Zhang W H. Infinite horizon linear quadratic differential games for discrete-time stochastic systems [J]. Journal of Control Theory and Applications, 2012, 10(3): 391-396.
[20] SHI P, BOUKAS E K, SHI Y. et al. Optimal guaranteed cost control of uncertain discrete time-delay systems [J]. Journal of Computational and Applied Mathematics, 2003, 157(2): 435-451.
[21] ZHENG F, WANG Q G, LEE T H. A heuristic approach to solving a class of bilinear matrix inequality problems [J]. Systems & Control Letters., 2002, 47(2): 111-119.
[1] Hong Ze-bin, Wang Feng. An Optimized Transceiver Design for Wireless Powered Over-the-air Computation Systems [J]. Journal of Guangdong University of Technology, 2024, 41(02): 116-121.
[2] Liang Jing-xuan, Wang Feng. Optimized Design for Multiuser Cache-enabled Mobile Edge Computing [J]. Journal of Guangdong University of Technology, 2023, 40(05): 73-80.
[3] Wang Mei-lin, Zeng Jun-jie, Cheng Ke-qiang, Chen Xiao-hang. A Research on Improved Genetic Algorithm for Flexible Job Shop Scheduling Problem Based on MPN [J]. Journal of Guangdong University of Technology, 2021, 38(05): 24-32.
[4] Li Sha-sha, Cui Tie-jun. A Research on the Analysis Method of System Income Based on the Behavior Game Between Operators and Managers [J]. Journal of Guangdong University of Technology, 2021, 38(04): 35-40.
[5] Li Jie-ming, Zhu Huai-nian. Nash Differential Games for Delayed Nonlinear Stochastic Systems with State-and Control-dependent Noise [J]. Journal of Guangdong University of Technology, 2018, 35(01): 41-45,60.
[6] Chen Hai-xia, Zhong Ying-hong. An E-commerce Platform Anti-counterfeiting Strategy Research Based on Cumulative Prospect Theory and Evolutionary Game Theory [J]. Journal of Guangdong University of Technology, 2017, 34(04): 52-57.
[7] Zhou Haiying, Zhang Chengke, Zhu Huainian . Finitetime Nonzerosum Games for Stochastic Singular Systems [J]. Journal of Guangdong University of Technology, 2014, 31(2): 32-35.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!