×

Neural-network-based output-feedback control with stochastic communication protocols. (English) Zbl 1429.93112

Summary: This paper is concerned with the neural-network-based (NN-based) output-feedback control issue for a class of nonlinear systems. For the purpose of effectively mitigating the phenomena of data congestion/collision, the stochastic communication protocols are favorably utilized to orchestrate the data transmissions, and the resultant closed-loop plant is represented by a so-called protocol-induced Markovian jump system with uncertain transition probability matrices. Taking such an uncertainty probability into account, a novel iterative adaptive dynamic programming (ADP) algorithm is developed to obtain the desired suboptimal solution with the help of auxiliary quasi-HJB equation, and the algorithm convergence is also investigated via the intensive use of the mathematical analysis. In this ADP framework, an NN-based observer with a novel adaptive tuning law is first adopted to reconstruct the system states. Then, based on the reconfigurable system, an actor-critic NN scheme with online learning is developed to realize the considered control strategy. Furthermore, in light of the Lyapunov theory, some sufficient conditions are derived to guarantee the stability of the zero equilibrium point of the closed-loop system as well as the boundedness of the estimation errors for critic and actor NN weights. Finally, a simulation example is employed to demonstrate the effectiveness of the developed suboptimal control scheme.

MSC:

93B52 Feedback control
93E03 Stochastic systems in control theory (general)
93C10 Nonlinear systems in control theory
90C39 Dynamic programming
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Al-Tamimi, A.; Lewis, F. L.; Abu-Khalaf, M., Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof, IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, 38, 4, 943-949 (2008)
[2] Dierks, T.; Jagannathan, S., Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update, IEEE Transactions on Neural Networks and Learning Systems, 23, 7, 1118-1129 (2012)
[3] Ding, D.; Wang, Z.; Han, Q.-L.; Ge, X., A survey on model-based distributed control and filtering for industrial cyber-physical systems, IEEE Transactions on Industrial Informatics (2019), (in press)
[4] Ding, D.; Wang, Z.; Han, Q.-L.; Wei, G., Neural-network-based output-feedback control under Round-Robin scheduling protocols, IEEE Transactions on Cybernetics, 49, 6, 2372-2384 (2019)
[5] Ge, S. S.; Lee, T. H.; Li, G. Y.; Zhang, J., Adaptive NN control for a class of discrete-time non-linear systems, International Journal of Control, 76, 4, 334-354 (2003) · Zbl 1048.93055
[6] Jagannathan, S.; Vandegrift, M. W.; Lewis, F. L., Adaptive fuzzy logic control of discrete-time dynamical systems, Automatica, 36, 2, 229-241 (2000) · Zbl 0942.93020
[7] Kiumarsi, B.; Lewis, F. L., Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems, IEEE Transactions on Neural Networks and Learning Systems, 26, 1, 140-151 (2015)
[8] Liu, D.; Huang, Y.; Wang, D.; Wei, Q., Neural network observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming, International Journal of Control, 86, 9, 1554-1566 (2013) · Zbl 1278.93145
[9] Modares, H.; Lewis, F. L.; Naghibi-Sistani, M.-B., Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks, IEEE Transactions on Neural Networks and Learning Systems, 24, 10, 1513-1525 (2013)
[10] Patrinos, P.; Sopasakis, P.; Sarimveis, H.; Bemporad, A., Stochastic model predictive control for constrained discrete-time Markovian switching systems, Automatica, 50, 2504-2514 (2014) · Zbl 1301.93173
[11] Vamvodakis, K. G.; Lewis, F. L., Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem, Automatica, 46, 5, 878-888 (2010) · Zbl 1191.49038
[12] Vamvoudakis, K. G., Event-triggered optimal adaptive control algorithm for continuous-time nonlinear systems, IEEE/CAA Journal of Automatica Sinica, 1, 3, 282-293 (2014)
[13] Wang, Z.; Lam, J.; Liu, X., Filtering for a class of nonlinear discrete-time stochastic systems with state delays, Journal of Computational and Applied Mathematics, 201, 153-163 (2007) · Zbl 1152.93053
[14] Wei, Q.; Lewis, F. L.; Sun, Q.; Yan, P.; Song, R., Discrete-time deterministic q-learning: A novel convergence analysis, IEEE Transactions on Cybernetics, 47, 5, 1224-1237 (2017)
[15] Xu, H.; Zhao, Q.; Jagannathan, S., Finite-horizon near-optimal output feedback neural network control of quantized nonlinear discrete-time systems with input constraint, IEEE Transactions on Neural Networks and Learning Systems, 26, 8, 1776-1788 (2015)
[16] Zhang, H.; Luo, Y.; Liu, D., Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints, IEEE Transactions on Neural Networks, 20, 9, 1490-1503 (2009)
[17] Zhong, X.; He, H.; Zhang, H.; Wang, Z., Optimal control for unknown discrete-time nonlinear Markov jump systems using adaptive dynamic programming, IEEE Transactions on Neural Networks and Learning Systems, 25, 12, 2141-2155 (2014)
[18] Zou, L.; Wang, Z.; Gao, H., Observer-based \(H_\infty\) control of networked systems with stochastic communication protocol: The finite-horizon case, Automatica, 63, 366-373 (2016) · Zbl 1329.93041
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.