Yao, Shixuan; Liu, Xiaochen; Zhang, Yinghui; Cui, Ze: An approach to solving optimal control problems of nonlinear systems by introducing detail-reward mechanism in deep reinforcement learning. (English) Zbl 1509.49003 Math. Biosci. Eng. 19, No. 9, 9258-9290 (2022). MSC: 49J20 93C10 35F21
Duan, Dandan; Liu, Chunsheng; Zhang, Shaojie: Robust optimal control for finite-horizon zero-sum differential games via a plug-n-play event-triggered scheme. (English) Zbl 1441.93175 J. Franklin Inst. 357, No. 10, 5989-6017 (2020). MSC: 93C65 93B35 49N70 91A23 91A05
Xiao, Geyang; Zhang, Huaguang; Qu, Qiuxia; Jiang, He: General value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systems. (English) Zbl 1393.90129 J. Franklin Inst. 355, No. 5, 2610-2630 (2018). MSC: 90C39 49M30 68T05 92B20
Luo, Biao; Liu, Derong; Huang, Tingwen; Yang, Xiong; Ma, Hongwen: Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems. (English) Zbl 1433.49032 Inf. Sci. 411, 66-83 (2017). MSC: 49K21 90C39 93C55 93D20
Zhang, Jilie; Liang, Hongjing; Feng, Tao: Optimal control for nonlinear continuous systems by adaptive dynamic programming based on fuzzy basis functions. (English) Zbl 1465.49020 Appl. Math. Modelling 40, No. 13-14, 6766-6774 (2016). MSC: 49K15 49L20 93C42
Lun, Shu-xian; Yao, Xian-shuang; Hu, Hai-feng: A new echo state network with variable memory length. (English) Zbl 1428.68245 Inf. Sci. 370-371, 103-119 (2016). MSC: 68T05 62M10 62M20
Wei, Qinglai; Liu, Derong; Xu, Yancai: Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach. (English) Zbl 1369.93318 Soft Comput. 20, No. 2, 697-706 (2016). MSC: 93C40 93C55 49L20 93C10
Zhang, Jilie; Zhang, Huaguang; Wang, Binrui; Cai, Tiaoyang: Nearly data-based optimal control for linear discrete model-free systems with delays via reinforcement learning. (English) Zbl 1333.93270 Int. J. Syst. Sci., Princ. Appl. Syst. Integr. 47, No. 7, 1563-1573 (2016). MSC: 93E20 93E10 93C05 93C55 68T05
Luo, Biao; Wu, Huai-Ning; Huang, Tingwen; Liu, Derong: Reinforcement learning solution for HJB equation arising in constrained optimal control problem. (English) Zbl 1397.49044 Neural Netw. 71, 150-158 (2015). MSC: 49M30 49M37 68U20 68T05 92B20
Wei, Qinglai; Liu, Derong; Lewis, Frank L.: Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games. (English) Zbl 1386.93023 Inf. Sci. 317, 96-113 (2015). MSC: 93A14 90C39 93C40 68T42 91A10 49N70
Song, Ruizhuo; Xiao, Wendong; Wei, Qinglai; Sun, Changyin: Neural-network-based approach to finite-time optimal control for a class of unknown nonlinear systems. (English) Zbl 1326.49041 Soft Comput. 18, No. 8, 1645-1653 (2014). MSC: 49L20 93C40 90C39 93C10 92B20
Yang, Xiong; Liu, Derong; Wang, Ding: Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints. (English) Zbl 1317.93158 Int. J. Control 87, No. 3, 553-566 (2014). MSC: 93C40 68T05 93C10 93C15 49N90
Yang, Xiong; Liu, Derong; Wang, Ding; Wei, Qinglai: Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning. (English) Zbl 1308.93116 Neural Netw. 55, 30-41 (2014). MSC: 93C40 68T05 92B20
Song, Ruizhuo; Xiao, Wendong; Wei, Qinglai: Multi-objective optimal control for a class of nonlinear time-delay systems via adaptive dynamic programming. (English) Zbl 1316.49038 Soft Comput. 17, No. 11, 2109-2115 (2013). MSC: 49M30 90C39 90C29
Heydari, Ali; Balakrishnan, S. N.: Fixed-final-time optimal control of nonlinear systems with terminal constraints. (English) Zbl 1297.93109 Neural Netw. 48, 61-71 (2013). MSC: 93C55 68T05 92B20
Mohler, Ronald R.; Kolodziej, Wojciech J.: Optimal control of a class of nonlinear stochastic systems. (English) Zbl 0474.93075 IEEE Trans. Autom. Control 26, 1048-1053 (1981). MSC: 93E20 93C10 93E11 49J55 60H10 34F05