Modares, Hamidreza; Nageshrao, Subramanya P.; Lopes, Gabriel A. Delgado; Babuška, Robert; Lewis, Frank L. Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning. (English) Zbl 1343.93006 Automatica 71, 334-341 (2016). MSC: 93A14 68T05 68T42 PDFBibTeX XMLCite \textit{H. Modares} et al., Automatica 71, 334--341 (2016; Zbl 1343.93006) Full Text: DOI
Abouheaf, Mohammed I.; Lewis, Frank L.; Vamvoudakis, Kyriakos G.; Haesaert, Sofie; Babuska, Robert Multi-agent discrete-time graphical games and reinforcement learning solutions. (English) Zbl 1367.91032 Automatica 50, No. 12, 3038-3053 (2014). MSC: 91A25 91A43 91A55 PDFBibTeX XMLCite \textit{M. I. Abouheaf} et al., Automatica 50, No. 12, 3038--3053 (2014; Zbl 1367.91032) Full Text: DOI Link