Liu, Fei; Zeng, Guangzhou An online multi-agent co-operative learning algorithm in POMDPs. (English) Zbl 1160.68505 J. Exp. Theor. Artif. Intell. 20, No. 4, 335-344 (2008). MSC: 68T05 PDFBibTeX XMLCite \textit{F. Liu} and \textit{G. Zeng}, J. Exp. Theor. Artif. Intell. 20, No. 4, 335--344 (2008; Zbl 1160.68505) Full Text: DOI
Banerjee, Bikramjit; Sen, Sandip; Peng, Jing On-policy concurrent reinforcement learning. (English) Zbl 1066.68106 J. Exp. Theor. Artif. Intell. 16, No. 4, 245-260 (2004). MSC: 68T05 68N99 PDFBibTeX XMLCite \textit{B. Banerjee} et al., J. Exp. Theor. Artif. Intell. 16, No. 4, 245--260 (2004; Zbl 1066.68106) Full Text: DOI
Cichosz, Pawel TD(lambda) learning without eligibility traces: A theoretical analysis. (English) Zbl 1069.68575 J. Exp. Theor. Artif. Intell. 11, No. 2, 239-263 (1999). MSC: 68T05 PDFBibTeX XMLCite \textit{P. Cichosz}, J. Exp. Theor. Artif. Intell. 11, No. 2, 239--263 (1999; Zbl 1069.68575) Full Text: DOI
Carmel, David; Markovitch, Shaul Model-based learning of interaction strategies in multi-agent systems. (English) Zbl 1053.68591 J. Exp. Theor. Artif. Intell. 10, No. 3, 309-332 (1998). MSC: 68T05 68N99 PDFBibTeX XMLCite \textit{D. Carmel} and \textit{S. Markovitch}, J. Exp. Theor. Artif. Intell. 10, No. 3, 309--332 (1998; Zbl 1053.68591) Full Text: DOI
Vidal, Jose M.; Durfee, Edmund H. Learning nested agent models in an information economy. (English) Zbl 1053.68658 J. Exp. Theor. Artif. Intell. 10, No. 3, 291-308 (1998). MSC: 68T05 68U35 PDFBibTeX XMLCite \textit{J. M. Vidal} and \textit{E. H. Durfee}, J. Exp. Theor. Artif. Intell. 10, No. 3, 291--308 (1998; Zbl 1053.68658) Full Text: DOI arXiv