Konda, Vijay R.; Tsitsiklis, John N. Convergence rate of linear two-time-scale stochastic approximation. (English) Zbl 1094.62103 Ann. Appl. Probab. 14, No. 2, 796-819 (2004). MSC: 62L20 60F05 PDFBibTeX XMLCite \textit{V. R. Konda} and \textit{J. N. Tsitsiklis}, Ann. Appl. Probab. 14, No. 2, 796--819 (2004; Zbl 1094.62103) Full Text: DOI arXiv
Konda, Vijay R.; Tsitsiklis, John N. Linear stochastic approximation driven by slowly varying Markov chains. (English) Zbl 1157.93533 Syst. Control Lett. 50, No. 2, 95-102 (2003). MSC: 93E12 62L20 PDFBibTeX XMLCite \textit{V. R. Konda} and \textit{J. N. Tsitsiklis}, Syst. Control Lett. 50, No. 2, 95--102 (2003; Zbl 1157.93533) Full Text: DOI
Konda, Vijay R.; Tsitsiklis, John N. On actor-critic algorithms. (English) Zbl 1049.93095 SIAM J. Control Optimization 42, No. 4, 1143-1166 (2003). MSC: 93E35 68T05 PDFBibTeX XMLCite \textit{V. R. Konda} and \textit{J. N. Tsitsiklis}, SIAM J. Control Optim. 42, No. 4, 1143--1166 (2003; Zbl 1049.93095) Full Text: DOI