Hachiya, Hirotaka; Peters, Jan; Sugiyama, Masashi Reward-weighted regression with sample reuse for direct policy search in reinforcement learning. (English) Zbl 1237.68147 Neural Comput. 23, No. 11, 2798-2832 (2011). MSC: 68T05 PDFBibTeX XMLCite \textit{H. Hachiya} et al., Neural Comput. 23, No. 11, 2798--2832 (2011; Zbl 1237.68147) Full Text: DOI
Chen, Hong; Li, Luoqing; Peng, Jiangtao Semi-supervised learning based on high density region estimation. (English) Zbl 1396.68089 Neural Netw. 23, No. 7, 812-818 (2010). MSC: 68T05 PDFBibTeX XMLCite \textit{H. Chen} et al., Neural Netw. 23, No. 7, 812--818 (2010; Zbl 1396.68089) Full Text: DOI
Yamada, Makoto; Sugiyama, Masashi; Matsui, Tomoko Semi-supervised speaker identification under covariate shift. (English) Zbl 1194.94154 Signal Process. 90, No. 8, 2353-2361 (2010). MSC: 94A12 PDFBibTeX XMLCite \textit{M. Yamada} et al., Signal Process. 90, No. 8, 2353--2361 (2010; Zbl 1194.94154) Full Text: DOI
Hachiya, Hirotaka; Akiyama, Takayuki; Sugiayma, Masashi; Peters, Jan Adaptive importance sampling for value function approximation in off-policy reinforcement learning. (English) Zbl 1396.68091 Neural Netw. 22, No. 10, 1399-1410 (2009). MSC: 68T05 PDFBibTeX XMLCite \textit{H. Hachiya} et al., Neural Netw. 22, No. 10, 1399--1410 (2009; Zbl 1396.68091) Full Text: DOI
Sugiyama, Masashi; Suzuki, Taiji; Nakajima, Shinichi; Kashima, Hisashi; von Bünau, Paul; Kawanabe, Motoaki Direct importance estimation for covariate shift adaptation. (English) Zbl 1294.62069 Ann. Inst. Stat. Math. 60, No. 4, 699-746 (2008). MSC: 62G05 62G08 PDFBibTeX XMLCite \textit{M. Sugiyama} et al., Ann. Inst. Stat. Math. 60, No. 4, 699--746 (2008; Zbl 1294.62069) Full Text: DOI