Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems

Published in American Control Conference, 2018

Recommended citation: E. Tolstaya, A. Koppel, E. Stump, and A. Ribeiro, “Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems,” American Control Conference, June 27-29, 2018. http://katetolstaya.github.io/files/c_2018_tolstaya_etal_a.pdf