https://hal-auf.archives-ouvertes.fr/hal-01358345 Contributor : François LévyConnect in order to contact the contributor Submitted on : Wednesday, August 31, 2016 - 3:19:17 PM Last modification on : Thursday, July 9, 2020 - 3:04:42 AM
Pegah Alizadeh, Yann Chevaleyre, François Lévy. Advantage Based Value Iteration for Markov Decision Processes with Unknown Rewards. International Joint Conference on Neural Networks (IJCNN 2016), Jul 2016, Vancouver, Canada. ⟨hal-01358345⟩