Popa, Veronica <1992>
(Università Ca' Foscari Venezia, 2019-03-20)
In this thesis I consider a Reinforcement Learning (RL) approach for policy evaluation, in particular the Q-Learning algorithm (QLa). The QLa is able to dynamically optimize, in real time, its behaviour on the basis of the ...