gaips_bea image1 image2 image3 image4 image5 gaips_ecute_beach_bar_banner gaips_ecute_train_incorrect_ticket_banner
A new convergent variant of Q-learning with linear function approximation


Year 2020
Keywords Reinforcement Learning;
Authors Diogo Carvalho, Francisco S. Melo, Pedro Santos
Booktitle NeurIPS 2020: Thirty-fourth Conference on Neural Information Processing Systems, International Conference on Machine Learning
Pages 21-55
Month December
Pdf File
BibTex bib icon or see it here down icon

@inproceedings { carvalho20, booktitle = {NeurIPS 2020: Thirty-fourth Conference on Neural Information Processing Systems, International Conference on Machine Learning}, keywords = {Reinforcement Learning;}, month = {December}, pages = {21-55}, title = {A new convergent variant of Q-learning with linear function approximation}, year = {2020}, author = {Diogo Carvalho and Francisco S. Melo and Pedro Santos} }

up icon hide this content