What if the world were different? Gradient-based exploration for new optimal policies. R. Silva, F. S. Melo, M. Veloso. In Proc. 4th Global Conf. Artificial Intelligence, pp. 229-242, 2018 (best paper award).