Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents
Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Authors: F. L. Da Silva, P. Hernandez-Leal , B. Kartal , M. E. Taylor
Diachronic Embedding for Temporal Knowledge Graph Completion
Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Authors: R. Goel, S. M. Kazemi , M. A. Brubaker , P. Poupart
A Survey and Critique of Multiagent Deep Reinforcement Learning
Journal of Autonomous Agents and Multiagent Systems (JAAMAS), 2019
Towers of Saliency: A Reinforcement Learning Visualization Using Immersive Environments
ACM Interactive Surfaces and Spaces (ISS), 2019
Authors: N. Douglas, D. Yim, B. Kartal , P. Hernandez-Leal , M. E. Taylor , F. Maurer
Maximum Entropy Monte-Carlo Planning
Neural Information Processing Systems (NeurIPS), 2019
Authors: C. Xiao, R. Huang , J. Mei, D. Schuurmans, M. Müller
Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces
Neural Information Processing Systems (NeurIPS), 2019
Authors: B. Wang, N. Hegde
Noise Flow: Noise Modeling with Conditional Normalizing Flows
International Conference on Computer Vision (ICCV), 2019
Authors: A. Abdelhamed, M. A. Brubaker , M. S. Brown
Similarity-Preserving Knowledge Distillation
International Conference on Computer Vision (ICCV), 2019
Authors: F. Tung , G. Mori
Lifelong GAN: Continual Learning for Conditional Image Generation
International Conference on Computer Vision (ICCV), 2019
Authors: *M. Zhai, *L. Chen, F. Tung , J. He , M. Nawhal, G. Mori
* Denotes equal contribution
On Hard Exploration for Reinforcement Learning: a Case Study in Pommerman
AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE), 2019