Better Long-Range Dependency by Bootstrapping A Mutual Information Regularizer
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Authors: Y. Cao , P. Xu
Multi Type Mean Field Reinforcement Learning
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2020
Authors: S. Subramanian, P. Poupart , M. E. Taylor , N. Hegde
Adapting Grad-CAM for Embedding Networks
IEEE Winter Conference on Applications of Computer Vision (WACV), 2020
Authors: L. Chen, J. Chen , H. Hajimirsadeghi , G. Mori
Diachronic Embedding for Temporal Knowledge Graph Completion
Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Authors: R. Goel, S. M. Kazemi , M. A. Brubaker , P. Poupart
Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents
Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Authors: F. L. Da Silva, P. Hernandez-Leal , B. Kartal , M. E. Taylor
Evaluating Lossy Compression Rates of Deep Generative Models
Workshop on Bayesian Deep Learning (NeurIPS), 2019
Authors: S. Huang, A. Makhzani, Y. Cao , R. Grosse
Maximum Entropy Monte-Carlo Planning
Neural Information Processing Systems (NeurIPS), 2019
Authors: C. Xiao, R. Huang , J. Mei, D. Schuurmans, M. Müller
Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces
Neural Information Processing Systems (NeurIPS), 2019
Authors: B. Wang, N. Hegde
Towers of Saliency: A Reinforcement Learning Visualization Using Immersive Environments
ACM Interactive Surfaces and Spaces (ISS), 2019
Authors: N. Douglas, D. Yim, B. Kartal , P. Hernandez-Leal , M. E. Taylor , F. Maurer
Similarity-Preserving Knowledge Distillation
International Conference on Computer Vision (ICCV), 2019
Authors: F. Tung , G. Mori