Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Authors: K. Young, B. Wang, M. E. Taylor
Learning a Deep ConvNet for Multi-label Classification with Partial Labels
Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Authors: T. Durand , N. Mehrasa, G. Mori
A Variational Auto-Encoder Model for Stochastic Point Process
Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Authors: N. Mehrasa, A. Jyothi, T. Durand , J. He , Prof. L. Sigal , G. Mori
Uniform Stability and High Order Approximation of SGLD in Non-Convex Learning
Workshop on Understanding and Improving Generalization in Deep Learning (ICML), 2019
Authors: *M. Gazeau, *M. Li
* Denotes equal contribution
A Survey and Critique of Multiagent Deep Reinforcement Learning
Journal of Autonomous Agents and Multiagent Systems (JAAMAS), 2019
Authors: P. Hernandez-Leal , B. Kartal, M. E. Taylor
Safer Deep RL with Shallow MCTS: A Case Study in Pommerman
Workshop on Adaptive Learning Agents (AAMAS), 2019
Authors: B. Kartal, P. Hernandez-Leal , C. Gao, M. E. Taylor
On the Sensitivity of Adversarial Robustness to Input Data Distributions
International Conference on Learning Representations (ICLR), 2019
Authors: G. Weiguang Ding , K. Lui , T. Jin, L. Wang, R. Huang
Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL
Workshop on Reinforcement Learning in Games (AAAI), 2019
Authors: B. Kartal, P. Hernandez-Leal , M. E. Taylor
Few-Shot Self Reminder to Overcome Catastrophic Forgetting
Workshop on Continual Learning (NeurIPS), 2018
Authors: J. Wen, Y. Cao , R. Huang
Skill Reuse in Partially Observable Multiagent Environments
Workshop on Latinx in AI Coalition (NeurIPS), 2018
Authors: P. Hernandez-Leal , B. Kartal, M. E. Taylor
On Learning Wire-Length Efficient Neural Networks
Workshop on Compact Deep Neural Network Representation (NeurIPS), 2018
Authors: L. Wang, G. Castiglione , C. Srinivasa , M. Brubaker