I’m a research scientist at Facebook AI Research (FAIR), and an assistant professor in the School of Interactive Computing at Georgia Tech.
From 2013-2016, I was an assistant professor in the Bradley Department of Electrical and Computer Engineering at Virginia Tech, where I led the VT Machine Learning and Perception group and was a member of the Virginia Center for Autonomous Systems (VaCAS) and the VT Discovery Analytics Center (DAC). From 2010-2012, I was a research assistant professor at Toyota Technological Institute at Chicago (TTIC), a philanthropically endowed academic computer science institute located on the University of Chicago campus. I received my MS and PhD degrees from Carnegie Mellon University in 2007 and 2010 respectively, advised by Tsuhan Chen.
I am a recipient of the Office of Naval Research (ONR) Young Investigator Program (YIP) award, the National Science Foundation (NSF) CAREER award (2014), Army Research Office (ARO) Young Investigator Program (YIP) award (2014), Virginia Tech College of Engineering Outstanding New Assistant Professor award (2015), two Google Faculty Research Awards (2013, 2015), Amazon Academic Research award (2016), Carnegie Mellon Dean’s Fellowship (2007) and several teaching commendations at Virginia Tech. Research from my lab has been featured in Bloomberg Business, The Boston Globe, MIT Technology Review, Newsweek, WVTF Radio IQ and a number of popular press magazines and newspapers.
Interests
Machine learning, computer vision, and AI, with a focus on developing intelligent systems that are able to concisely summarize their beliefs about the world with diverse predictions, integrate information and beliefs across different sub-components or "modules" of AI (vision, language, reasoning) to extract a holistic view of the world, and explain why they believe what they believe
Latest Publications
IJCAI - January 5, 2021
IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL
Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam
EMNLP - November 30, 2020
Where Are You? Localization from Embodied Dialog
Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson
NeurIPS - November 30, 2020
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data
Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Dhruv Batra, Devi Parikh
ECCV - August 23, 2020
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das
ECCV - August 23, 2020
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee
ECCV - August 23, 2020
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra
ECCV - August 23, 2020
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh
ECCV - August 23, 2020
Spatially Aware Multimodal Transformers for TextVQA
Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal
ICLR - April 26, 2020
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames
Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra
NeurIPS - December 9, 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee
NeurIPS - December 8, 2019
Chasing Ghosts: Instruction Following as Bayesian State Tracking
Peter Anderson, Ayush Shrivastava, Devi Parikh, Dhruv Batra, Stefan Lee
EMNLP - November 3, 2019
Improving Generative Visual Dialog by Answering Diverse Questions
Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das
ICCV - October 31, 2019
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
Ramprasaath R. Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Shalini Ghosh, Larry Heck, Dhruv Batra, Devi Parikh
ICCV - October 27, 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra
ICCV - October 27, 2019
Embodied Amodal Recognition: Learning to Move to Perceive Objects
Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall, Devi Parikh, Dhruv Batra
ICCV - October 27, 2019
NoCaps: Novel object captioning at scale
Harsh Agrawal, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson
ACL - July 28, 2019
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication
Jin-Hwa Kim, Nikita Kitaev, Xinlei Chen, Marcus Rohrbach, Byoung-Tak Zhang, Yuandong Tian, Dhruv Batra, Devi Parikh
CVPR - June 18, 2019
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra
CVPR - June 16, 2019
Towards VQA Models That Can Read
Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach
ICML - June 11, 2019
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering
Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh
NAACL - June 5, 2019
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog
Satwik Kottur, José M.F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach
ICLR - May 10, 2019
Insights on Visual Representations for Embodied Navigation Tasks
Erik Wijmans, Julian Straub, Dhruv Batra, Judy Hoffman, Ari Morcos
ICLR - May 6, 2019
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future
Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra
CoRL 2018 - October 29, 2018
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition
Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh
ECCV - September 14, 2018
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
Satwik Kottur, José M.F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach
ECCV 2018 - September 9, 2018
Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance
Ramprasaath R. Selvaraju, Prithvijit Chattopadhyay, Mohamed Elhoseiny, Tilak Sharma, Dhruv Batra, Devi Parikh, Stefan Lee
ECCV 2018 - September 9, 2018
Graph R-CNN for Scene Graph Generation
Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh
CoRL - August 1, 2018
Neural Modular Control for Embodied Question Answering
Abhishek Das, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
CVPR 2018 - June 18, 2018
Embodied Question Answering
Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra
CVPR 2018 - June 18, 2018
Don’t Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering
Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Aniruddha Kembhavi
NIPS 2017 - December 4, 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu, Anitha Kannan, Jianwei Yang, Devi Parikh, Dhruv Batra
HCOMP - October 24, 2017
Evaluating Visual Conversational Agents via Cooperative Human-AI Games
Prithvijit Chattopadhyay, Deshraj Yadav, Viraj Prabhu, Arjun Chandrasekaran, Abhishek Das, Stefan Lee, Dhruv Batra, Devi Parikh
ICCV 2017 - October 22, 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das, Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra
ICCV 2017 - October 22, 2017
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra
EMNLP 2017 - September 9, 2017
ParlAI: A Dialog Research Software Platform
Alexander H. Miller, Will Feng, Adam Fisch, Jiasen Lu, Dhruv Batra, Antoine Bordes, Devi Parikh, Jason Weston
EMNLP - September 7, 2017
Natural Language Does Not Emerge ‘Naturally’ in Multi-Agent Dialog
Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra
ICLR 2017 - April 24, 2017
LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation
Jianwei Yang, Anitha Kannan, Dhruv Batra, Devi Parikh
EMNLP 2017 - September 8, 2017
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Mike Lewis, Denis Yarats, Yann Dauphin, Devi Parikh, Dhruv Batra
Latest News

May 2, 2018
Embodied Question Answering: A goal-driven approach to autonomous agents
External Blog

Videos
All Videos
Improving Vision-and-Language Navigation with Web Image-Text Pairs
1:30 | August 24, 2020