Inferring and Executing Programs for Visual Reasoning (clevr-iep)

Existing methods for visual reasoning attempt to directly map inputs to outputs using black-box architectures without explicitly modeling the underlying reasoning processes. As a result, these black-box models often learn to exploit biases in the data rather than learning to perform visual reasoning.


This repository contains a Torch implementation of ResNeXt, a simple, highly modularized network architecture for image classification.