IndoLayout: Leveraging Attention for Extended Indoor Layout Estimation from an RGB Image
Shantanu Singh
Jaidev Shriram
Shaantanu Kulkarni
Brojeshwar Bhowmick
K. Madhava Krishna
IIIT Hyderabad
IIIT Hyderabad
IIIT Hyderabad
TCS Research
IIIT Hyderabad
Unlike typical layout estimation methods that only predict occupancy values for regions visible in the RGB image, IndoLayout predicts the occupancy of occluded areas (as shown) using learnt priors.

Abstract

We propose IndoLayout, a novel real-time approach for generating high-quality occupancy maps from an RGB image for indoor scenes. Such occupancy maps are often crucial for path-planning and mapping in indoor environments but are often built using only information contained in the ego view. In contrast, our approach also predicts occupancy values beyond immediately visible regions from just a monocular image, leveraging learnt priors from indoor scenes. Hence, our proposed network can produce a hallucinated, amodal scene layout that includes areas occluded in the RGB image, such as a navigable floor behind a desk. Specifically, we propose a novel architecture that uses self-attention and adversarial learning to vastly improve the quality of the predicted layout. We evaluate our model on several photorealistic indoor datasets and outperform previous relevant work on all metrics that measure layout quality, including newly adopted ones. Finally, we demonstrate the effectiveness of our method by showing significant improvements on the PointNav task over similar approaches using IndoLayout.


IROS 2022 Presentation


[Slides]

Proposed Approach

The goal of our method is, given an input RGB image, generate the corresponding top-view occupancy layout in metric scale. Given the nature of this task, we adopt the GAN framework, and the self-attention module to leverage the benefits of each in our proposed model architecture as shown in the figure above.

We provide access to projects for generating the datasets, as well as training and evaluating our models here: Github.


Paper and Supplementary Material

Shantanu Singh, Jaidev Shriram, Shaantanu Kulkarni, Brojeshwar Bhowmick, K. Madhava Krishna
IndoLayout: Amodal indoor layout estimation
In IROS, 2022.
(hosted on IEEE Explore)


[Bibtex]


This was a template originally made by Richard Zhang for Colorful Image Colorization. The code can be found in this repository.