MCTS with Refinement for Proposals Selection Games in Scene Understanding
Sinisa Stekovic, Mahdi Rad, Alireza Moradi, Friedrich Fraundorfer, and, Vincent Lepetit

TL;DR
This paper introduces a novel MCTS-based method for scene understanding that jointly selects and refines proposals, demonstrated on floor plan reconstruction and 3D room layout estimation, achieving state-of-the-art results.
Contribution
It adapts Monte Carlo Tree Search for proposal selection and refinement in scene understanding, including a differentiable renderer for polygonal shapes, applicable to 2D and 3D layout reconstruction.
Findings
Significant improvements over state-of-the-art on Structured3D and Floor-SP datasets.
Effective joint proposal selection and optimization without hard constraints.
High accuracy in 3D room layout reconstruction from images.
Abstract
We propose a novel method applicable in many scene understanding problems that adapts the Monte Carlo Tree Search (MCTS) algorithm, originally designed to learn to play games of high-state complexity. From a generated pool of proposals, our method jointly selects and optimizes proposals that minimize the objective term. In our first application for floor plan reconstruction from point clouds, our method selects and refines the room proposals, modelled as 2D polygons, by optimizing on an objective function combining the fitness as predicted by a deep network and regularizing terms on the room shapes. We also introduce a novel differentiable method for rendering the polygonal shapes of these proposals. Our evaluations on the recent and challenging Structured3D and Floor-SP datasets show significant improvements over the state-of-the-art, without imposing hard constraints nor assumptions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications
