3D Multi-View Stylization with Pose-Free Correspondences Matching for Robust 3D Geometry Preservation
Shirsha Bose

TL;DR
This paper presents a multi-view 3D stylization method that preserves geometry and structure without relying on camera poses, using correspondence and depth regularization to improve downstream 3D tasks.
Contribution
It introduces a novel style transfer approach with a correspondence-based consistency loss and depth preservation, enabling robust multi-view stylization for 3D applications.
Findings
Correspondence and depth regularization improve structural consistency.
The method enhances SLAM stability and geometry reconstruction.
It maintains competitive stylization quality across diverse scenes.
Abstract
Artistic style transfer is well studied for images and videos, but extending it to multi-view 3D scenes remains difficult because stylization can disrupt correspondences needed by geometry-aware pipelines. Independent per-view stylization often causes texture drift, warped edges, and inconsistent shading, degrading SLAM, depth prediction, and multi-view reconstruction. This thesis addresses multi-view stylization that remains usable for downstream 3D tasks without assuming camera poses or an explicit 3D representation during training. We introduce a feed-forward stylization network trained with per-scene test-time optimization under a composite objective coupling appearance transfer with geometry preservation. Stylization is driven by an AdaIN-inspired loss from a frozen VGG-19 encoder, matching channel-wise moments to a style image. To stabilize structure across viewpoints, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
