Scene Depth Estimation from Traditional Oriental Landscape Paintings
Sungho Kang, YeongHyeon Park, Hyunkyu Park, Juneho Yi

TL;DR
This paper introduces a novel two-step framework combining image translation and matching to estimate scene depth from traditional oriental landscape paintings, aiding visually impaired appreciation.
Contribution
It presents the first method to estimate depth in oriental landscape paintings using a combination of CycleGAN, CLIP, and depth models, bridging art and 3D understanding.
Findings
Effective depth prediction for oriental paintings
First approach to depth estimation in this art style
Potential aid for visually impaired viewers
Abstract
Scene depth estimation from paintings can streamline the process of 3D sculpture creation so that visually impaired people appreciate the paintings with tactile sense. However, measuring depth of oriental landscape painting images is extremely challenging due to its unique method of depicting depth and poor preservation. To address the problem of scene depth estimation from oriental landscape painting images, we propose a novel framework that consists of two-step Image-to-Image translation method with CLIP-based image matching at the front end to predict the real scene image that best matches with the given oriental landscape painting image. Then, we employ a pre-trained SOTA depth estimation model for the generated real scene image. In the first step, CycleGAN converts an oriental landscape painting image into a pseudo-real scene image. We utilize CLIP to semantically match landscape…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
MethodsResidual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Instance Normalization · PatchGAN · Batch Normalization · Residual Block · Sigmoid Activation · Cycle Consistency Loss
