PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation

Eleftherios Ioannou; Steve Maddock

arXiv:2502.16996·cs.CV·February 25, 2025

TL;DR

PQDAST introduces a depth-aware, real-time style transfer method for games that maintains high stylisation quality and temporal stability while significantly reducing memory and processing requirements.

Contribution

The paper presents the first depth-aware, arbitrary style transfer framework integrated into the game pipeline, built with perceptual quality-guided distillation and synthetic training data.

Findings

01. Achieves superior temporal consistency in style transfer for games.

02. Reduces memory usage and processing time compared to existing methods.

03. Maintains stylisation quality comparable to state-of-the-art approaches.

Abstract

Artistic style transfer is concerned with the generation of imagery that combines the content of an image with the style of an artwork. In the realm of computer games, most work has focused on post-processing video frames. Some recent work has integrated style transfer into the game pipeline, but it is limited to single styles. Integrating an arbitrary style transfer method into the game pipeline is challenging due to the memory and speed requirements of games. We present PQDAST, the first solution to address this. We use a perceptual quality-guided knowledge distillation framework and train a compressed model using the FLIP evaluator, which substantially reduces both memory usage and processing time with limited impact on stylisation quality. For better preservation of depth and fine details, we utilise a synthetic dataset with depth and temporal considerations during training. The…
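
The abstract describes using a perceptual quality evaluator to guide the compression of a style transfer model. The sketch below is only a toy illustration of that general idea, not the authors' procedure: it picks the smallest candidate model whose output stays within an error budget of the uncompressed (teacher) output. The names `pick_student` and `mean_abs_diff`, the candidate names, and the parameter counts are all hypothetical, and mean absolute difference is a simple stand-in for FLIP, which is a far more elaborate perceptual metric.

```python
from typing import Callable, Dict, Optional, Sequence, Tuple

Image = Sequence[float]  # flattened image, values in [0, 1]


def mean_abs_diff(reference: Image, test: Image) -> float:
    """Stand-in error metric (NOT FLIP): mean absolute pixel difference."""
    if len(reference) != len(test):
        raise ValueError("images must have the same number of pixels")
    return sum(abs(r - t) for r, t in zip(reference, test)) / len(reference)


def pick_student(
    teacher_out: Image,
    candidates: Dict[str, Tuple[Image, int]],
    max_error: float,
    metric: Callable[[Image, Image], float] = mean_abs_diff,
) -> Optional[str]:
    """Return the name of the smallest candidate within the error budget.

    `candidates` maps a hypothetical model name to a pair of
    (stylised output, parameter count). Returns None if no candidate
    is close enough to the teacher output.
    """
    acceptable = [
        (params, name)
        for name, (output, params) in candidates.items()
        if metric(teacher_out, output) <= max_error
    ]
    # Tuples sort by parameter count first, so min() picks the smallest model.
    return min(acceptable)[1] if acceptable else None


# Toy data: a 4-pixel "teacher" output and three hypothetical compressed models.
teacher = [0.0, 0.5, 1.0, 0.25]
candidates = {
    "large": ([0.0, 0.5, 1.0, 0.25], 1000),
    "medium": ([0.0, 0.5, 0.9, 0.25], 400),
    "small": ([0.2, 0.3, 0.8, 0.45], 100),
}
print(pick_student(teacher, candidates, max_error=0.05))
```

With the toy data above, the mid-sized candidate is selected: the smallest one deviates too much from the teacher output, while the largest is needlessly big. Passing a different `metric` callable is how a real perceptual evaluator would be substituted for the stand-in.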

Taxonomy

Methods: FLIP · Knowledge Distillation · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings