The Role of Text-to-Image Models in Advanced Style Transfer Applications: A Case Study with DALL-E 3
Ebubechukwu Ike

TL;DR
This paper explores integrating DALL-E 3 with neural style transfer to improve artistic diversity and quality, demonstrating that it enhances output while maintaining competitive processing times.
Contribution
It introduces a novel combination of DALL-E 3 generated style images with neural style transfer, improving artistic quality and diversity in stylized images.
Findings
DALL-E 3 increases diversity and artistic quality of stylized images.
The integrated method reduces overall processing time by 2.5 seconds.
The approach balances improved visual results with acceptable processing time trade-offs.
Abstract
While DALL-E 3 has gained popularity for its ability to generate creative and complex images from textual descriptions, its application in the domain of style transfer remains slightly underexplored. This project investigates the integration of DALL-E 3 with traditional neural style transfer techniques to assess the impact of generated style images on the quality of the final output. DALL-E 3 was employed to generate style images based on the descriptions provided and combine these with the Magenta Arbitrary Image Stylization model. This integration is evaluated through metrics such as the Structural Similarity Index Measure (SSIM) and Peak Signal-to-Noise Ratio (PSNR), as well as processing time assessments. The findings reveal that DALL-E 3 significantly enhances the diversity and artistic quality of stylized images. Although this improvement comes with a slight increase in style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
