# A model integrating attention mechanism and generative adversarial network for image style transfer

**Authors:** Miaomiao Fu, Yixing Liu, Rongrong Ma, Binbin Zhang, Linli Wu, Lingli Zhu

PMC · DOI: 10.7717/peerj-cs.2332 · PeerJ Computer Science · 2024-09-20

## TL;DR

This paper introduces a new image style transfer model that uses attention mechanisms and a cycle-consistent network to better handle long-distance pixel dependencies.

## Contribution

The novel contribution is integrating attention mechanisms with a cycle-consistent GAN to improve style transfer performance.

## Key findings

- The model effectively handles long-distance dependencies between image content and style.
- Experiments on the monet2photo dataset showed a 45% misjudgment rate in AMT perceptual studies.
- The attention mechanism suppresses non-target style features while enhancing relevant ones.

## Abstract

Image style transfer is an important way to combine different styles and contents to generate new images, which plays an important role in computer vision tasks such as image reconstruction and image texture synthesis. In style transfer tasks, there are often long-distance dependencies between pixels of different styles and contents, and existing neural network-based work cannot handle this problem well. This paper constructs a generation model for style transfer based on the cycle-consistent network and the attention mechanism. The forward and backward learning process of the cycle-consistent mechanism could make the network complete the mismatch conversion between the input and output of the image. The attention mechanism enhances the model’s ability to perceive the long-distance dependencies between pixels in process of learning feature representation from the target content and the target styles, and at the same time suppresses the style feature information of the non-target area. Finally, a large number of experiments were carried out in the monet2photo dataset, and the results show that the misjudgment rate of Amazon Mechanical Turk (AMT) perceptual studies achieves 45%, which verified that the cycle-consistent network model with attention mechanism has certain advantages in image style transfer.

## Full-text entities

- **Genes:** ASPM (assembly factor for spindle microtubules) [NCBI Gene 259266] {aka ASP, Calmbp1, MCPH5}
- **Diseases:** stroke (MESH:D020521), CIN (MESH:D020763)
- **Chemicals:** CycleGAN (-), water (MESH:D014867), oil (MESH:D009821)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11419672/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11419672/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC11419672/full.md

---
Source: https://tomesphere.com/paper/PMC11419672