Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting
Purushothaman Natarajan, Kamal Basha, Athira Nambiar

TL;DR
Synth-SONAR introduces a novel sonar image synthesis framework using dual diffusion models and GPT prompting, significantly improving diversity and realism of synthetic sonar data for underwater applications.
Contribution
It is the first to integrate GPT prompting with sonar diffusion models, creating a large, diverse, and realistic sonar dataset from textual prompts.
Findings
Achieves state-of-the-art quality in synthetic sonar images
Enhances diversity and realism of generated sonar datasets
Bridges textual descriptions with sonar image synthesis
Abstract
Sonar image synthesis is crucial for advancing applications in underwater exploration, marine biology, and defence. Traditional methods often rely on extensive and costly data collection using sonar sensors, jeopardizing data quality and diversity. To overcome these limitations, this study proposes a new sonar image synthesis framework, Synth-SONAR leveraging diffusion models and GPT prompting. The key novelties of Synth-SONAR are threefold: First, by integrating Generative AI-based style injection techniques along with publicly available real/simulated data, thereby producing one of the largest sonar data corpus for sonar research. Second, a dual text-conditioning sonar diffusion model hierarchy synthesizes coarse and fine-grained sonar images with enhanced quality and diversity. Third, high-level (coarse) and low-level (detailed) text-based sonar generation methods leverage advanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Image Retrieval and Classification Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Weight Decay · Cosine Annealing · Dropout · Layer Normalization · Byte Pair Encoding · Softmax · Multi-Head Attention · Dense Connections
