Retrieval-Augmented Fine-Tuning With Preference Optimization For Visual Program Generation
Deokhyung Kang, Jeonghun Cho, Yejin Jeon, Sunbin Jang, Minsub Lee, Jawoon Cho, Gary Geunbae Lee

TL;DR
This paper introduces a two-stage training approach combining retrieval-augmented fine-tuning and preference optimization to improve the accuracy of visual program generation in industrial automation, specifically for Ladder Diagrams.
Contribution
It presents a novel training strategy that outperforms prompting-based methods and enhances industrial VPL code generation accuracy using retrieval and preference techniques.
Findings
Over 10% improvement in program accuracy over supervised fine-tuning
Retrieval-augmented training effectively leverages subroutine reuse
Preference optimization guides models toward more accurate outputs
Abstract
Visual programming languages (VPLs) allow users to create programs through graphical interfaces, which results in easier accessibility and their widespread usage in various domains. To further enhance this accessibility, recent research has focused on generating VPL code from user instructions using large language models (LLMs). Specifically, by employing prompting-based methods, these studies have shown promising results. Nevertheless, such approaches can be less effective for industrial VPLs such as Ladder Diagram (LD). LD is a pivotal language used in industrial automation processes and involves extensive domain-specific configurations, which are difficult to capture in a single prompt. In this work, we demonstrate that training-based methods outperform prompting-based methods for LD generation accuracy, even with smaller backbone models. Building on these findings, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Teaching and Learning Programming
