GUI Element Detection Using SOTA YOLO Deep Learning Models

Seyed Shayan Daneshvar; Shaowei Wang

arXiv:2408.03507·cs.CV·August 8, 2024

GUI Element Detection Using SOTA YOLO Deep Learning Models

Seyed Shayan Daneshvar, Shaowei Wang

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the performance of the latest YOLO deep learning models specifically for detecting GUI elements, addressing domain-specific challenges like overlapping objects and limited classes.

Contribution

It provides a comparative analysis of recent YOLO models' effectiveness in GUI element detection, a domain-specific application of object detection.

Findings

01

YOLO models achieve high accuracy in GUI element detection

02

Performance varies depending on GUI element types

03

Deep learning models outperform traditional image processing methods

Abstract

Detection of Graphical User Interface (GUI) elements is a crucial task for automatic code generation from images and sketches, GUI testing, and GUI search. Recent studies have leveraged both old-fashioned and modern computer vision (CV) techniques. Oldfashioned methods utilize classic image processing algorithms (e.g. edge detection and contour detection) and modern methods use mature deep learning solutions for general object detection tasks. GUI element detection, however, is a domain-specific case of object detection, in which objects overlap more often, and are located very close to each other, plus the number of object classes is considerably lower, yet there are more objects in the images compared to natural images. Hence, the studies that have been carried out on comparing various object detection models, might not apply to GUI element detection. In this study, we evaluate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shayandaneshvar/gui-element-detection
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection

MethodsYou Only Look Once