Evaluating Usage of Images for App Classification
Kushal Singla, Niloy Mukherjee, Hari Manassery Koduvely, Joy Bose

TL;DR
This paper evaluates various methods of using app images to improve app classification accuracy, especially when text data is insufficient, demonstrating that image-based techniques can significantly enhance classification results.
Contribution
The study systematically compares multiple image-based approaches for app classification, showing their effectiveness in supplementing text-based methods.
Findings
Image-based methods improved classification accuracy up to 96%.
Combining OCR and image vectorization enhances app categorization.
Object detection voting technique effectively classifies apps based on images.
Abstract
App classification is useful in a number of applications such as adding apps to an app store or building a user model based on the installed apps. Presently there are a number of existing methods to classify apps based on a given taxonomy on the basis of their text metadata. However, text based methods for app classification may not work in all cases, such as when the text descriptions are in a different language, or missing, or inadequate to classify the app. One solution in such cases is to utilize the app images to supplement the text description. In this paper, we evaluate a number of approaches in which app images can be used to classify the apps. In one approach, we use Optical character recognition (OCR) to extract text from images, which is then used to supplement the text description of the app. In another, we use pic2vec to convert the app images into vectors, then train an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSupport Vector Machine
