Can Small GenAI Language Models Rival Large Language Models in Understanding Application Behavior?

Mohammad Meymani; Hamed Jelodar; Parisa Hamedi; Roozbeh Razavi-Far; and Ali A. Ghorbani

arXiv:2511.12576·cs.SE·November 18, 2025

Can Small GenAI Language Models Rival Large Language Models in Understanding Application Behavior?

Mohammad Meymani, Hamed Jelodar, Parisa Hamedi, Roozbeh Razavi-Far, and Ali A. Ghorbani

PDF

Open Access

TL;DR

This paper evaluates small and large GenAI language models in understanding application behavior, especially malware detection, showing small models are competitive in precision and recall while being more resource-efficient.

Contribution

It systematically compares small and large GenAI models in application behavior analysis, highlighting the practical viability of small models for resource-constrained environments.

Findings

01

Small models maintain competitive precision and recall.

02

Large models achieve higher overall accuracy.

03

Small models offer advantages in computational efficiency.

Abstract

Generative AI (GenAI) models, particularly large language models (LLMs), have transformed multiple domains, including natural language processing, software analysis, and code understanding. Their ability to analyze and generate code has enabled applications such as source code summarization, behavior analysis, and malware detection. In this study, we systematically evaluate the capabilities of both small and large GenAI language models in understanding application behavior, with a particular focus on malware detection as a representative task. While larger models generally achieve higher overall accuracy, our experiments show that small GenAI models maintain competitive precision and recall, offering substantial advantages in computational efficiency, faster inference, and deployment in resource-constrained environments. We provide a detailed comparison across metrics such as accuracy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Adversarial Robustness in Machine Learning