From Release to Adoption: Challenges in Reusing Pre-trained AI Models for Downstream Developers
Peerachai Banyongrakkul, Mansooreh Zahedi, Patanamon Thongtanunam, Christoph Treude, Haoyu Gao

TL;DR
This study analyzes 840 GitHub issue reports to identify key challenges faced by developers reusing pre-trained models, revealing seven main challenge categories and longer resolution times for PTM issues.
Contribution
It systematically categorizes PTM-related challenges faced by downstream developers and compares resolution times, providing insights for improving PTM reuse practices.
Findings
Seven key challenge categories identified, including model usage and output quality.
PTM-related issues take significantly longer to resolve than other issues.
Variation in resolution times across different challenge categories.
Abstract
Pre-trained models (PTMs) have gained widespread popularity and achieved remarkable success across various fields, driven by their groundbreaking performance and easy accessibility through hosting providers. However, the challenges faced by downstream developers in reusing PTMs in software systems are less explored. To bridge this knowledge gap, we qualitatively created and analyzed a dataset of 840 PTM-related issue reports from 31 OSS GitHub projects. We systematically developed a comprehensive taxonomy of PTM-related challenges that developers face in downstream projects. Our study identifies seven key categories of challenges that downstream developers face in reusing PTMs, such as model usage, model performance, and output quality. We also compared our findings with existing taxonomies. Additionally, we conducted a resolution time analysis and, based on statistical tests, found…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
