TL;DR
This paper introduces ReproDroid, a framework for fair comparison of Android taint analysis tools, and evaluates six prominent tools on a standardized benchmark to assess their accuracy and feature claims.
Contribution
ReproDroid enables precise, unbiased evaluation of Android taint analysis tools and provides an improved DroidBench test suite for benchmarking.
Findings
Four tools violate some promises concerning features and accuracy
ReproDroid supports automatic application of tools to benchmarks and ground truth inference
Evaluation on equal grounds reveals varying tool performance
Abstract
In recent years, researchers have developed a number of tools to conduct taint analysis of Android applications. While all the respective papers aim at providing a thorough empirical evaluation, comparability is hindered by varying or unclear evaluation targets. Sometimes, the apps used for evaluation are not precisely described. In other cases, authors use an established benchmark but cover it only partially. In yet other cases, the evaluations differ in terms of the data leaks searched for, or lack a ground truth to compare against. All those limitations make it impossible to truly compare the tools based on those published evaluations. We thus present ReproDroid, a framework allowing the accurate comparison of Android taint analysis tools. ReproDroid supports researchers in inferring the ground truth for data leaks in apps, in automatically applying tools to benchmarks, and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
