A Machine Learning Approach To Prevent Malicious Calls Over Telephony Networks
Huichen Li, Xiaojun Xu, Chang Liu, Teng Ren, Kun Wu, Xuezhi Cao,, Weinan Zhang, Yong Yu, Dawn Song

TL;DR
This paper introduces a machine learning-based system to detect malicious telephony calls, achieving high accuracy and efficiency without relying on network infrastructure assumptions, by leveraging user-tagged data and extensive call logs.
Contribution
It presents the first infrastructure-agnostic machine learning approach for malicious call detection, utilizing a large-scale user-tagged dataset and novel feature design.
Findings
Up to 90% reduction in unblocked malicious calls
Precision over 99.99% on benign traffic
Efficient implementation with minimal latency
Abstract
Malicious calls, i.e., telephony spams and scams, have been a long-standing challenging issue that causes billions of dollars of annual financial loss worldwide. This work presents the first machine learning-based solution without relying on any particular assumptions on the underlying telephony network infrastructures. The main challenge of this decade-long problem is that it is unclear how to construct effective features without the access to the telephony networks' infrastructures. We solve this problem by combining several innovations. We first develop a TouchPal user interface on top of a mobile App to allow users tagging malicious calls. This allows us to maintain a large-scale call log database. We then conduct a measurement study over three months of call logs, including 9 billion records. We design 29 features based on the results, so that machine learning algorithms can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
