A Machine Learning Approach To Prevent Malicious Calls Over Telephony   Networks

Huichen Li; Xiaojun Xu; Chang Liu; Teng Ren; Kun Wu; Xuezhi Cao,; Weinan Zhang; Yong Yu; Dawn Song

arXiv:1804.02566·cs.CR·May 30, 2018

A Machine Learning Approach To Prevent Malicious Calls Over Telephony Networks

Huichen Li, Xiaojun Xu, Chang Liu, Teng Ren, Kun Wu, Xuezhi Cao,, Weinan Zhang, Yong Yu, Dawn Song

PDF

TL;DR

This paper introduces a machine learning-based system to detect malicious telephony calls, achieving high accuracy and efficiency without relying on network infrastructure assumptions, by leveraging user-tagged data and extensive call logs.

Contribution

It presents the first infrastructure-agnostic machine learning approach for malicious call detection, utilizing a large-scale user-tagged dataset and novel feature design.

Findings

01

Up to 90% reduction in unblocked malicious calls

02

Precision over 99.99% on benign traffic

03

Efficient implementation with minimal latency

Abstract

Malicious calls, i.e., telephony spams and scams, have been a long-standing challenging issue that causes billions of dollars of annual financial loss worldwide. This work presents the first machine learning-based solution without relying on any particular assumptions on the underlying telephony network infrastructures. The main challenge of this decade-long problem is that it is unclear how to construct effective features without the access to the telephony networks' infrastructures. We solve this problem by combining several innovations. We first develop a TouchPal user interface on top of a mobile App to allow users tagging malicious calls. This allows us to maintain a large-scale call log database. We then conduct a measurement study over three months of call logs, including 9 billion records. We design 29 features based on the results, so that machine learning algorithms can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.