Evaluating Link-Based Techniques for Detecting Fake Pharmacy Websites
Ahmed Abbasi, Siddharth Kaza, F. Mariam Zahedi

TL;DR
This paper evaluates link-based techniques for detecting fake pharmacy websites, demonstrating that class propagation algorithms utilizing inlink and outlink information at the site level achieve over 90% accuracy on a large-scale dataset.
Contribution
It compares multiple link-based detection algorithms on a large dataset, highlighting the effectiveness of dual class propagation methods at the site level.
Findings
QoC and QoL algorithms achieved over 90% accuracy.
Site-level analysis outperforms page-level analysis.
Algorithms using inlink and outlink information are more effective.
Abstract
Fake online pharmacies have become increasingly pervasive, constituting over 90% of online pharmacy websites. There is a need for fake website detection techniques capable of identifying fake online pharmacy websites with a high degree of accuracy. In this study, we compared several well-known link-based detection techniques on a large-scale test bed with the hyperlink graph encompassing over 80 million links between 15.5 million web pages, including 1.2 million known legitimate and fake pharmacy pages. We found that the QoC and QoL class propagation algorithms achieved an accuracy of over 90% on our dataset. The results revealed that algorithms that incorporate dual class propagation as well as inlink and outlink information, on page-level or site-level graphs, are better suited for detecting fake pharmacy websites. In addition, site-level analysis yielded significantly better results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Text Analysis Techniques
