DomainLynx: Leveraging Large Language Models for Enhanced Domain   Squatting Detection

Daiki Chiba; Hiroki Nakano; Takashi Koide

arXiv:2410.02095·cs.CR·February 17, 2025

DomainLynx: Leveraging Large Language Models for Enhanced Domain Squatting Detection

Daiki Chiba, Hiroki Nakano, Takashi Koide

PDF

Open Access

TL;DR

DomainLynx is an AI system that uses Large Language Models to detect novel and less prominent domain squatting techniques more accurately and efficiently than existing methods, enhancing Internet security.

Contribution

This paper introduces DomainLynx, a novel AI system leveraging LLMs for improved detection of sophisticated and emerging domain squatting tactics, especially for less prominent brands.

Findings

01

Achieved 94.7% accuracy on a curated dataset of squatting domains.

02

Detected 34,359 squatting domains in a month-long real-world test.

03

Outperformed baseline methods by 2.5 times in detection performance.

Abstract

Domain squatting poses a significant threat to Internet security, with attackers employing increasingly sophisticated techniques. This study introduces DomainLynx, an innovative compound AI system leveraging Large Language Models (LLMs) for enhanced domain squatting detection. Unlike existing methods focusing on predefined patterns for top-ranked domains, DomainLynx excels in identifying novel squatting techniques and protecting less prominent brands. The system's architecture integrates advanced data processing, intelligent domain pairing, and LLM-powered threat assessment. Crucially, DomainLynx incorporates specialized components that mitigate LLM hallucinations, ensuring reliable and context-aware detection. This approach enables efficient analysis of vast security data from diverse sources, including Certificate Transparency logs, Passive DNS records, and zone files. Evaluated on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling