WebGeoInfer: A Structure-Free and Multi-Stage Framework for Geolocation Inference of Devices Exposing Information
Huipeng Yang, Li Yang, Lichuan Ma, Lu Zhou, Junbo Jia, Anyuan Sang, Xinyue Wang

TL;DR
WebGeoInfer is a novel, structure-free multi-stage framework that accurately infers device locations from unstructured web data, enhancing cybersecurity by identifying exposed geographical information.
Contribution
It introduces a multi-stage, structure-free approach using clustering, search engine enhancement, and language models to extract geographical info from unstructured device web pages.
Findings
Inferred locations for 5,435 devices across 94 countries and 2,056 cities.
Achieved 96.96% accuracy at country level, 88.05% at city level, and 79.70% at street level.
Effectively bypassed structural limitations in unstructured web data.
Abstract
Remote management devices facilitate critical infrastructure monitoring for administrators but simultaneously increase asset exposure. Sensitive geographical information overlooked in exposed device management pages poses substantial security risks. Therefore, identifying devices that reveal location information due to administrator negligence is crucial for cybersecurity regulation. Despite the rich information exposed by web interfaces of remote management devices, automatically discovering geographical locations remains challenging due to unstructured formats, varying styles, and incomplete geographical details. This study introduces WebGeoInfer, a structure-free geolocation inference framework utilizing multi-stage information enhancement. WebGeoInfer clusters similar device web pages and analyzes inter-cluster differences to extract potential geographical information, bypassing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismology and Earthquake Studies
