Mining Points of Interest via Address Embeddings: An Unsupervised Approach
Abhinav Ganesan, Anubhav Gupta, and Jose Mathew

TL;DR
This paper introduces an unsupervised method to extract polygon representations of points of interest from address data and embeddings, significantly improving PoI detection and accuracy using deep learning and map data integration.
Contribution
The work presents a novel unsupervised approach combining address embeddings and clustering to identify PoIs, enhanced by post-processing with OpenStreetMap data for better recall.
Findings
Identified 74.8% more PoIs than baseline algorithms.
Achieved median area recall of 70% after post-processing.
Improved F-score to 0.69 with map data integration.
Abstract
Digital maps are commonly used across the globe for exploring places that users are interested in, commonly referred to as points of interest (PoI). In online food delivery platforms, PoIs could represent any major private compounds where customers could order from such as hospitals, residential complexes, office complexes, educational institutes and hostels. In this work, we propose an end-to-end unsupervised system design for obtaining polygon representations of PoIs (PoI polygons) from address locations and address texts. We preprocess the address texts using locality names and generate embeddings for the address texts using a deep learning-based architecture, viz. RoBERTa, trained on our internal address dataset. The PoI candidates are identified by jointly clustering the anonymised customer phone GPS locations (obtained during address onboarding) and the embeddings of the address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Geographic Information Systems Studies · Web Data Mining and Analysis
MethodsAttention Is All You Need · Linear Layer · Attention Dropout · Multi-Head Attention · WordPiece · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Weight Decay · Residual Connection · Adam
