HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling
Hajar Sakai, Sarah S. Lam

TL;DR
HAMLET is a novel graph-driven, multilingual healthcare topic modeling approach that refines LLM-generated topics using neural-enhanced semantic fusion, GNNs, and cross-lingual embeddings to improve coherence and interpretability.
Contribution
This paper introduces HAMLET, a new architecture combining LLMs, GNNs, and semantic fusion for improved cross-lingual healthcare topic modeling.
Findings
HAMLET outperforms baseline models on healthcare datasets.
Effective cross-lingual topic coherence demonstrated in English and French datasets.
Refined embeddings lead to more interpretable and coherent topics.
Abstract
Traditional topic models often struggle with contextual nuances and fail to adequately handle polysemy and rare words. This limitation typically results in topics that lack coherence and quality. Large Language Models (LLMs) can mitigate this issue by generating an initial set of topics. However, these raw topics frequently lack refinement and representativeness, which leads to redundancy without lexical similarity and reduced interpretability. This paper introduces HAMLET, a graph-driven architecture for cross-lingual healthcare topic modeling that uses LLMs. The proposed approach leverages neural-enhanced semantic fusion to refine the embeddings of topics generated by the LLM. Instead of relying solely on statistical co-occurrence or human interpretation to extract topics from a document corpus, this method introduces a topic embedding refinement that uses Bidirectional Encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Computational and Text Analysis Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Attention Dropout · Softmax · Residual Connection · WordPiece · Linear Layer
