TagRec: Automated Tagging of Questions with Hierarchical Learning Taxonomy
Venktesh V, Mukesh Mohania, Vikram Goyal

TL;DR
This paper introduces TagRec, a semantic similarity-based approach using transformer models for automatic hierarchical tagging of questions, effectively handling unseen labels and improving over traditional classification methods.
Contribution
The paper presents a novel similarity-based retrieval method that leverages question-answer pairs and transformer embeddings to improve hierarchical question tagging.
Findings
Outperforms state-of-the-art methods by 6% in Recall@k.
Handles unseen labels without re-training.
Utilizes question-answer augmentation for richer semantic representation.
Abstract
Online educational platforms organize academic questions based on a hierarchical learning taxonomy (subject-chapter-topic). Automatically tagging new questions with existing taxonomy will help organize these questions into different classes of hierarchical taxonomy so that they can be searched based on the facets like chapter. This task can be formulated as a flat multi-class classification problem. Usually, flat classification based methods ignore the semantic relatedness between the terms in the hierarchical taxonomy and the questions. Some traditional methods also suffer from the class imbalance issues as they consider only the leaf nodes ignoring the hierarchy. Hence, we formulate the problem as a similarity-based retrieval task where we optimize the semantic relatedness between the taxonomy and the questions. We demonstrate that our method helps to handle the unseen labels and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Topic Modeling · Advanced Text Analysis Techniques
