HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent
Weijie Xu, Zicheng Huang, Wenxiang Hu, Xi Fang, Rajesh Kumar Cherukuri, Naumaan Nayyar, Lorenzo Malandri, Srinivasan H. Sengamedu

TL;DR
This paper introduces HR-MultiWOZ, a novel, fully-labeled dataset of 550 HR-related conversations designed to facilitate NLP research and development of LLM agents in the HR domain, addressing privacy and data scarcity issues.
Contribution
The paper presents the first open-source HR conversation dataset, a detailed and adaptable LLM-based data generation pipeline that requires minimal human effort, and a comprehensive data analysis and evaluation.
Findings
First HR domain-specific conversation dataset
Data generation pipeline is efficient and adaptable
Human evaluations confirm data quality
Abstract
Recent advancements in Large Language Models (LLMs) have been reshaping Natural Language Processing (NLP) tasks in several domains. Their use in the field of Human Resources (HR) still has room for expansion and could benefit several time-consuming tasks. Time-off submissions, medical claims filing, and access requests are noteworthy examples, but they are by no means the only ones. However, these developments must grapple with the pivotal challenge of constructing a high-quality training dataset. On one hand, most conversation datasets address problems for customers, not employees. On the other hand, gathering conversations with HR could raise privacy concerns. To address this, we introduce HR-MultiWOZ, a fully-labeled dataset of 550 conversations spanning 10 HR domains to evaluate LLM agents. Our work has the following contributions: (1) It is the…
Taxonomy
Topics: AI and HR Technologies · Topic Modeling
