HOME-KGQA: A Benchmark Dataset for Multimodal Knowledge Graph Question Answering on Household Daily Activities

Shusaku Egami; Aoi Ohta; Tomoki Tsujimura; Masaki Asada; Tatsuya Ishigaki; Ken Fukuda; Masahiro Hamasaki; Hiroya Takamura

arXiv:2605.09348·cs.CL·May 12, 2026

HOME-KGQA: A Benchmark Dataset for Multimodal Knowledge Graph Question Answering on Household Daily Activities

Shusaku Egami, Aoi Ohta, Tomoki Tsujimura, Masaki Asada, Tatsuya Ishigaki, Ken Fukuda, Masahiro Hamasaki, Hiroya Takamura

PDF

1 Repo

TL;DR

HOME-KGQA introduces a challenging multimodal knowledge graph question answering benchmark focused on household activities, emphasizing multi-hop, spatiotemporal, and multimodal reasoning to advance real-world AI applications.

Contribution

It presents a novel, complex KGQA dataset with multi-level reasoning and multimodal grounding, addressing limitations of existing benchmarks for real-world scenarios.

Findings

01

LLM-based KGQA methods perform poorly on HOME-KGQA compared to existing datasets.

02

HOME-KGQA includes multi-hop, spatiotemporal, and multimodal questions.

03

The dataset exposes significant challenges for deploying KGQA in real-world environments.

Abstract

Large Language Models (LLMs) provide flexible natural language processing capabilities, while knowledge graphs (KGs) offer explicit and structured knowledge. Integrating these two in a complementary manner enables the development of reliable and verifiable AI systems. In particular, knowledge graph question answering (KGQA) has attracted attention as a means to reduce LLM hallucinations and to leverage knowledge beyond the training data. However, existing KGQA benchmark datasets are biased toward encyclopedic knowledge, limited to a single modality, and lack fine-grained spatiotemporal data, which limits their applicability to real-world scenarios targeted by Embodied AI. We introduce HOME-KGQA, a novel KGQA benchmark dataset built on a multimodal KG of daily household activities. HOME-KGQA consists of complex, multi-hop natural language questions paired with graph database query…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aistairc/home-kgqa
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.