# KlusTree: Clustering Answer Trees from Keyword Search on Graphs

**Authors:** Madhulika Mohanty, Maya Ramanath

arXiv: 1705.09808 · 2017-05-30

## TL;DR

KlusTree is a novel clustering method for answer trees from keyword graph queries that enhances user experience by providing better clustered results, revealing patterns, and aiding interpretation.

## Contribution

It introduces a language model-based clustering approach using JS divergence for answer trees, improving over existing isomorphism and tree-edit distance methods.

## Key findings

- KlusTree outperforms existing clustering methods in user evaluations.
- Enhanced clustering reveals interesting patterns in answer trees.
- Improved user experience and result interpretation.

## Abstract

Graph structured data on the web is now massive as well as diverse, ranging from social networks, web graphs to knowledge-bases. Effectively querying this graph structured data is non-trivial and has led to research in a variety of directions -- structured queries, keyword and natural language queries, automatic translation of these queries to structured queries, etc. We are concerned with a class of queries called relationship queries, which are usually expressed as a set of keywords (each keyword denoting a named entity). The results returned are a set of ranked trees, each of which denotes relationships among the various keywords. The result list could consist of hundreds of answers. The problem of keyword search on graphs has been explored for over a decade now, but an important aspect that is not as extensively studied is that of user experience. We propose KlusTree, which presents clustered results to the users instead of a list of all the results. In our approach, the result trees are represented using language models and are clustered using JS divergence as a distance measure. We compare KlusTree with the well-known approaches based on isomorphism and tree-edit distance based clustering. The user evaluations show that KlusTree outperforms the other two in providing better clustering, thereby enriching user experience, revealing interesting patterns and improving result interpretation by the user.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.09808/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1705.09808/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1705.09808/full.md

---
Source: https://tomesphere.com/paper/1705.09808