# Identifying Unclear Questions in Community Question Answering Websites

**Authors:** Jan Trienes, Krisztian Balog

arXiv: 1901.06168 · 2019-05-21

## TL;DR

This paper introduces a novel approach to classify questions on community Q&A sites as clear or unclear using similar questions, aiding in better question formulation and improving user experience.

## Contribution

It presents the first dataset and classification method based on similar questions for identifying unclear questions in community Q&A platforms.

## Key findings

- Similar questions approach outperforms baseline classifiers
- Supports development of user interfaces for clearer question formulation
- Provides a new dataset for question clarity classification

## Abstract

Thousands of complex natural language questions are submitted to community question answering websites on a daily basis, rendering them as one of the most important information sources these days. However, oftentimes submitted questions are unclear and cannot be answered without further clarification questions by expert community members. This study is the first to investigate the complex task of classifying a question as clear or unclear, i.e., if it requires further clarification. We construct a novel dataset and propose a classification approach that is based on the notion of similar questions. This approach is compared to state-of-the-art text classification baselines. Our main finding is that the similar questions approach is a viable alternative that can be used as a stepping stone towards the development of supportive user interfaces for question formulation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.06168/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1901.06168/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/1901.06168/full.md

---
Source: https://tomesphere.com/paper/1901.06168