A matter of words: NLP for quality evaluation of Wikipedia medical articles
Vittoria Cozza, Marinella Petrocchi, Angelo Spognardi

TL;DR
This paper enhances Wikipedia medical article quality evaluation by incorporating domain-specific features and NLP techniques, leading to improved classification accuracy and actionable insights for article improvement.
Contribution
It introduces a domain-oriented, NLP-based evaluation model that leverages biomedical vocabulary and article features to better assess and suggest improvements for medical Wikipedia articles.
Findings
Improved classification accuracy for medical articles.
Domain features outperform generic features in quality assessment.
Model provides actionable suggestions for article enhancement.
Abstract
Automatic quality evaluation of Web information is a task with many fields of applications and of great relevance, especially in critical domains like the medical one. We move from the intuition that the quality of content of medical Web documents is affected by features related with the specific domain. First, the usage of a specific vocabulary (Domain Informativeness); then, the adoption of specific codes (like those used in the infoboxes of Wikipedia articles) and the type of document (e.g., historical and technical ones). In this paper, we propose to leverage specific domain features to improve the results of the evaluation of Wikipedia medical articles. In particular, we evaluate the articles adopting an "actionable" model, whose features are related to the content of the articles, so that the model can also directly suggest strategies for improving a given article quality. We rely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration · Natural Language Processing Techniques · Cancer-related gene regulation
