# Semantic classifier approach to document classification

**Authors:** Piotr Borkowski, Krzysztof Ciesielski, Mieczys{\l}aw A., K{\l}opotek

arXiv: 1701.04292 · 2017-01-17

## TL;DR

This paper introduces SEMCOM, a novel document classification method that effectively bridges the semantic gap between training and application data, outperforming traditional classifiers and ensembles.

## Contribution

The paper presents SEMCOM, a new approach combining categorization with classifiers to improve document classification accuracy across different datasets.

## Key findings

- SEMCOM outperforms classical classification methods.
- It effectively bridges the semantic gap in textual data.
- The approach enhances classification robustness across datasets.

## Abstract

In this paper we propose a new document classification method, bridging discrepancies (so-called semantic gap) between the training set and the application sets of textual data. We demonstrate its superiority over classical text classification approaches, including traditional classifier ensembles. The method consists in combining a document categorization technique with a single classifier or a classifier ensemble (SEMCOM algorithm - Committee with Semantic Categorizer).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1701.04292/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1701.04292/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1701.04292/full.md

---
Source: https://tomesphere.com/paper/1701.04292