# Identification of taxon through classification with partial reject   options

**Authors:** M{\aa}ns Karlsson, Ola H\"ossjer

arXiv: 1906.04538 · 2021-09-17

## TL;DR

This paper introduces a Bayesian classification method for identifying taxa using mixed trait data, allowing for uncertainty and partial rejection, with applications demonstrated on ornithological datasets.

## Contribution

It presents a novel Bayesian discriminant analysis with partial reject options and outlier safeguarding, applicable to mixed continuous and ordinal traits.

## Key findings

- Effective classification with uncertainty handling
- Outlier detection using Bayesian p-value analogue
- Validated on original ornithological datasets

## Abstract

Identification of taxa can significantly be assisted by statistical classification based on trait measurements in two major ways; either individually or by phylogenetic (clustering) methods. In this paper we present a general Bayesian approach for classifying species individually based on measurements of a mixture of continuous and ordinal traits as well as any type of covariates. It is assumed that the trait vector is derived from a latent variable with a multivariate Gaussian distribution. Decision rules based on supervised learning are presented that estimate model parameters through blockwise Gibbs sampling. These decision regions allow for uncertainty (partial rejection), so that not necessarily one specific category (taxon) is output when new subjects are classified, but rather a set of categories including the most probable taxa. This type of discriminant analysis employs reward functions with a set-valued input argument, so that an optimal Bayes classifier can be defined. We also present a way of safeguarding against outlying new observations, using an analogue of a $p$-value within our Bayesian setting. Our method is illustrated on an original ornithological data set of birds. We also incorporate model selection through cross-validation, examplified on another original data set of birds.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.04538/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/1906.04538/full.md

## References

86 references — full list in the complete paper: https://tomesphere.com/paper/1906.04538/full.md

---
Source: https://tomesphere.com/paper/1906.04538