# Visualizing and Describing Fine-grained Categories as Textures

**Authors:** Tsung-Yu Lin, Mikayla Timm, Chenyun Wu, Subhransu Maji

arXiv: 1907.05288 · 2019-07-12

## TL;DR

This paper explores how fine-grained visual categories can be characterized by their textures through visualization and automatic description, enhancing understanding of subtle differences in species classification.

## Contribution

It introduces a method to visualize and describe categories in FGVC using texture-based deep networks and a new dataset for texture captioning.

## Key findings

- Texture-based models highlight discriminative features.
- Automatic texture descriptions provide language explanations.
- Visualizations improve interpretability of fine-grained categories.

## Abstract

We analyze how categories from recent FGVC challenges can be described by their textural content. The motivation is that subtle differences between species of birds or butterflies can often be described in terms of the texture associated with them and that several top-performing networks are inspired by texture-based representations. These representations are characterized by orderless pooling of second-order filter activations such as in bilinear CNNs and the winner of the iNaturalist 2018 challenge. Concretely, for each category we (i) visualize the "maximal images" by obtaining inputs x that maximize the probability of the particular class according to a texture-based deep network, and (ii) automatically describe the maximal images using a set of texture attributes. The models for texture captioning were trained on our ongoing efforts on collecting a dataset of describable textures building on the DTD dataset. These visualizations indicate what aspects of the texture is most discriminative for each category while the descriptions provide a language-based explanation of the same.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.05288/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1907.05288/full.md

## References

14 references — full list in the complete paper: https://tomesphere.com/paper/1907.05288/full.md

---
Source: https://tomesphere.com/paper/1907.05288