Unsupervised Network Pretraining via Encoding Human Design

Ming-Yu Liu; Arun Mallya; Oncel C. Tuzel; Xi Chen

arXiv:1502.05689·cs.CV·November 15, 2016

Unsupervised Network Pretraining via Encoding Human Design

Ming-Yu Liu, Arun Mallya, Oncel C. Tuzel, Xi Chen

PDF

TL;DR

This paper introduces a novel unsupervised pretraining method for deep neural networks in computer vision by teaching them to replicate hand-designed feature extraction processes, leading to improved classification performance.

Contribution

It presents a new pretraining approach that incorporates human-designed feature extraction processes into neural network training, enhancing visual object recognition.

Findings

01

Pretraining via feature replication improves classification accuracy.

02

Replicating HOG and region covariance features enhances model performance.

03

Method outperforms baseline approaches after finetuning.

Abstract

Over the years, computer vision researchers have spent an immense amount of effort on designing image features for the visual object recognition task. We propose to incorporate this valuable experience to guide the task of training deep neural networks. Our idea is to pretrain the network through the task of replicating the process of hand-designed feature extraction. By learning to replicate the process, the neural network integrates previous research knowledge and learns to model visual objects in a way similar to the hand-designed features. In the succeeding finetuning step, it further learns object-specific representations from labeled data and this boosts its classification power. We pretrain two convolutional neural networks where one replicates the process of histogram of oriented gradients feature extraction, and the other replicates the process of region covariance feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.