Optimizing Neural Network Hyperparameters with Gaussian Processes for   Dialog Act Classification

Franck Dernoncourt; Ji Young Lee

arXiv:1609.08703·cs.CL·September 29, 2016

Optimizing Neural Network Hyperparameters with Gaussian Processes for Dialog Act Classification

Franck Dernoncourt, Ji Young Lee

PDF

1 Repo

TL;DR

This paper demonstrates that using Gaussian process-based Bayesian optimization to tune neural network hyperparameters significantly improves dialog act classification performance and reduces tuning time compared to traditional search methods.

Contribution

The study applies Gaussian process-based Bayesian optimization to hyperparameter tuning in neural networks for dialog act classification, achieving better results faster.

Findings

01

Hyperparameter optimization with GPs improves model accuracy.

02

GP-based tuning reduces computational time by a factor of 4.

03

Enhanced performance over manual and random search methods.

Abstract

Systems based on artificial neural networks (ANNs) have achieved state-of-the-art results in many natural language processing tasks. Although ANNs do not require manually engineered features, ANNs have many hyperparameters to be optimized. The choice of hyperparameters significantly impacts models' performances. However, the ANN hyperparameters are typically chosen by manual, grid, or random search, which either requires expert experiences or is computationally expensive. Recent approaches based on Bayesian optimization using Gaussian processes (GPs) is a more systematic way to automatically pinpoint optimal or near-optimal machine learning hyperparameters. Using a previously published ANN model yielding state-of-the-art results for dialog act classification, we demonstrate that optimizing hyperparameters using GP further improves the results, and reduces the computational time by a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Franck-Dernoncourt/slt2016
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.