Hidden Tree Markov Networks: Deep and Wide Learning for Structured Data

Davide Bacciu

arXiv:1711.07784·cs.LG·November 22, 2017

TL;DR

The paper proposes the Hidden Tree Markov Network, a hybrid deep and wide architecture that combines generative models for trees with discriminative neural layers and, in the reported experiments, outperforms syntactic and generative tree kernels on structured data.

Contribution

It introduces a modular, hybrid model that fuses generative tree models with neural networks, enabling deep and wide learning for structured data.

Findings

01. Outperforms state-of-the-art syntactic kernels

02. Outperforms generative kernels based on the same probabilistic model

03. Demonstrates effectiveness on structured data tasks

Abstract

The paper introduces the Hidden Tree Markov Network (HTN), a neuro-probabilistic hybrid fusing the representation power of generative models for trees with the incremental and discriminative learning capabilities of neural networks. It puts forward a modular architecture in which multiple generative models of limited complexity are trained to learn structural feature detectors, whose outputs are then combined and integrated by neural layers at a later stage. In this respect, the model is both deep, thanks to the unfolding of the generative models on the input structures, and wide, given the potentially large number of generative modules that can be trained in parallel. Experimental results show that the proposed approach can outperform state-of-the-art syntactic kernels as well as generative kernels built on the same probabilistic model as the HTN.
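To make the modular layout in the abstract concrete, the sketch below wires several small generative tree models into one neural output layer. It is an illustration under stated assumptions, not the paper's implementation: each module is a simple top-down hidden tree Markov model used as a stand-in for the paper's generative model, each module's output is taken to be the tree log-likelihood, the combining layer is a single softmax layer, and all parameters are random rather than trained. The names `random_module`, `upward`, `log_likelihood`, and `htn_forward` are hypothetical.

```python
import numpy as np

# A tree is a pair (label, children); labels are integers in [0, n_labels).

def random_module(n_states, n_labels, rng):
    """Random parameters for one small top-down hidden tree Markov model.

    Assumption: this is a stand-in generative module, not the paper's exact model.
    """
    prior = rng.dirichlet(np.ones(n_states))                 # P(root state)
    trans = rng.dirichlet(np.ones(n_states), size=n_states)  # trans[i, j] = P(child=j | parent=i)
    emit = rng.dirichlet(np.ones(n_labels), size=n_states)   # emit[i, y] = P(label=y | state=i)
    return prior, trans, emit

def upward(tree, trans, emit):
    """beta[i] = P(labels in the subtree | state of its root = i)."""
    label, children = tree
    beta = emit[:, label].copy()
    for child in children:
        # Marginalise over the child's hidden state, given the parent's state.
        beta *= trans @ upward(child, trans, emit)
    return beta

def log_likelihood(tree, module):
    """Log-probability of the labelled tree under one generative module."""
    prior, trans, emit = module
    return float(np.log(prior @ upward(tree, trans, emit)))

def htn_forward(tree, modules, W, b):
    """Wide part: one feature per module. Neural part: a single softmax layer."""
    feats = np.array([log_likelihood(tree, m) for m in modules])
    logits = W @ feats + b
    z = np.exp(logits - logits.max())  # numerically stable softmax
    return z / z.sum()

rng = np.random.default_rng(0)
modules = [random_module(n_states=3, n_labels=3, rng=rng) for _ in range(4)]
W = rng.normal(size=(2, len(modules)))  # two output classes (assumed)
b = np.zeros(2)
tree = (0, [(1, []), (2, [(1, [])])])
probs = htn_forward(tree, modules, W, b)
print(probs)
```

The recursion in `upward` is where the depth comes from, since it unfolds each module over the input tree. The list of independent modules is the width: each module's feature can be computed in parallel before the neural layer combines them.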
