AOGNets: Compositional Grammatical Architectures for Deep Learning
Xilai Li, Xi Song, Tianfu Wu

TL;DR
AOGNets introduce a novel deep compositional grammatical architecture that combines grammar models with deep neural networks, enhancing feature learning, interpretability, and robustness across multiple vision benchmarks.
Contribution
This paper proposes AOGNets, a new neural architecture integrating AND-OR Grammar with deep learning, offering improved performance and interpretability over existing models.
Findings
Outperforms ResNet and variants on CIFAR and ImageNet benchmarks.
Achieves the best interpretability scores in network dissection.
Shows enhanced adversarial robustness and object detection performance.
Abstract
Neural architectures are the foundation for improving performance of deep neural networks (DNNs). This paper presents deep compositional grammatical architectures which harness the best of two worlds: grammar models and DNNs. The proposed architectures integrate compositionality and reconfigurability of the former and the capability of learning rich features of the latter in a principled way. We utilize AND-OR Grammar (AOG) as network generator in this paper and call the resulting networks AOGNets. An AOGNet consists of a number of stages each of which is composed of a number of AOG building blocks. An AOG building block splits its input feature map into N groups along feature channels and then treat it as a sentence of N words. It then jointly realizes a phrase structure grammar and a dependency grammar in bottom-up parsing the "sentence" for better feature exploration and reuse. It…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsRegion Proposal Network · Interpretability · Sigmoid Activation · RoIAlign · Average Pooling · ResNeXt Block · Concatenated Skip Connection · Squeeze-and-Excitation Block · Mask R-CNN · Dense Block
