Hybrid CNN and Dictionary-Based Models for Scene Recognition and Domain Adaptation
Guo-Sen Xie, Xu-Yao Zhang, Shuicheng Yan, Cheng-Lin Liu

TL;DR
This paper introduces a hybrid approach combining CNN features with dictionary-based models like MLR and CFV to enhance scene recognition and domain adaptation, achieving state-of-the-art results.
Contribution
It proposes novel combinations of CNN and dictionary-based features, including new representations MLR and CFV, for improved visual scene recognition and domain adaptation.
Findings
Hybrid features outperform individual models.
The approach achieves state-of-the-art accuracy.
Hybrid representations are complementary across different CNN architectures.
Abstract
Convolutional neural network (CNN) has achieved state-of-the-art performance in many different visual tasks. Learned from a large-scale training dataset, CNN features are much more discriminative and accurate than the hand-crafted features. Moreover, CNN features are also transferable among different domains. On the other hand, traditional dictionarybased features (such as BoW and SPM) contain much more local discriminative and structural information, which is implicitly embedded in the images. To further improve the performance, in this paper, we propose to combine CNN with dictionarybased models for scene recognition and visual domain adaptation. Specifically, based on the well-tuned CNN models (e.g., AlexNet and VGG Net), two dictionary-based representations are further constructed, namely mid-level local representation (MLR) and convolutional Fisher vector representation (CFV). In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods1x1 Convolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Grouped Convolution · Dropout · How do I speak to a person at Expedia?-/+/ · GoogLeNet · Dense Connections
