KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Xuanhan Wang, Jingkuan Song, Xiaojia Chen, Lechao Cheng, Lianli Gao,, Heng Tao Shen

TL;DR
This paper introduces KE-RCNN, a knowledge-embedded framework for part-level attribute parsing that leverages implicit and explicit knowledge to improve visual understanding of body parts.
Contribution
It proposes novel implicit and explicit knowledge modules integrated into RCNNs, enhancing attribute prediction by modeling part relations and prior knowledge.
Findings
Achieves around 3% AP improvement on Fashionpedia
Reaches about 4% accuracy gain on Kinetics-TPS
Demonstrates strong generalizability across benchmarks
Abstract
Part-level attribute parsing is a fundamental but challenging task, which requires the region-level visual understanding to provide explainable details of body parts. Most existing approaches address this problem by adding a regional convolutional neural network (RCNN) with an attribute prediction head to a two-stage detector, in which attributes of body parts are identified from local-wise part boxes. However, local-wise part boxes with limit visual clues (i.e., part appearance only) lead to unsatisfying parsing results, since attributes of body parts are highly dependent on comprehensive relations among them. In this article, we propose a Knowledge Embedded RCNN (KE-RCNN) to identify attributes by leveraging rich knowledges, including implicit knowledge (e.g., the attribute ``above-the-hip'' for a shirt requires visual/geometry relations of shirt-hip) and explicit knowledge (e.g., the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Handwritten Text Recognition Techniques · Generative Adversarial Networks and Image Synthesis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Connection · Convolution · HRNet
