High-Resolution Representations for Labeling Pixels and Regions
Ke Sun, Yang Zhao, Borui Jiang, Tianheng Cheng, Bin Xiao, Dong Liu,, Yadong Mu, Xinggang Wang, Wenyu Liu, Jingdong Wang

TL;DR
This paper enhances high-resolution neural network architectures by aggregating multi-resolution features, leading to improved performance across various vision tasks like segmentation, landmark detection, and object detection.
Contribution
The paper introduces a simple modification to HRNet that aggregates features from all resolutions, significantly improving performance on multiple vision benchmarks.
Findings
Achieved top results in semantic segmentation on Cityscapes, LIP, and PASCAL Context.
Outperformed existing models in facial landmark detection on multiple datasets.
Surpassed state-of-the-art in COCO object detection with the proposed multi-resolution approach.
Abstract
High-resolution representation learning plays an essential role in many vision problems, e.g., pose estimation and semantic segmentation. The high-resolution network (HRNet)~\cite{SunXLW19}, recently developed for human pose estimation, maintains high-resolution representations through the whole process by connecting high-to-low resolution convolutions in \emph{parallel} and produces strong high-resolution representations by repeatedly conducting fusions across parallel convolutions. In this paper, we conduct a further study on high-resolution representations by introducing a simple yet effective modification and apply it to a wide range of vision tasks. We augment the high-resolution representation by aggregating the (upsampled) representations from all the parallel convolutions rather than only the representation from the high-resolution convolution as done in~\cite{SunXLW19}. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Image Processing Techniques · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques
Methods14 Ways To Contact To Someone At JetBlue Airlines USA™: A Step-by-Step Guide · Region Proposal Network · Softmax · RoIPool · Faster R-CNN · Convolution
