Deep Photometric Stereo Network with Multi-Scale Feature Aggregation

Sensors ◽  
2020 ◽  
Vol 20 (21) ◽  
pp. 6261
Author(s):  
Chanki Yu ◽  
Sang Wook Lee

We present photometric stereo algorithms robust to non-Lambertian reflection, based on a convolutional neural network that estimates the surface normals of objects with complex geometry and surface reflectance from an arbitrary number of input images. The images are taken from the same viewpoint under different directional illumination conditions. The proposed method focuses on surface normal estimation: multi-scale feature aggregation is introduced to obtain more accurate surface normals, and max pooling is adopted to obtain an intermediate order-agnostic representation of the unordered image set. The proposed multi-scale aggregation scheme, based on feature concatenation, is easily incorporated into existing photometric stereo network architectures. Our experiments on the DiLiGenT photometric stereo benchmark, which consists of ten real objects, demonstrate that both our calibrated and uncalibrated photometric stereo approaches improve upon their baseline methods; in particular, our uncalibrated photometric stereo outperforms the state-of-the-art method. To our knowledge, this is the first work to consider multi-scale feature aggregation in photometric stereo, and we show that the proposed multi-scale fusion scheme estimates surface normals accurately and is beneficial to performance.
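The two key ideas of the abstract, multi-scale feature aggregation by channel concatenation and an order-agnostic representation via max pooling over the image set, can be sketched as follows. This is a minimal illustrative sketch, not the authors' actual architecture: the encoder layout, channel counts, and names such as SharedEncoder and aggregate are assumptions made for the example.

```python
import torch
import torch.nn as nn

class SharedEncoder(nn.Module):
    """Shared-weight per-image feature extractor (hypothetical layout)."""
    def __init__(self, in_ch=6, feat_ch=32):
        super().__init__()
        # fine-scale features
        self.conv1 = nn.Conv2d(in_ch, feat_ch, 3, padding=1)
        # coarser-scale features via a dilated conv (larger receptive field)
        self.conv2 = nn.Conv2d(feat_ch, feat_ch, 3, padding=2, dilation=2)

    def forward(self, x):
        f1 = torch.relu(self.conv1(x))
        f2 = torch.relu(self.conv2(f1))
        # multi-scale aggregation by feature concatenation along channels
        return torch.cat([f1, f2], dim=1)

def aggregate(encoder, images):
    """Max-pool features over an arbitrary, unordered set of images.

    images: list of (B, C, H, W) tensors, one per light direction.
    Element-wise max over the set axis is permutation-invariant, giving
    the order-agnostic intermediate representation the abstract describes.
    """
    feats = [encoder(img) for img in images]
    return torch.stack(feats, dim=0).max(dim=0).values
```

Because max is commutative, feeding the same images in any order yields an identical aggregated feature map, which is what lets the network accept an arbitrary number of inputs.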


Sensors ◽  
2021 ◽  
Vol 21 (9) ◽  
pp. 3270
Author(s):  
Yong Liao ◽  
Qiong Liu

The main challenges of semantic segmentation in vehicle-mounted scenes are object scale variation and the trade-off between model accuracy and efficiency. Lightweight backbone networks for semantic segmentation usually extract single-scale features layer by layer with a fixed receptive field, and most modern real-time semantic segmentation networks heavily compromise spatial details when encoding semantics, sacrificing accuracy for speed. Many improvement strategies adopt dilated convolution or add a sub-network, which introduces either intensive computation or redundant parameters. We propose a multi-level and multi-scale feature aggregation network (MMFANet). A spatial pyramid module is designed by cascading dilated convolutions with different receptive fields to extract multi-scale features layer by layer. Subsequently, a lightweight backbone network is built by reducing the feature channel capacity of the module. To improve accuracy, we design two additional modules that separately capture spatial details and high-level semantics from the backbone network without significantly increasing computation cost. Comprehensive experimental results show that our model achieves 79.3% mIoU on the Cityscapes test set at a speed of 58.5 FPS, which is more accurate than SwiftNet (75.5% mIoU), while using at least 53.38% fewer parameters than other state-of-the-art models.
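The spatial pyramid module described above, cascaded dilated convolutions with different receptive fields whose outputs are combined into multi-scale features, can be sketched as follows. This is a hedged sketch under assumed channel counts and dilation rates; the class name CascadedDilatedPyramid and the 1x1 fusion layer are illustrative choices, not details from the paper.

```python
import torch
import torch.nn as nn

class CascadedDilatedPyramid(nn.Module):
    """Sketch of a spatial pyramid of cascaded dilated 3x3 convolutions.

    Each stage feeds the next, so the effective receptive field grows
    stage by stage; concatenating all stage outputs yields multi-scale
    features from a single lightweight path.
    """
    def __init__(self, ch=16, dilations=(1, 2, 4)):
        super().__init__()
        # padding == dilation keeps the spatial resolution unchanged
        self.stages = nn.ModuleList(
            nn.Conv2d(ch, ch, 3, padding=d, dilation=d) for d in dilations
        )
        # 1x1 conv fuses the concatenated scales back to ch channels,
        # keeping the module's channel capacity (and parameter count) low
        self.fuse = nn.Conv2d(ch * len(dilations), ch, 1)

    def forward(self, x):
        outs = []
        for conv in self.stages:
            x = torch.relu(conv(x))  # cascaded: output of one stage feeds the next
            outs.append(x)
        return self.fuse(torch.cat(outs, dim=1))
```

Cascading (rather than applying the dilated branches in parallel) reuses intermediate features, which is one way such a module can stay lightweight while still covering multiple receptive fields.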

