Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation

Bruno Artacho; Andreas Savakis

doi:10.3390/s19245361

Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation

Sensors ◽

10.3390/s19245361 ◽

2019 ◽

Vol 19 (24) ◽

pp. 5361 ◽

Cited By ~ 6

Author(s):

Bruno Artacho ◽

Andreas Savakis

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

State Of The Art ◽

Semantic Segmentation ◽

Training Time ◽

Network Parameters ◽

Spatial Pooling ◽

Memory Footprint ◽

Accuracy Increase ◽

Spatial Pyramid

We propose a new efficient architecture for semantic segmentation, based on a “Waterfall” Atrous Spatial Pooling architecture, that achieves a considerable accuracy increase while decreasing the number of network parameters and memory footprint. The proposed Waterfall architecture leverages the efficiency of progressive filtering in the cascade architecture while maintaining multiscale fields-of-view comparable to spatial pyramid configurations. Additionally, our method does not rely on a postprocessing stage with Conditional Random Fields, which further reduces complexity and required training time. We demonstrate that the Waterfall approach with a ResNet backbone is a robust and efficient architecture for semantic segmentation obtaining state-of-the-art results with significant reduction in the number of parameters for the Pascal VOC dataset and the Cityscapes dataset.

Download Full-text

Weakly Supervised Conditional Random Fields Model for Semantic Segmentation with Image Patches

Applied Sciences ◽

10.3390/app10051679 ◽

2020 ◽

Vol 10 (5) ◽

pp. 1679

Author(s):

Xinying Xu ◽

Yujing Xue ◽

Xiaoxia Han ◽

Zhe Zhang ◽

Jun Xie ◽

...

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Semantic Category ◽

Semantic Segmentation ◽

Potential Energy Function ◽

Semantic Class ◽

Image Patches ◽

Benchmark Datasets ◽

Weakly Supervised ◽

Class Labels

Image semantic segmentation (ISS) is used to segment an image into regions with differently labeled semantic category. Most of the existing ISS methods are based on fully supervised learning, which requires pixel-level labeling for training the model. As a result, it is often very time-consuming and labor-intensive, yet still subject to manual errors and subjective inconsistency. To tackle such difficulties, a weakly supervised ISS approach is proposed, in which the challenging problem of label inference from image-level to pixel-level will be particularly addressed, using image patches and conditional random fields (CRF). An improved simple linear iterative cluster (SLIC) algorithm is employed to extract superpixels. for image segmentation. Specifically, it generates various numbers of superpixels according to different images, which can be used to guide the process of image patch extraction based on the image-level labeled information. Based on the extracted image patches, the CRF model is constructed for inferring semantic class labels, which uses the potential energy function to map from the image-level to pixel-level image labels. Finally, patch based CRF (PBCRF) model is used to accomplish the weakly supervised ISS. Experiments conducted on two publicly available benchmark datasets, MSRC and PASCAL VOC 2012, have demonstrated that our proposed algorithm can yield very promising results compared to quite a few state-of-the-art ISS methods, including some deep learning-based models.

Download Full-text

Adaptive Context Encoding Module for Semantic Segmentation

Electronic Imaging ◽

10.2352/issn.2470-1173.2020.10.ipas-027 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 27-1-27-7

Author(s):

Congcong Wang ◽

Faouzi Alaya Cheikh ◽

Azeddine Beghdadi ◽

Ole Jakob Elle

Keyword(s):

Neural Networks ◽

State Of The Art ◽

Experimental Studies ◽

Semantic Segmentation ◽

Multiple Scale ◽

Context Information ◽

Convolution Operation ◽

Sampling Locations ◽

Spatial Pyramid Pooling ◽

Spatial Pyramid

The object sizes in images are diverse, therefore, capturing multiple scale context information is essential for semantic segmentation. Existing context aggregation methods such as pyramid pooling module (PPM) and atrous spatial pyramid pooling (ASPP) employ different pooling size or atrous rate, such that multiple scale information is captured. However, the pooling sizes and atrous rates are chosen empirically. Rethinking of ASPP leads to our observation that learnable sampling locations of the convolution operation can endow the network learnable fieldof- view, thus the ability of capturing object context information adaptively. Following this observation, in this paper, we propose an adaptive context encoding (ACE) module based on deformable convolution operation where sampling locations of the convolution operation are learnable. Our ACE module can be embedded into other Convolutional Neural Networks (CNNs) easily for context aggregation. The effectiveness of the proposed module is demonstrated on Pascal-Context and ADE20K datasets. Although our proposed ACE only consists of three deformable convolution blocks, it outperforms PPM and ASPP in terms of mean Intersection of Union (mIoU) on both datasets. All the experimental studies confirm that our proposed module is effective compared to the state-of-the-art methods.

Download Full-text

Guiding Attention in Sequence-to-Sequence Models for Dialogue Act Prediction

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6259 ◽

2020 ◽

Vol 34 (05) ◽

pp. 7594-7601

Author(s):

Pierre Colombo ◽

Emile Chapuis ◽

Matteo Manica ◽

Emmanuel Vignon ◽

Giovanna Varni ◽

...

Keyword(s):

Machine Translation ◽

Random Fields ◽

Conditional Random Fields ◽

State Of The Art ◽

The State ◽

Attention Mechanism ◽

Accuracy Score ◽

Beam Search ◽

Conversational Agents ◽

Neural Machine Translation

The task of predicting dialog acts (DA) based on conversational dialog is a key component in the development of conversational agents. Accurately predicting DAs requires a precise modeling of both the conversation and the global tag dependencies. We leverage seq2seq approaches widely adopted in Neural Machine Translation (NMT) to improve the modelling of tag sequentiality. Seq2seq models are known to learn complex global dependencies while currently proposed approaches using linear conditional random fields (CRF) only model local tag dependencies. In this work, we introduce a seq2seq model tailored for DA classification using: a hierarchical encoder, a novel guided attention mechanism and beam search applied to both training and inference. Compared to the state of the art our model does not require handcrafted features and is trained end-to-end. Furthermore, the proposed approach achieves an unmatched accuracy score of 85% on SwDA, and state-of-the-art accuracy score of 91.6% on MRDA.

Download Full-text

Scale-Adaptive Conditional Random Fields for Semantic Segmentation

KIISE Transactions on Computing Practices ◽

10.5626/ktcp.2021.27.12.574 ◽

2021 ◽

Vol 27 (12) ◽

pp. 574-577

Author(s):

Jungbeom Lee ◽

Sungroh Yoon

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Semantic Segmentation

Download Full-text

Image Semantic Segmentation Based on Multi-Scale Feature Extraction and Fully Connected Conditional Random Fields

Laser & Optoelectronics Progress ◽

10.3788/lop56.131007 ◽

2019 ◽

Vol 56 (13) ◽

pp. 131007

Author(s):

董永峰 Yongfeng Dong ◽

杨雨訢 Yuxin Yang ◽

王利琴 Liqin Wang

Keyword(s):

Feature Extraction ◽

Random Fields ◽

Conditional Random Fields ◽

Semantic Segmentation ◽

Scale Feature ◽

Multi Scale ◽

Fully Connected

Download Full-text

Improved semantic segmentation for robotic applications with hierarchical conditional random fields

2017 IEEE International Conference on Robotics and Automation (ICRA) ◽

10.1109/icra.2017.7989617 ◽

2017 ◽

Cited By ~ 3

Author(s):

Benjamin J. Meyer ◽

Tom Drummond

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Semantic Segmentation ◽

Robotic Applications

Download Full-text

Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation: Combining Probabilistic Graphical Models with Deep Learning for Structured Prediction

IEEE Signal Processing Magazine ◽

10.1109/msp.2017.2762355 ◽

2018 ◽

Vol 35 (1) ◽

pp. 37-52 ◽

Cited By ~ 36

Author(s):

Anurag Arnab ◽

Shuai Zheng ◽

Sadeep Jayasumana ◽

Bernardino Romera-Paredes ◽

Mans Larsson ◽

...

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Graphical Models ◽

Random Fields ◽

Deep Neural Networks ◽

Conditional Random Fields ◽

Probabilistic Graphical Models ◽

Semantic Segmentation ◽

Structured Prediction

Download Full-text

Sequeval: An Offline Evaluation Framework for Sequence-Based Recommender Systems

Information ◽

10.3390/info10050174 ◽

2019 ◽

Vol 10 (5) ◽

pp. 174 ◽

Cited By ~ 2

Author(s):

Diego Monti ◽

Enrico Palumbo ◽

Giuseppe Rizzo ◽

Maurizio Morisio

Keyword(s):

Recommender Systems ◽

Random Fields ◽

Conditional Random Fields ◽

State Of The Art ◽

Focal Point ◽

Objective Evaluation ◽

Lessons Learned ◽

Evaluation Framework ◽

Definition Of ◽

The Right

Recommender systems have gained a lot of popularity due to their large adoption in various industries such as entertainment and tourism. Numerous research efforts have focused on formulating and advancing state-of-the-art of systems that recommend the right set of items to the right person. However, these recommender systems are hard to compare since the published evaluation results are computed on diverse datasets and obtained using different methodologies. In this paper, we researched and prototyped an offline evaluation framework called Sequeval that is designed to evaluate recommender systems capable of suggesting sequences of items. We provide a mathematical definition of such sequence-based recommenders, a methodology for performing their evaluation, and the implementation details of eight metrics. We report the lessons learned using this framework for assessing the performance of four baselines and two recommender systems based on Conditional Random Fields (CRF) and Recurrent Neural Networks (RNN), considering two different datasets. Sequeval is publicly available and it aims to become a focal point for researchers and practitioners when experimenting with sequence-based recommender systems, providing comparable and objective evaluation results.

Download Full-text

Very High Resolution Image Semantic Segmentation with Contextualized Convolutional Neural Network Coupled with Higher Order Conditional Random Fields

2019 IEEE 15th International Conference on Control and Automation (ICCA) ◽

10.1109/icca.2019.8899544 ◽

2019 ◽

Author(s):

Tiancan Mei ◽

Hong Ji ◽

Wenyuan Zheng ◽

Saixian He

Keyword(s):

Neural Network ◽

High Resolution ◽

Convolutional Neural Network ◽

Random Fields ◽

Conditional Random Fields ◽

Semantic Segmentation ◽

Higher Order ◽

Resolution Image ◽

High Resolution Image ◽

Very High

Download Full-text

Learning depth-sensitive conditional random fields for semantic segmentation of RGB-D images

2014 IEEE International Conference on Robotics and Automation (ICRA) ◽

10.1109/icra.2014.6907778 ◽

2014 ◽

Cited By ~ 32

Author(s):

Andreas C. Muller ◽

Sven Behnke

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Semantic Segmentation

Download Full-text