UVIRT—Unsupervised Virtual Try-on Using Disentangled Clothing and Person Features

Hideki Tsunashima; Kosuke Arase; Antony Lam; Hirokatsu Kataoka

doi:10.3390/s20195647

UVIRT—Unsupervised Virtual Try-on Using Disentangled Clothing and Person Features

Sensors ◽

10.3390/s20195647 ◽

2020 ◽

Vol 20 (19) ◽

pp. 5647

Author(s):

Hideki Tsunashima ◽

Kosuke Arase ◽

Antony Lam ◽

Hirokatsu Kataoka

Keyword(s):

Auxiliary Information ◽

Semantic Segmentation ◽

Target Person ◽

Person Features ◽

Category Labels ◽

Supervised Methods ◽

Weakly Supervised ◽

Very High ◽

Significant Attention

Virtual Try-on is the ability to realistically superimpose clothing onto a target person. Due to its importance to the multi-billion dollar e-commerce industry, the problem has received significant attention in recent years. To date, most virtual try-on methods have been supervised approaches, namely using annotated data, such as clothes parsing semantic segmentation masks and paired images. These approaches incur a very high cost in annotation. Even existing weakly-supervised virtual try-on methods still use annotated data or pre-trained networks as auxiliary information and the costs of the annotation are still significantly high. Plus, the strategy using pre-trained networks is not appropriate in the practical scenarios due to latency. In this paper we propose Unsupervised VIRtual Try-on using disentangled representation (UVIRT). After UVIRT extracts a clothes and a person feature from a person image and a clothes image respectively, it exchanges a clothes and a person feature. Finally, UVIRT achieve virtual try-on. This is all achieved in an unsupervised manner so UVIRT has the advantage that it does not require any annotated data, pre-trained networks nor even category labels. In the experiments, we qualitatively and quantitatively compare between supervised methods and our UVIRT method on the MPV dataset (which has paired images) and on a Consumer-to-Consumer (C2C) marketplace dataset (which has unpaired images). As a result, UVIRT outperform the supervised method on the C2C marketplace dataset, and achieve comparable results on the MPV dataset, which has paired images in comparison with the conventional supervised method.

Download Full-text

SPMF-Net: Weakly Supervised Building Segmentation by Combining Superpixel Pooling and Multi-Scale Feature Fusion

Remote Sensing ◽

10.3390/rs12061049 ◽

2020 ◽

Vol 12 (6) ◽

pp. 1049 ◽

Cited By ~ 2

Author(s):

Jie Chen ◽

Fen He ◽

Yi Zhang ◽

Geng Sun ◽

Min Deng

Keyword(s):

Feature Fusion ◽

Semantic Segmentation ◽

Building Detection ◽

Segmentation Method ◽

Scale Feature ◽

Multi Scale ◽

Semantic Labeling ◽

Supervised Methods ◽

Boundary Information ◽

Weakly Supervised

The lack of pixel-level labeling limits the practicality of deep learning-based building semantic segmentation. Weakly supervised semantic segmentation based on image-level labeling results in incomplete object regions and missing boundary information. This paper proposes a weakly supervised semantic segmentation method for building detection. The proposed method takes the image-level label as supervision information in a classification network that combines superpixel pooling and multi-scale feature fusion structures. The main advantage of the proposed strategy is its ability to improve the intactness and boundary accuracy of a detected building. Our method achieves impressive results on two 2D semantic labeling datasets, which outperform some competing weakly supervised methods and are close to the result of the fully supervised method.

Download Full-text

Multi-model Integrated Weakly Supervised Semantic Segmentation Method

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2019.17379 ◽

2019 ◽

Vol 31 (5) ◽

pp. 800

Author(s):

Changzhen Xiong ◽

Hui Zhi

Keyword(s):

Semantic Segmentation ◽

Segmentation Method ◽

Weakly Supervised

Download Full-text

Weakly-Supervised Recommended Traversable Area Segmentation Using Automatically Labeled Images for Autonomous Driving in Pedestrian Environment with No Edges

Sensors ◽

10.3390/s21020437 ◽

2021 ◽

Vol 21 (2) ◽

pp. 437

Author(s):

Yuya Onozuka ◽

Ryosuke Matsumi ◽

Motoki Shino

Keyword(s):

Visual Information ◽

Data Augmentation ◽

Semantic Segmentation ◽

Autonomous Driving ◽

Weighting Method ◽

Personal Mobility ◽

Human Understanding ◽

Autonomous Mobility ◽

Weakly Supervised ◽

Traffic Rules

Detection of traversable areas is essential to navigation of autonomous personal mobility systems in unknown pedestrian environments. However, traffic rules may recommend or require driving in specified areas, such as sidewalks, in environments where roadways and sidewalks coexist. Therefore, it is necessary for such autonomous mobility systems to estimate the areas that are mechanically traversable and recommended by traffic rules and to navigate based on this estimation. In this paper, we propose a method for weakly-supervised recommended traversable area segmentation in environments with no edges using automatically labeled images based on paths selected by humans. This approach is based on the idea that a human-selected driving path more accurately reflects both mechanical traversability and human understanding of traffic rules and visual information. In addition, we propose a data augmentation method and a loss weighting method for detecting the appropriate recommended traversable area from a single human-selected path. Evaluation of the results showed that the proposed learning methods are effective for recommended traversable area detection and found that weakly-supervised semantic segmentation using human-selected path information is useful for recommended area detection in environments with no edges.

Download Full-text

CSENet: Cascade semantic erasing network for weakly-supervised semantic segmentation

Neurocomputing ◽

10.1016/j.neucom.2020.05.107 ◽

2020 ◽

Author(s):

Jiahui Liu ◽

Changqian Yu ◽

Beibei Yang ◽

Changxin Gao ◽

Nong Sang

Keyword(s):

Semantic Segmentation ◽

Weakly Supervised

Download Full-text

Contrastive consistent feature learning for weakly supervised object localization semantic segmentation

Neurocomputing ◽

10.1016/j.neucom.2021.03.023 ◽

2021 ◽

Author(s):

Minsong Ki ◽

Youngjung Uh ◽

Wonyoung Lee ◽

Hyeran Byun

Keyword(s):

Feature Learning ◽

Semantic Segmentation ◽

Object Localization ◽

Consistent Feature ◽

Weakly Supervised

Download Full-text

RSS-Net: Weakly-Supervised Multi-Class Semantic Segmentation with FMCW Radar

2020 IEEE Intelligent Vehicles Symposium (IV) ◽

10.1109/iv47402.2020.9304674 ◽

2020 ◽

Author(s):

Prannay Kaul ◽

Daniele de Martini ◽

Matthew Gadd ◽

Paul Newman

Keyword(s):

Semantic Segmentation ◽

Fmcw Radar ◽

Weakly Supervised

Download Full-text

Towards closing the gap in weakly supervised semantic segmentation with DCNNs: Combining local and global models

Computer Vision and Image Understanding ◽

10.1016/j.cviu.2021.103209 ◽

2021 ◽

pp. 103209

Author(s):

Christoph Mayer ◽

Radu Timofte ◽

Grégory Paul

Keyword(s):

Semantic Segmentation ◽

Global Models ◽

Closing The Gap ◽

Weakly Supervised

Download Full-text

Weakly Supervised Learning with Deep Convolutional Neural Networks for Semantic Segmentation: Understanding Semantic Layout of Images with Minimum Human Supervision

IEEE Signal Processing Magazine ◽

10.1109/msp.2017.2742558 ◽

2017 ◽

Vol 34 (6) ◽

pp. 39-49 ◽

Cited By ~ 12

Author(s):

Seunghoon Hong ◽

Suha Kwak ◽

Bohyung Han

Keyword(s):

Neural Networks ◽

Supervised Learning ◽

Convolutional Neural Networks ◽

Semantic Segmentation ◽

Deep Convolutional Neural Networks ◽

Weakly Supervised Learning ◽

Weakly Supervised

Download Full-text

Single annotated pixel based weakly supervised semantic segmentation under driving scenes

Pattern Recognition ◽

10.1016/j.patcog.2021.107979 ◽

2021 ◽

Vol 116 ◽

pp. 107979

Author(s):

Xi Li ◽

Huimin Ma ◽

Sheng Yi ◽

Yanxian Chen ◽

Hongbing Ma

Keyword(s):

Semantic Segmentation ◽

Weakly Supervised

Download Full-text

Optimal Scale of Hierarchical Image Segmentation with Scribbles Guidance for Weakly Supervised Semantic Segmentation

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001421540264 ◽

2021 ◽

pp. 2154026

Author(s):

Zaid Al-Huda ◽

Donghai Zhai ◽

Yan Yang ◽

Riyadh Nazar Ali Algburi

Keyword(s):

Image Segmentation ◽

Graphical Model ◽

Semantic Segmentation ◽

Saliency Map ◽

Training Data ◽

Deep Convolutional Neural Networks ◽

High Quality ◽

Optimal Scale ◽

Supervised Segmentation ◽

Weakly Supervised

Deep convolutional neural networks (DCNNs) trained on the pixel-level annotated images have achieved improvements in semantic segmentation. Due to the high cost of labeling training data, their applications may have great limitation. However, weakly supervised segmentation approaches can significantly reduce human labeling efforts. In this paper, we introduce a new framework to generate high-quality initial pixel-level annotations. By using a hierarchical image segmentation algorithm to predict the boundary map, we select the optimal scale of high-quality hierarchies. In the initialization step, scribble annotations and the saliency map are combined to construct a graphic model over the optimal scale segmentation. By solving the minimal cut problem, it can spread information from scribbles to unmarked regions. In the training process, the segmentation network is trained by using the initial pixel-level annotations. To iteratively optimize the segmentation, we use a graphical model to refine segmentation masks and retrain the segmentation network to get more precise pixel-level annotations. The experimental results on Pascal VOC 2012 dataset demonstrate that the proposed framework outperforms most of weakly supervised semantic segmentation methods and achieves the state-of-the-art performance, which is [Formula: see text] mIoU.

Download Full-text