Ordered Subspace Clustering for Complex Non-Rigid Motion by 3D Reconstruction

Weinan Du; Jinghua Li; Fei Wu; Yanfeng Sun; Yongli Hu

doi:10.3390/app9081559

Ordered Subspace Clustering for Complex Non-Rigid Motion by 3D Reconstruction

Applied Sciences ◽

10.3390/app9081559 ◽

2019 ◽

Vol 9 (8) ◽

pp. 1559

Author(s):

Weinan Du ◽

Jinghua Li ◽

Fei Wu ◽

Yanfeng Sun ◽

Yongli Hu

Keyword(s):

State Of The Art ◽

Motion Segmentation ◽

Rigid Motion ◽

Subspace Clustering ◽

Sequential Data ◽

Dynamic Scene ◽

Normalized Mutual Information ◽

3D Data ◽

Segmentation Approach ◽

Representation Method

As a fundamental and challenging problem, non-rigid structure-from-motion (NRSfM) has attracted a large amount of research interest. It is worth mentioning that NRSfM has been applied to dynamic scene understanding and motion segmentation. Especially, a motion segmentation approach combining NRSfM with the subspace representation has been proposed. However, the current subspace representation for non-rigid motions clustering do not take into account the inherent sequential property, which has been proved vital for sequential data clustering. Hence this paper proposes a novel framework to segment the complex and non-rigid motion via an ordered subspace representation method for the reconstructed 3D data, where the sequential property is properly formulated in the procedure of learning the affinity matrix for clustering with simultaneously recovering the 3D non-rigid motion by a monocular camera with 2D point tracks. Experiment results on three public sequential action datasets, BU-4DFE, MSR and UMPM, verify the benefits of method presented in this paper for classical complex non-rigid motion analysis and outperform state-of-the-art methods with lowest subspace clustering error (SCE) rates and highest normalized mutual information (NMI) in subspace clustering and motion segmentation fields.

Download Full-text

Towards Scalable Economic Photovoltaic Potential Analysis Using Aerial Images and Deep Learning

Energies ◽

10.3390/en14133800 ◽

2021 ◽

Vol 14 (13) ◽

pp. 3800

Author(s):

Sebastian Krapf ◽

Nils Kemmerzell ◽

Syed Khawaja Haseeb Khawaja Haseeb Uddin ◽

Manuel Hack Hack Vázquez ◽

Fabian Netzler ◽

...

Keyword(s):

Deep Learning ◽

System Analysis ◽

State Of The Art ◽

Critical Role ◽

Semantic Segmentation ◽

Energy System ◽

Aerial Images ◽

Potential Analysis ◽

3D Data ◽

Challenges And Opportunities

Roof-mounted photovoltaic systems play a critical role in the global transition to renewable energy generation. An analysis of roof photovoltaic potential is an important tool for supporting decision-making and for accelerating new installations. State of the art uses 3D data to conduct potential analyses with high spatial resolution, limiting the study area to places with available 3D data. Recent advances in deep learning allow the required roof information from aerial images to be extracted. Furthermore, most publications consider the technical photovoltaic potential, and only a few publications determine the photovoltaic economic potential. Therefore, this paper extends state of the art by proposing and applying a methodology for scalable economic photovoltaic potential analysis using aerial images and deep learning. Two convolutional neural networks are trained for semantic segmentation of roof segments and superstructures and achieve an Intersection over Union values of 0.84 and 0.64, respectively. We calculated the internal rate of return of each roof segment for 71 buildings in a small study area. A comparison of this paper’s methodology with a 3D-based analysis discusses its benefits and disadvantages. The proposed methodology uses only publicly available data and is potentially scalable to the global level. However, this poses a variety of research challenges and opportunities, which are summarized with a focus on the application of deep learning, economic photovoltaic potential analysis, and energy system analysis.

Download Full-text

Adaptive Structure Concept Factorization for Multiview Clustering

Neural Computation ◽

10.1162/neco_a_01055 ◽

2018 ◽

Vol 30 (4) ◽

pp. 1080-1103 ◽

Cited By ~ 10

Author(s):

Kun Zhan ◽

Jinhui Shi ◽

Jing Wang ◽

Haibo Wang ◽

Yuange Xie

Keyword(s):

Nonnegative Matrix Factorization ◽

State Of The Art ◽

Nonnegative Matrix ◽

Adaptive Method ◽

Data Sets ◽

Clustering Methods ◽

Normalized Mutual Information ◽

Adaptive Structure ◽

Concept Factorization ◽

Multiview Clustering

Most existing multiview clustering methods require that graph matrices in different views are computed beforehand and that each graph is obtained independently. However, this requirement ignores the correlation between multiple views. In this letter, we tackle the problem of multiview clustering by jointly optimizing the graph matrix to make full use of the data correlation between views. With the interview correlation, a concept factorization–based multiview clustering method is developed for data integration, and the adaptive method correlates the affinity weights of all views. This method differs from nonnegative matrix factorization–based clustering methods in that it can be applicable to data sets containing negative values. Experiments are conducted to demonstrate the effectiveness of the proposed method in comparison with state-of-the-art approaches in terms of accuracy, normalized mutual information, and purity.

Download Full-text

A Multiscale CNN-CRF Framework for Environmental Microorganism Image Segmentation

BioMed Research International ◽

10.1155/2020/4621403 ◽

2020 ◽

Vol 2020 ◽

pp. 1-27

Author(s):

Jinghua Zhang ◽

Chen Li ◽

Frank Kulwa ◽

Xin Zhao ◽

Changhao Sun ◽

...

Keyword(s):

Image Segmentation ◽

State Of The Art ◽

Conditional Random Field ◽

Segmentation Method ◽

Recall Accuracy ◽

Evaluation Indexes ◽

Segmentation Quality ◽

Segmentation Approach ◽

Overall Evaluation

To assist researchers to identify Environmental Microorganisms (EMs) effectively, a Multiscale CNN-CRF (MSCC) framework for the EM image segmentation is proposed in this paper. There are two parts in this framework: The first is a novel pixel-level segmentation approach, using a newly introduced Convolutional Neural Network (CNN), namely, “mU-Net-B3”, with a dense Conditional Random Field (CRF) postprocessing. The second is a VGG-16 based patch-level segmentation method with a novel “buffer” strategy, which further improves the segmentation quality of the details of the EMs. In the experiment, compared with the state-of-the-art methods on 420 EM images, the proposed MSCC method reduces the memory requirement from 355 MB to 103 MB, improves the overall evaluation indexes (Dice, Jaccard, Recall, Accuracy) from 85.24%, 77.42%, 82.27%, and 96.76% to 87.13%, 79.74%, 87.12%, and 96.91%, respectively, and reduces the volume overlap error from 22.58% to 20.26%. Therefore, the MSCC method shows great potential in the EM segmentation field.

Download Full-text

Registration of Terrestrial Laser Scanning Surveys Using Terrain-Invariant Regions for Measuring Exploitative Volumes over Open-Pit Mines

Remote Sensing ◽

10.3390/rs11060606 ◽

2019 ◽

Vol 11 (6) ◽

pp. 606 ◽

Cited By ~ 7

Author(s):

Zhihua Xu ◽

Ershuai Xu ◽

Lixin Wu ◽

Shanjun Liu ◽

Yachun Mao

Keyword(s):

Laser Scanning ◽

State Of The Art ◽

Rigid Motion ◽

Terrestrial Laser Scanning ◽

Open Pit ◽

Time Interval ◽

Open Pit Mines ◽

Registration Method ◽

Art Methods ◽

Invariant Regions

Terrestrial laser scanning (TLS) techniques have been widely used in open-pit mine applications. It is a crucial task to measure the exploitative volume of open-pit mines, within a specific time interval. One major challenge is posed, however, when conducting accurate registrations for temporal TLS surveys in continuously changing areas, created by excavation activities. In this paper, we propose a coarse-to-fine registration method, based on terrain-invariant regions (TIR), for temporal TLS surveys. More specifically, an approximate four-point congruent set (4PCS) of temporal TLS surveys is first identified, based on affine invariant rules. Second, a set of correspondences among temporal TLS surveys were collected by matching multi-scale sparse features of the 3D neighbors, centered at the approximate 4PCS. Third, the correspondences were used to estimate a rigid motion between the overlapping TLS surveys for the coarse registration, according to which the initial TIR from temporal TLS surveys were identified. Finally, the rigid motion between temporal TLS was iteratively optimized, based on the point clouds, only from the TIR. Based on the fine-level registered TLS surveys, Digital Elevation Models (DEMs) can be generated to calculate the exploitative volume, through a DEM differential. We applied the proposed method to two open-pit mines in China, and also compared our method with five state-of-the-art methods for registering temporal TLS surveys. Experimental results indicated that the proposed method achieved a higher registration accuracy than the state-of-the-art methods. Based on the registered result, our method achieved a 98.03% overall accuracy for measuring the exploitative volume, compared to in-situ measurement.

Download Full-text

Direct model-based image motion segmentation for dynamic scene analysis

Recent Developments in Computer Vision - Lecture Notes in Computer Science ◽

10.1007/3-540-60793-5_66 ◽

1996 ◽

pp. 101-110 ◽

Cited By ~ 4

Author(s):

Jean-Marc Odobez ◽

Patrick Bouthemy

Keyword(s):

Motion Segmentation ◽

Scene Analysis ◽

Image Motion ◽

Dynamic Scene ◽

Model Based ◽

Dynamic Scene Analysis ◽

Direct Model

Download Full-text

Large-Scale Multi-View Subspace Clustering in Linear Time

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5867 ◽

2020 ◽

Vol 34 (04) ◽

pp. 4412-4419 ◽

Cited By ~ 3

Author(s):

Zhao Kang ◽

Wangtao Zhou ◽

Zhitong Zhao ◽

Junming Shao ◽

Meng Han ◽

...

Keyword(s):

Large Scale ◽

State Of The Art ◽

Linear Time ◽

Subspace Clustering ◽

Data Sets ◽

Clustering Methods ◽

Single View ◽

Novel Approach ◽

Points Of View ◽

Effectiveness And Efficiency

A plethora of multi-view subspace clustering (MVSC) methods have been proposed over the past few years. Researchers manage to boost clustering accuracy from different points of view. However, many state-of-the-art MVSC algorithms, typically have a quadratic or even cubic complexity, are inefficient and inherently difficult to apply at large scales. In the era of big data, the computational issue becomes critical. To fill this gap, we propose a large-scale MVSC (LMVSC) algorithm with linear order complexity. Inspired by the idea of anchor graph, we first learn a smaller graph for each view. Then, a novel approach is designed to integrate those graphs so that we can implement spectral clustering on a smaller graph. Interestingly, it turns out that our model also applies to single-view scenario. Extensive experiments on various large-scale benchmark data sets validate the effectiveness and efficiency of our approach with respect to state-of-the-art clustering methods.

Download Full-text

Subspace Clustering with Sparsity and Grouping Effect

Mathematical Problems in Engineering ◽

10.1155/2017/4787039 ◽

2017 ◽

Vol 2017 ◽

pp. 1-9 ◽

Cited By ~ 1

Author(s):

Binbin Zhang ◽

Weiwei Wang ◽

Xiangchu Feng

Keyword(s):

State Of The Art ◽

Subspace Clustering ◽

The Self ◽

Dimensional Structure ◽

High Dimensional ◽

Affinity Matrix ◽

Grouping Effect ◽

Art Methods ◽

Data Points ◽

Low Dimensional

Subspace clustering aims to group a set of data from a union of subspaces into the subspace from which it was drawn. It has become a popular method for recovering the low-dimensional structure underlying high-dimensional dataset. The state-of-the-art methods construct an affinity matrix based on the self-representation of the dataset and then use a spectral clustering method to obtain the final clustering result. These methods show that sparsity and grouping effect of the affinity matrix are important in recovering the low-dimensional structure. In this work, we propose a weighted sparse penalty and a weighted grouping effect penalty in modeling the self-representation of data points. The experimental results on Extended Yale B, USPS, and Berkeley 500 image segmentation datasets show that the proposed model is more effective than state-of-the-art methods in revealing the subspace structure underlying high-dimensional dataset.

Download Full-text

Integrating the Symmetry Image and Improved Sparse Representation for Railway Fastener Classification and Defect Recognition

Mathematical Problems in Engineering ◽

10.1155/2015/462528 ◽

2015 ◽

Vol 2015 ◽

pp. 1-11 ◽

Cited By ~ 6

Author(s):

Jiajia Liu ◽

Bailin Li ◽

Ying Xiong ◽

Biao He ◽

Li Li

Keyword(s):

Sparse Representation ◽

State Of The Art ◽

Identification Problem ◽

Test Sample ◽

Computationally Efficient ◽

Railway Traffic ◽

Training Samples ◽

Inspection Systems ◽

Representation Method ◽

Defect Recognition

The detection of fastener defects is an important task for ensuring the safety of railway traffic. The earlier automatic inspection systems based on computer vision can detect effectively the completely missing fasteners, but they have weaker ability to recognize the partially worn ones. In this paper, we propose a method for detecting both partly worn and completely missing fasteners, the proposed algorithm exploits the first and second symmetry sample of original testing fastener image and integrates them for improved representation-based fastener recognition. This scheme is simple and computationally efficient. The underlying rationales of the scheme are as follows: First, the new virtual symmetrical images really reflect some possible appearance of the fastener; then the integration of two judgments of the symmetrical sample for fastener recognition can somewhat overcome the misclassification problem. Second, the improved sparse representation method discarding the training samples that are “far” from the test sample and uses a small number of samples that are “near” to the test sample to represent the test sample, so as to perform classification and it is able to reduce the side-effect of the error identification problem of the original fastener image. The experimental results show that the proposed method outperforms state-of-the-art fastener recognition methods.

Download Full-text