Number of Building Stories Estimation from Monocular Satellite Image Using a Modified Mask R-CNN

Chao Ji; Hong Tang

doi:10.3390/rs12223833

Number of Building Stories Estimation from Monocular Satellite Image Using a Modified Mask R-CNN

Remote Sensing ◽

10.3390/rs12223833 ◽

2020 ◽

Vol 12 (22) ◽

pp. 3833

Author(s):

Chao Ji ◽

Hong Tang

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Object Detection ◽

Satellite Remote Sensing ◽

Satellite Images ◽

Mean Absolute Error ◽

Satellite Image ◽

Absolute Error ◽

High Rise ◽

The Mean

Stereo photogrammetric survey used to be used to extract the height of buildings, then to convert the height to number of stories through certain rules to estimate the number of stories of buildings by means of satellite remote sensing. In contrast, we propose a new method using deep learning to estimate the number of stories of buildings from monocular optical satellite image end to end in this paper. To the best of our knowledge, this is the first attempt to directly estimate the number of stories of buildings from monocular satellite images. Specifically, in the proposed method, we extend a classic object detection network, i.e., Mask R-CNN, by adding a new head to predict the number of stories of detected buildings from satellite images. GF-2 images from nine cities in China are used to validate the effectiveness of the proposed methods. The result of experiment show that the mean absolute error of prediction on buildings whose stories between 1–7, 8–20, and above 20 are 1.329, 3.546, and 8.317, respectively, which indicate that our method has possible application potentials in low-rise buildings, but the accuracy in middle-rise and high-rise buildings needs to be further improved.

Download Full-text

Classification of EgyptSat-1 Images Using Deep Learning Methods

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327909666190207153858 ◽

2020 ◽

Vol 10 (1) ◽

pp. 37-46 ◽

Cited By ~ 3

Author(s):

Hatem Keshk ◽

Xu-Cheng Yin

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Deep Learning ◽

Convolutional Neural Network ◽

High Performance ◽

Satellite Images ◽

Satellite Image ◽

Aerial Photographs ◽

Average Accuracy

Background: Deep Learning (DL) neural network methods have become a hotspot subject of research in the remote sensing field. Classification of aerial satellite images depends on spectral content, which is a challenging topic in remote sensing. Objective: With the aim to accomplish a high performance and accuracy of Egyptsat-1 satellite image classification, the use of the Convolutional Neural Network (CNN) is raised in this paper because CNN is considered a leading deep learning method. CNN is developed to classify aerial photographs into land cover classes such as urban, vegetation, desert, water bodies, soil, roads, etc. In our work, a comparison between MAXIMUM Likelihood (ML) which represents the traditional supervised classification methods and CNN method is conducted. Conclusion: This research finds that CNN outperforms ML by 9%. The convolutional neural network has better classification result, which reached 92.25% as its average accuracy. Also, the experiments showed that the convolutional neural network is the most satisfactory and effective classification method applied to classify Egyptsat-1 satellite images.

Download Full-text

Cross-Racial Automatic Age Estimation from Facial Images using Deep Learning

International Journal of Emerging Trends in Engineering Research ◽

10.30534/ijeter/2021/14992021 ◽

2021 ◽

Vol 9 (9) ◽

pp. 1288-1294

Keyword(s):

Deep Learning ◽

Age Estimation ◽

Mean Absolute Error ◽

Absolute Error ◽

Learning Approach ◽

Racial Groups ◽

Human Beings ◽

The Mean ◽

Facial Images ◽

Facial Age

This paper presents a deep learning approach for age estimation of human beings using their facial images. The different racial groups based on skin colour have been incorporated in the annotations of the images in the dataset, while ensuring an adequate distribution of subjects across the racial groups so as to achieve an accurate Automatic Facial Age Estimation (AFAE). The principle of transfer learning is applied to the ResNet50 Convolutional Neural Network (CNN) initially pretrained for the task of object classification and finetuning it’s hyperparameters to propose an AFAE system that can be used to automate ages of humans across multiple racial groups. The mean absolute error of 4.25 years is obtained at the end of the research which proved the effectiveness and superiority of the proposed method.

Download Full-text

Predicting intraocular pressure using systemic variables or fundus photography with deep learning in a health examination cohort

Scientific Reports ◽

10.1038/s41598-020-80839-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Kaori Ishii ◽

Ryo Asaoka ◽

Takashi Omoto ◽

Shingo Mitaki ◽

Yuri Fujino ◽

...

Keyword(s):

Deep Learning ◽

Intraocular Pressure ◽

Mean Absolute Error ◽

Absolute Error ◽

Fundus Photography ◽

Training Dataset ◽

Support Vector ◽

Testing Dataset ◽

The Mean ◽

Systemic Variables

AbstractThe purpose of the current study was to predict intraocular pressure (IOP) using color fundus photography with a deep learning (DL) model, or, systemic variables with a multivariate linear regression model (MLM), along with least absolute shrinkage and selection operator regression (LASSO), support vector machine (SVM), and Random Forest: (RF). Training dataset included 3883 examinations from 3883 eyes of 1945 subjects and testing dataset 289 examinations from 289 eyes from 146 subjects. With the training dataset, MLM was constructed to predict IOP using 35 systemic variables and 25 blood measurements. A DL model was developed to predict IOP from color fundus photographs. The prediction accuracy of each model was evaluated through the absolute error and the marginal R-squared (mR2), using the testing dataset. The mean absolute error with MLM was 2.29 mmHg, which was significantly smaller than that with DL (2.70 dB). The mR2 with MLM was 0.15, whereas that with DL was 0.0066. The mean absolute error (between 2.24 and 2.30 mmHg) and mR2 (between 0.11 and 0.15) with LASSO, SVM and RF were similar to or poorer than MLM. A DL model to predict IOP using color fundus photography proved far less accurate than MLM using systemic variables.

Download Full-text

Deep Learning Architecture for Estimating Hourly Ground-Level PM2.5 Using Satellite Remote Sensing

IEEE Geoscience and Remote Sensing Letters ◽

10.1109/lgrs.2019.2900270 ◽

2019 ◽

Vol 16 (9) ◽

pp. 1343-1347 ◽

Cited By ~ 9

Author(s):

Yibo Sun ◽

Qiaolin Zeng ◽

Bing Geng ◽

Xinwen Lin ◽

Bilige Sude ◽

...

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Satellite Remote Sensing ◽

Ground Level

Download Full-text

A Review of Deep Learning-Based Contactless Heart Rate Measurement Methods

Sensors ◽

10.3390/s21113719 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3719

Author(s):

Aoxin Ni ◽

Arian Azarang ◽

Nasser Kehtarnavaz

Keyword(s):

Heart Rate ◽

Deep Learning ◽

Mean Absolute Error ◽

Absolute Error ◽

Video Camera ◽

Measurement Methods ◽

Rate Measurement ◽

Learning Methods ◽

The Public ◽

Heart Rate Measurement

The interest in contactless or remote heart rate measurement has been steadily growing in healthcare and sports applications. Contactless methods involve the utilization of a video camera and image processing algorithms. Recently, deep learning methods have been used to improve the performance of conventional contactless methods for heart rate measurement. After providing a review of the related literature, a comparison of the deep learning methods whose codes are publicly available is conducted in this paper. The public domain UBFC dataset is used to compare the performance of these deep learning methods for heart rate measurement. The results obtained show that the deep learning method PhysNet generates the best heart rate measurement outcome among these methods, with a mean absolute error value of 2.57 beats per minute and a mean square error value of 7.56 beats per minute.

Download Full-text

Design and Implementation of Intelligent Inspection and Alarm Flight System for Epidemic Prevention

Drones ◽

10.3390/drones5030068 ◽

2021 ◽

Vol 5 (3) ◽

pp. 68

Author(s):

Jiwei Fan ◽

Xiaogang Yang ◽

Ruitao Lu ◽

Xueli Xie ◽

Weipeng Li

Keyword(s):

Deep Learning ◽

Autonomous Navigation ◽

Detection Method ◽

Active Role ◽

Absolute Error ◽

Face Mask ◽

Learning Technology ◽

Flight System ◽

Crowd Density ◽

The Mean

Unmanned aerial vehicles (UAV) and related technologies have played an active role in the prevention and control of novel coronaviruses at home and abroad, especially in epidemic prevention, surveillance, and elimination. However, the existing UAVs have a single function, limited processing capacity, and poor interaction. To overcome these shortcomings, we designed an intelligent anti-epidemic patrol detection and warning flight system, which integrates UAV autonomous navigation, deep learning, intelligent voice, and other technologies. Based on the convolution neural network and deep learning technology, the system possesses a crowd density detection method and a face mask detection method, which can detect the position of dense crowds. Intelligent voice alarm technology was used to achieve an intelligent alarm system for abnormal situations, such as crowd-gathering areas and people without masks, and to carry out intelligent dissemination of epidemic prevention policies, which provides a powerful technical means for epidemic prevention and delaying their spread. To verify the superiority and feasibility of the system, high-precision online analysis was carried out for the crowd in the inspection area, and pedestrians’ faces were detected on the ground to identify whether they were wearing a mask. The experimental results show that the mean absolute error (MAE) of the crowd density detection was less than 8.4, and the mean average precision (mAP) of face mask detection was 61.42%. The system can provide convenient and accurate evaluation information for decision-makers and meets the requirements of real-time and accurate detection.

Download Full-text

Detection and Severity Evaluation of Combined Rail Defects Using Deep Learning

Vibration ◽

10.3390/vibration4020022 ◽

2021 ◽

Vol 4 (2) ◽

pp. 341-356

Author(s):

Jessada Sresakoolchai ◽

Sakdirat Kaewunruen

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Mean Absolute Error ◽

Absolute Error ◽

Machine Learning Techniques ◽

Rolling Stock ◽

Raw Data ◽

Learning Techniques ◽

Combined Defects

Various techniques have been developed to detect railway defects. One of the popular techniques is machine learning. This unprecedented study applies deep learning, which is a branch of machine learning techniques, to detect and evaluate the severity of rail combined defects. The combined defects in the study are settlement and dipped joint. Features used to detect and evaluate the severity of combined defects are axle box accelerations simulated using a verified rolling stock dynamic behavior simulation called D-Track. A total of 1650 simulations are run to generate numerical data. Deep learning techniques used in the study are deep neural network (DNN), convolutional neural network (CNN), and recurrent neural network (RNN). Simulated data are used in two ways: simplified data and raw data. Simplified data are used to develop the DNN model, while raw data are used to develop the CNN and RNN model. For simplified data, features are extracted from raw data, which are the weight of rolling stock, the speed of rolling stock, and three peak and bottom accelerations from two wheels of rolling stock. In total, there are 14 features used as simplified data for developing the DNN model. For raw data, time-domain accelerations are used directly to develop the CNN and RNN models without processing and data extraction. Hyperparameter tuning is performed to ensure that the performance of each model is optimized. Grid search is used for performing hyperparameter tuning. To detect the combined defects, the study proposes two approaches. The first approach uses one model to detect settlement and dipped joint, and the second approach uses two models to detect settlement and dipped joint separately. The results show that the CNN models of both approaches provide the same accuracy of 99%, so one model is good enough to detect settlement and dipped joint. To evaluate the severity of the combined defects, the study applies classification and regression concepts. Classification is used to evaluate the severity by categorizing defects into light, medium, and severe classes, and regression is used to estimate the size of defects. From the study, the CNN model is suitable for evaluating dipped joint severity with an accuracy of 84% and mean absolute error (MAE) of 1.25 mm, and the RNN model is suitable for evaluating settlement severity with an accuracy of 99% and mean absolute error (MAE) of 1.58 mm.

Download Full-text

Explanation Plus Prediction—The Logical Focus of Project Management Research

Project Management Journal ◽

10.1177/8756972821999945 ◽

2021 ◽

pp. 875697282199994

Author(s):

Joseph F. Hair ◽

Marko Sarstedt

Keyword(s):

Project Management ◽

Statistical Models ◽

Predictive Power ◽

Mean Absolute Error ◽

Explanatory Power ◽

Absolute Error ◽

Model Parameters ◽

Mean Square ◽

Management Research ◽

The Mean

Most project management research focuses almost exclusively on explanatory analyses. Evaluation of the explanatory power of statistical models is generally based on F-type statistics and the R 2 metric, followed by an assessment of the model parameters (e.g., beta coefficients) in terms of their significance, size, and direction. However, these measures are not indicative of a model’s predictive power, which is central for deriving managerial recommendations. We recommend that project management researchers routinely use additional metrics, such as the mean absolute error or the root mean square error, to accurately quantify their statistical models’ predictive power.

Download Full-text

Visualizing the Variance of a Random Variable

Open Systems & Information Dynamics ◽

10.1142/s1230161211000054 ◽

2011 ◽

Vol 18 (01) ◽

pp. 71-85

Author(s):

Fabrizio Cacciafesta

Keyword(s):

Stochastic Dominance ◽

Mean Absolute Error ◽

Random Variable ◽

Absolute Error ◽

Second Order ◽

Taylor Formula ◽

Order Stochastic Dominance ◽

The Mean ◽

Second Order Stochastic Dominance ◽

Options Theory

We provide a simple way to visualize the variance and the mean absolute error of a random variable with finite mean. Some application to options theory and to second order stochastic dominance is given: we show, among other, that the "call-put parity" may be seen as a Taylor formula.

Download Full-text

SeMo-YOLO: A Multiscale Object Detection Network in Satellite Remote Sensing Images

10.1109/ijcnn52387.2021.9534343 ◽

2021 ◽

Author(s):

Peng Li ◽

Cheng Che

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Satellite Remote Sensing ◽

Remote Sensing Images

Download Full-text