scholarly journals A Room Monitoring System Using Deep Learning and Perspective Correction Techniques

2020 ◽  
Vol 10 (13) ◽  
pp. 4423
Author(s):  
Huu-Huy Ngo ◽  
Feng-Cheng Lin ◽  
Yang-Ting Sehn ◽  
Mengru Tu ◽  
Chyi-Ren Dow

Studies on room monitoring have only focused on objects in a singular and uniform posture or low-density groups. Considering the wide use of convolutional neural networks for object detection, especially person detection, we use deep learning and perspective correction techniques to propose a room monitoring system that can detect persons with different motion states, high-density groups, and small-sized persons owing to the distance from the camera. This system uses consecutive frames from the monitoring camera as input images. Two approaches are used: perspective correction and person detection. First, perspective correction is used to transform an input image into a 2D top-view image. This allows users to observe the system more easily with different views (2D and 3D views). Second, the proposed person detection scheme combines the Mask region-based convolutional neural network (R-CNN) scheme and the tile technique for person detection, especially for detecting small-sized persons. All results are stored in a cloud database. Moreover, new person coordinates in 2D images are generated from the final bounding boxes and heat maps are created according to the 2D images; these enable users to examine the system quickly in different views. Additionally, a system prototype is developed to demonstrate the feasibility of the proposed system. Experimental results prove that our proposed system outperforms existing schemes in terms of accuracy, mean absolute error (MAE), and root mean squared error (RMSE).

2020 ◽  
Vol 71 (7) ◽  
pp. 868-880
Author(s):  
Nguyen Hong-Quan ◽  
Nguyen Thuy-Binh ◽  
Tran Duc-Long ◽  
Le Thi-Lan

Along with the strong development of camera networks, a video analysis system has been become more and more popular and has been applied in various practical applications. In this paper, we focus on person re-identification (person ReID) task that is a crucial step of video analysis systems. The purpose of person ReID is to associate multiple images of a given person when moving in a non-overlapping camera network. Many efforts have been made to person ReID. However, most of studies on person ReID only deal with well-alignment bounding boxes which are detected manually and considered as the perfect inputs for person ReID. In fact, when building a fully automated person ReID system the quality of the two previous steps that are person detection and tracking may have a strong effect on the person ReID performance. The contribution of this paper are two-folds. First, a unified framework for person ReID based on deep learning models is proposed. In this framework, the coupling of a deep neural network for person detection and a deep-learning-based tracking method is used. Besides, features extracted from an improved ResNet architecture are proposed for person representation to achieve a higher ReID accuracy. Second, our self-built dataset is introduced and employed for evaluation of all three steps in the fully automated person ReID framework.


Sensors ◽  
2020 ◽  
Vol 20 (10) ◽  
pp. 2939
Author(s):  
Mohsen Gholami ◽  
Christopher Napier ◽  
Carlo Menon

Abnormal running kinematics are associated with an increased incidence of lower extremity injuries among runners. Accurate and unobtrusive running kinematic measurement plays an important role in the detection of gait abnormalities and the prevention of injuries among runners. Inertial-based methods have been proposed to address this need. However, previous methods require cumbersome sensor setup or participant-specific calibration. This study aims to validate a shoe-mounted accelerometer for sagittal plane lower extremity angle measurement during running based on a deep learning approach. A convolutional neural network (CNN) architecture was selected as the regression model to generalize in inter-participant scenarios and to minimize poorly estimated joints. Motion and accelerometer data were recorded from ten participants while running on a treadmill at five different speeds. The reference joint angles were measured by an optical motion capture system. The CNN model predictions deviated from the reference angles with a root mean squared error (RMSE) of less than 3.5° and 6.5° in intra- and inter-participant scenarios, respectively. Moreover, we provide an estimation of six important gait events with a mean absolute error of less than 2.5° and 6.5° in intra- and inter-participants scenarios, respectively. This study highlights an appealing minimal sensor setup approach for gait analysis purposes.


Sensors ◽  
2021 ◽  
Vol 21 (21) ◽  
pp. 7058
Author(s):  
Heesang Eom ◽  
Jongryun Roh ◽  
Yuli Sun Hariyani ◽  
Suwhan Baek ◽  
Sukho Lee ◽  
...  

Wearable technologies are known to improve our quality of life. Among the various wearable devices, shoes are non-intrusive, lightweight, and can be used for outdoor activities. In this study, we estimated the energy consumption and heart rate in an environment (i.e., running on a treadmill) using smart shoes equipped with triaxial acceleration, triaxial gyroscope, and four-point pressure sensors. The proposed model uses the latest deep learning architecture which does not require any separate preprocessing. Moreover, it is possible to select the optimal sensor using a channel-wise attention mechanism to weigh the sensors depending on their contributions to the estimation of energy expenditure (EE) and heart rate (HR). The performance of the proposed model was evaluated using the root mean squared error (RMSE), mean absolute error (MAE), and coefficient of determination (R2). Moreover, the RMSE was 1.05 ± 0.15, MAE 0.83 ± 0.12 and R2 0.922 ± 0.005 in EE estimation. On the other hand, and RMSE was 7.87 ± 1.12, MAE 6.21 ± 0.86, and R2 0.897 ± 0.017 in HR estimation. In both estimations, the most effective sensor was the z axis of the accelerometer and gyroscope sensors. Through these results, it is demonstrated that the proposed model could contribute to the improvement of the performance of both EE and HR estimations by effectively selecting the optimal sensors during the active movements of participants.


2021 ◽  
Vol 20 ◽  
pp. 153303382110624
Author(s):  
Xudong Xue ◽  
Yi Ding ◽  
Jun Shi ◽  
Xiaoyu Hao ◽  
Xiangbin Li ◽  
...  

Objective: To generate synthetic CT (sCT) images with high quality from CBCT and planning CT (pCT) for dose calculation by using deep learning methods. Methods: 169 NPC patients with a total of 20926 slices of CBCT and pCT images were included. In this study the CycleGAN, Pix2pix and U-Net models were used to generate the sCT images. The Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Peak Signal to Noise Ratio (PSNR), and Structural Similarity Index (SSIM) were used to quantify the accuracy of the proposed models in a testing cohort of 34 patients. Radiation dose were calculated on pCT and sCT following the same protocol. Dose distributions were evaluated for 4 patients by comparing the dose-volume-histogram (DVH) and 2D gamma index analysis. Results: The average MAE and RMSE values between sCT by three models and pCT reduced by 15.4 HU and 26.8 HU at least, while the mean PSNR and SSIM metrics between sCT by different models and pCT added by 10.6 and 0.05 at most, respectively. There were only slight differences for DVH of selected contours between different plans. The passing rates of 2D gamma index analysis under 3 mm/3% 3 mm/2%, 2 mm/3%and 2 mm/2% criteria were all higher than 95%. Conclusions: All the sCT had achieved better evaluation metrics than those of original CBCT, while the performance of CycleGAN model was proved to be best among three methods. The dosimetric agreement confirmed the HU accuracy and consistent anatomical structures of sCT by deep learning methods.


2020 ◽  
Vol 10 (20) ◽  
pp. 7233
Author(s):  
Mohd Asyraf Zulkifley ◽  
Siti Raihanah Abdani ◽  
Nuraisyah Hani Zulkifley

One of the methods for identifying growth disorder is by assessing the skeletal bone age. A child with a healthy growth rate will have approximately the same chronological and bone ages. It is important to detect any growth disorder as early as possible, so that mitigation treatment can be administered with less negative consequences. Recently, the most popular approach in assessing the discrepancy between bone and chronological ages is through the subjective protocol of Tanner–Whitehouse that assesses selected regions in the hand X-ray images. This approach relies heavily on the medical personnel experience, which produces a high intra-observer bias. Therefore, an automated bone age prediction system with image registration using hand X-ray images is proposed in order to complement the inexperienced doctors by providing the second opinion. The system relies on an optimized regression network using a novel residual separable convolution model. The regressor network requires an input image to be 299 × 299 pixels, which will be mapped to the predicted bone age through three modules of the Xception network. Moreover, the images will be pre-processed or registered first to a standardized and normalized pose using separable convolutional neural networks. Three steps image registration are performed by segmenting the hand regions, which will be rotated using angle calculated from four keypoints of interest, before positional alignment is applied to ensure the region of interest is located in the middle. The hand segmentation is based on DeepLab V3 plus architecture, while keypoints regressor for angle alignment is based on MobileNet V1 architecture, where both of them use separable convolution as the core operators. To avoid the pitfall of underfitting, synthetic data are generated while using various rotation angles, zooming factors, and shearing images in order to augment the training dataset. The experimental results show that the proposed method returns the lowest mean absolute error and mean squared error of 8.200 months and 121.902 months2, respectively. Hence, an error of less than one year is acceptable in predicting the bone age, which can serve as a good supplement tool for providing the second expert opinion. This work does not consider gender information, which is crucial in making a better prediction, as the male and female bone structures are naturally different.


2020 ◽  
Vol 71 (7) ◽  
pp. 868-880
Author(s):  
Nguyen Hong Quan ◽  
Nguyen Thuy Binh ◽  
Tran Duc Long ◽  
Le Thi Lan

Along with the strong development of camera networks, a video analysis system has been become more and more popular and has been applied in various practical applications. In this paper, we focus on person re-identification (person ReID) task that is a crucial step of video analysis systems. The purpose of person ReID is to associate multiple images of a given person when moving in a non-overlapping camera network. Many efforts have been made to person ReID. However, most of studies on person ReID only deal with well-alignment bounding boxes which are detected manually and considered as the perfect inputs for person ReID. In fact, when building a fully automated person ReID system the quality of the two previous steps that are person detection and tracking may have a strong effect on the person ReID performance. The contribution of this paper are two-folds. First, a unified framework for person ReID based on deep learning models is proposed. In this framework, the coupling of a deep neural network for person detection and a deep-learning-based tracking method is used. Besides, features extracted from an improved ResNet architecture are proposed for person representation to achieve a higher ReID accuracy. Second, our self-built dataset is introduced and employed for evaluation of all three steps in the fully automated person ReID framework.


Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Shuping Li ◽  
Taotang Liu

Predicting students’ performance is very important in matters related to higher education as well as with regard to deep learning and its relationship to educational data. Prediction of students’ performance provides support in selecting courses and designing appropriate future study plans for students. In addition to predicting the performance of students, it helps teachers and managers to monitor students in order to provide support to them and to integrate the training programs to obtain the best results. One of the benefits of student’s prediction is that it reduces the official warning signs as well as expelling students because of their inefficiency. Prediction provides support to the students themselves through their choice of courses and study plans appropriate to their abilities. The proposed method used deep neural network in prediction by extracting informative data as a feature with corresponding weights. Multiple updated hidden layers are used to design neural network automatically; number of nodes and hidden layers controlled by feed forwarding and backpropagation data are produced by previous cases. The training mode is used to train the system with labeled data from dataset and the testing mode is used for evaluating the system. Mean absolute error (MAE) and root mean squared error (RMSE) with accuracy used for evolution of the proposed method. The proposed system has proven its worth in terms of efficiency through the achieved results in MAE (0.593) and RMSE (0.785) to get the best prediction.


2021 ◽  
Vol 3 (2) ◽  
pp. 153-165
Author(s):  
Meejoung Kim

In this paper, we analyze and predict the number of daily confirmed cases of coronavirus (COVID-19) based on two statistical models and a deep learning (DL) model; the autoregressive integrated moving average (ARIMA), the generalized autoregressive conditional heteroscedasticity (GARCH), and the stacked long short-term memory deep neural network (LSTM DNN). We find the orders of the statistical models by the autocorrelation function and the partial autocorrelation function, and the hyperparameters of the DL model, such as the numbers of LSTM cells and blocks of a cell, by the exhaustive search. Ten datasets are used in the experiment; nine countries and the world datasets, from Dec. 31, 2019, to Feb. 22, 2021, provided by the WHO. We investigate the effects of data size and vaccination on performance. Numerical results show that performance depends on the used data's dates and vaccination. It also shows that the prediction by the LSTM DNN is better than those of the two statistical models. Based on the experimental results, the percentage improvements of LSTM DNN are up to 88.54% (86.63%) and 90.15% (87.74%) compared to ARIMA and GARCH, respectively, in mean absolute error (root mean squared error). While the performances of ARIMA and GARCH are varying according to the datasets. The obtained results may provide a criterion for the performance ranges and prediction accuracy of the COVID-19 daily confirmed cases.Doi: 10.28991/SciMedJ-2021-0302-7 Full Text: PDF


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Shengguo Ge ◽  
Kuo Li ◽  
Siti Nurulain Binti Mohd Rum

In array signal processing, the direction of arrival (DOA) of the signal source has drawn broad research interests with its wide applications in fields such as sonar, radar, communications, medical detection, and electronic countermeasures. In recent years, the application of deep learning (DL) to DOA estimation has achieved great success. This study provides a systematic review of research on DOA estimation using deep neural network methods. We manually selected twenty-five papers related to this research from five prominent databases (SpringerLink, IEEE Xplore, ScienceDirect, Scopus, and Google Scholar) for exploration. Six questions describing the overall trend of DOA estimation using deep learning are put forward. Then, we answered these questions by reviewing the literature. This review is helpful for researchers in this field because it provides more specific and comprehensive information needed for future research. Specifically, we first analyzed the background of the selected papers, including the type of publication, the number of citations, and the country of origin. Then, the DL technology used in DOA estimation is systematically analyzed, including the purpose of using DL in DOA estimation, various DL models (convolutional neural network, deep neural network, and combination network), and various DOA estimation schemes. Finally, various evaluation criteria (root-mean-squared error, accuracy, and mean absolute error) are used to evaluate the DL technology in DOA estimation, and various factors (signal-to-noise ratio, number of snapshots, number of antennas, and number of signal sources) affecting DOA estimation are analyzed. Based on our findings, we believe that deep learning can perform DOA estimation well, and there is still room for improvement in deep learning technology. In this study, the factors affecting DOA estimation can be used as the direction for researchers to conduct in-depth research.


2020 ◽  
Vol 29 (2) ◽  
pp. e013
Author(s):  
İlker Ercanli

Aim of Study: As an innovative prediction technique, Artificial Intelligence technique based on a Deep Learning Algorithm (DLA) with various numbers of neurons and hidden layer alternatives were trained and evaluated to predict the relationships between total tree height (TTH) and diameter at breast height (DBH) with nonlinear least squared (NLS) regression models and nonlinear mixed effect (NLME) regression models.Area of Study: The data of this study were measured from even-aged, pure Turkish Pine (Pinus brutia Ten.) stands in the Kestel Forests located in the Bursa region of northwestern Turkey.Material and Methods: 1132 pairs of TTH-DBH measurements from 132 sample plots were used for modeling relationships between TTH, DBH, and stand attributes such as dominant height (Ho) and diameter (Do).Main Results: The combination of 100 # neurons and 8 # hidden layer in DLA resulted in the best predictive total height prediction values with Average Absolute Error (0.4188), max. Average Absolute Error (3.7598), Root Mean Squared Error (0.6942), Root Mean Squared error % (5.2164), Akaike Information Criteria (-345.4465), Bayesian Information Criterion (-330.836), the average Bias (0.0288) and the average Bias % (0.2166), and fitting abilities with r (0.9842) and Fit Index (0.9684). Also, the results of equivalence tests showed that the DLA technique successfully predicted the TTH in the validation dataset.Research highlights: These superior fitting scores coupled with the validation results in TTH predictions suggested that deep learning network models should be considered an alternative to the traditional nonlinear regression techniques and should be given importance as an innovative prediction technique.Keywords: Prediction; artificial intelligence; deep learning algorithms; number of neurons; hidden layer alternatives.Abbreviations: TTH (total tree height), DBH (diameter at breast height), OLS (ordinary least squares), NLME (nonlinear mixed effect), AIT (Artificial Intelligence Techniques), ANN (Artificial Neural Network), DLA (Deep Learning Algorithm), GPU (Graphical Processing Units), NLS (nonlinear least squared), RMSE (root mean squared error), AIC (Akaike information criteria), BIC (Bayesian information criterion), FI (fit index), AAE (average absolute error), BLUP (best linear unbiased predictor), TOST (two one-sided test method). 


Sign in / Sign up

Export Citation Format

Share Document