Semantic Segmentation Using Deep Learning with Vegetation Indices for Rice Lodging Identification in Multi-date UAV Visible Images

Ming-Der Yang; Hsin-Hung Tseng; Yu-Chun Hsu; Hui Ping Tsai

doi:10.3390/rs12040633

Semantic Segmentation Using Deep Learning with Vegetation Indices for Rice Lodging Identification in Multi-date UAV Visible Images

Remote Sensing ◽

10.3390/rs12040633 ◽

2020 ◽

Vol 12 (4) ◽

pp. 633 ◽

Cited By ~ 16

Author(s):

Ming-Der Yang ◽

Hsin-Hung Tseng ◽

Yu-Chun Hsu ◽

Hui Ping Tsai

Keyword(s):

Deep Learning ◽

Large Scale ◽

Vegetation Indices ◽

Semantic Segmentation ◽

Likelihood Method ◽

Large Area ◽

Computation Efficiency ◽

Proposed Model ◽

Visible Images ◽

Uav Images

A rapid and precise large-scale agricultural disaster survey is a basis for agricultural disaster relief and insurance but is labor-intensive and time-consuming. This study applies Unmanned Aerial Vehicles (UAVs) images through deep-learning image processing to estimate the rice lodging in paddies over a large area. This study establishes an image semantic segmentation model employing two neural network architectures, FCN-AlexNet, and SegNet, whose effects are explored in the interpretation of various object sizes and computation efficiency. Commercial UAVs imaging rice paddies in high-resolution visible images are used to calculate three vegetation indicators to improve the applicability of visible images. The proposed model was trained and tested on a set of UAV images in 2017 and was validated on a set of UAV images in 2019. For the identification of rice lodging on the 2017 UAV images, the F1-score reaches 0.80 and 0.79 for FCN-AlexNet and SegNet, respectively. The F1-score of FCN-AlexNet using RGB + ExGR combination also reaches 0.78 in the 2019 images for validation. The proposed model adopting semantic segmentation networks is proven to have better efficiency, approximately 10 to 15 times faster, and a lower misinterpretation rate than that of the maximum likelihood method.

Download Full-text

Encoder-Decoder Architecture for Ultrasound IMC Segmentation and cIMT Measurement

Sensors ◽

10.3390/s21206839 ◽

2021 ◽

Vol 21 (20) ◽

pp. 6839

Author(s):

Aisha Al-Mohannadi ◽

Somaya Al-Maadeed ◽

Omar Elharrouss ◽

Kishor Kumar Sadasivuni

Keyword(s):

Deep Learning ◽

Early Diagnosis ◽

Large Scale ◽

Intima Media Thickness ◽

Semantic Segmentation ◽

Learning Techniques ◽

Proposed Model ◽

Decoder Architecture ◽

Huge Impact ◽

Good Learning

Cardiovascular diseases (CVDs) have shown a huge impact on the number of deaths in the world. Thus, common carotid artery (CCA) segmentation and intima-media thickness (IMT) measurements have been significantly implemented to perform early diagnosis of CVDs by analyzing IMT features. Using computer vision algorithms on CCA images is not widely used for this type of diagnosis, due to the complexity and the lack of dataset to do it. The advancement of deep learning techniques has made accurate early diagnosis from images possible. In this paper, a deep-learning-based approach is proposed to apply semantic segmentation for intima-media complex (IMC) and to calculate the cIMT measurement. In order to overcome the lack of large-scale datasets, an encoder-decoder-based model is proposed using multi-image inputs that can help achieve good learning for the model using different features. The obtained results were evaluated using different image segmentation metrics which demonstrate the effectiveness of the proposed architecture. In addition, IMT thickness is computed, and the experiment showed that the proposed model is robust and fully automated compared to the state-of-the-art work.

Download Full-text

Multi Disease-Prediction Framework Using Hybrid Deep Learning: An Optimal Prediction Model (Preprint)

10.2196/preprints.22865 ◽

2020 ◽

Author(s):

Anusha Ampavathi ◽

Vijaya Saradhi T

Keyword(s):

Feature Extraction ◽

Big Data ◽

Deep Learning ◽

Weight Function ◽

Optimization Algorithm ◽

Large Scale ◽

Heuristic Algorithms ◽

Disease Prediction ◽

Health Care Decisions ◽

Proposed Model

UNSTRUCTURED Big data and its approaches are generally helpful for healthcare and biomedical sectors for predicting the disease. For trivial symptoms, the difficulty is to meet the doctors at any time in the hospital. Thus, big data provides essential data regarding the diseases on the basis of the patient’s symptoms. For several medical organizations, disease prediction is important for making the best feasible health care decisions. Conversely, the conventional medical care model offers input as structured that requires more accurate and consistent prediction. This paper is planned to develop the multi-disease prediction using the improvised deep learning concept. Here, the different datasets pertain to “Diabetes, Hepatitis, lung cancer, liver tumor, heart disease, Parkinson’s disease, and Alzheimer’s disease”, from the benchmark UCI repository is gathered for conducting the experiment. The proposed model involves three phases (a) Data normalization (b) Weighted normalized feature extraction, and (c) prediction. Initially, the dataset is normalized in order to make the attribute's range at a certain level. Further, weighted feature extraction is performed, in which a weight function is multiplied with each attribute value for making large scale deviation. Here, the weight function is optimized using the combination of two meta-heuristic algorithms termed as Jaya Algorithm-based Multi-Verse Optimization algorithm (JA-MVO). The optimally extracted features are subjected to the hybrid deep learning algorithms like “Deep Belief Network (DBN) and Recurrent Neural Network (RNN)”. As a modification to hybrid deep learning architecture, the weight of both DBN and RNN is optimized using the same hybrid optimization algorithm. Further, the comparative evaluation of the proposed prediction over the existing models certifies its effectiveness through various performance measures.

Download Full-text

Identifying the Branch of Kiwifruit Based on Unmanned Aerial Vehicle (UAV) Images Using Deep Learning Method

Sensors ◽

10.3390/s21134442 ◽

2021 ◽

Vol 21 (13) ◽

pp. 4442

Author(s):

Zijie Niu ◽

Juntao Deng ◽

Xu Zhang ◽

Jun Zhang ◽

Shijia Pan ◽

...

Keyword(s):

Deep Learning ◽

Unmanned Aerial Vehicle ◽

Semantic Segmentation ◽

Dynamic Monitoring ◽

Support Vector ◽

Distribution Maps ◽

Time Operation ◽

Aerial Vehicle ◽

Uav Images ◽

Segmentation Image

It is important to obtain accurate information about kiwifruit vines to monitoring their physiological states and undertake precise orchard operations. However, because vines are small and cling to trellises, and have branches laying on the ground, numerous challenges exist in the acquisition of accurate data for kiwifruit vines. In this paper, a kiwifruit canopy distribution prediction model is proposed on the basis of low-altitude unmanned aerial vehicle (UAV) images and deep learning techniques. First, the location of the kiwifruit plants and vine distribution are extracted from high-precision images collected by UAV. The canopy gradient distribution maps with different noise reduction and distribution effects are generated by modifying the threshold and sampling size using the resampling normalization method. The results showed that the accuracies of the vine segmentation using PSPnet, support vector machine, and random forest classification were 71.2%, 85.8%, and 75.26%, respectively. However, the segmentation image obtained using depth semantic segmentation had a higher signal-to-noise ratio and was closer to the real situation. The average intersection over union of the deep semantic segmentation was more than or equal to 80% in distribution maps, whereas, in traditional machine learning, the average intersection was between 20% and 60%. This indicates the proposed model can quickly extract the vine distribution and plant position, and is thus able to perform dynamic monitoring of orchards to provide real-time operation guidance.

Download Full-text

A Deep Learning Approach on Building Detection from Unmanned Aerial Vehicle-Based Images in Riverbank Monitoring

Sensors ◽

10.3390/s18113921 ◽

2018 ◽

Vol 18 (11) ◽

pp. 3921 ◽

Cited By ~ 17

Author(s):

Wuttichai Boonpook ◽

Yumin Tan ◽

Yinghua Ye ◽

Peerapong Torteeka ◽

Kritanai Torsri ◽

...

Keyword(s):

Deep Learning ◽

Network Architecture ◽

Semantic Segmentation ◽

Water Levels ◽

Building Detection ◽

Network Training ◽

Building Information ◽

Open Standard ◽

Uav Images ◽

Uav Image

Buildings along riverbanks are likely to be affected by rising water levels, therefore the acquisition of accurate building information has great importance not only for riverbank environmental protection but also for dealing with emergency cases like flooding. UAV-based photographs are flexible and cloud-free compared to satellite images and can provide very high-resolution images up to centimeter level, while there exist great challenges in quickly and accurately detecting and extracting building from UAV images because there are usually too many details and distortions on UAV images. In this paper, a deep learning (DL)-based approach is proposed for more accurately extracting building information, in which the network architecture, SegNet, is used in the semantic segmentation after the network training on a completely labeled UAV image dataset covering multi-dimension urban settlement appearances along a riverbank area in Chongqing. The experiment results show that an excellent performance has been obtained in the detection of buildings from untrained locations with an average overall accuracy more than 90%. To verify the generality and advantage of the proposed method, the procedure is further evaluated by training and testing with another two open standard datasets which have a variety of building patterns and styles, and the final overall accuracies of building extraction are more than 93% and 95%, respectively.

Download Full-text

Real-Time Vehicle Make and Model Recognition with the Residual SqueezeNet Architecture

Sensors ◽

10.3390/s19050982 ◽

2019 ◽

Vol 19 (5) ◽

pp. 982 ◽

Cited By ~ 9

Author(s):

Hyo Lee ◽

Ihsan Ullah ◽

Weiguo Wan ◽

Yongbin Gao ◽

Zhijun Fang

Keyword(s):

Deep Learning ◽

Real Time ◽

Large Scale ◽

Recognition Rate ◽

Experimental Results ◽

Learning Approach ◽

Deep Model ◽

Proposed Model ◽

Real Time Applications ◽

Model Recognition

Make and model recognition (MMR) of vehicles plays an important role in automatic vision-based systems. This paper proposes a novel deep learning approach for MMR using the SqueezeNet architecture. The frontal views of vehicle images are first extracted and fed into a deep network for training and testing. The SqueezeNet architecture with bypass connections between the Fire modules, a variant of the vanilla SqueezeNet, is employed for this study, which makes our MMR system more efficient. The experimental results on our collected large-scale vehicle datasets indicate that the proposed model achieves 96.3% recognition rate at the rank-1 level with an economical time slice of 108.8 ms. For inference tasks, the deployed deep model requires less than 5 MB of space and thus has a great viability in real-time applications.

Download Full-text

Visual Saliency Prediction Based on Deep Learning

Information ◽

10.3390/info10080257 ◽

2019 ◽

Vol 10 (8) ◽

pp. 257 ◽

Cited By ~ 7

Author(s):

Bashir Ghariba ◽

Mohamed S. Shehata ◽

Peter McGuire

Keyword(s):

Deep Learning ◽

Saliency Detection ◽

Visual Saliency ◽

Semantic Segmentation ◽

Input Image ◽

Human Eye ◽

Proposed Model ◽

Global Accuracy ◽

Visual Saliency Detection ◽

Deep Learning Model

Human eye movement is one of the most important functions for understanding our surroundings. When a human eye processes a scene, it quickly focuses on dominant parts of the scene, commonly known as a visual saliency detection or visual attention prediction. Recently, neural networks have been used to predict visual saliency. This paper proposes a deep learning encoder-decoder architecture, based on a transfer learning technique, to predict visual saliency. In the proposed model, visual features are extracted through convolutional layers from raw images to predict visual saliency. In addition, the proposed model uses the VGG-16 network for semantic segmentation, which uses a pixel classification layer to predict the categorical label for every pixel in an input image. The proposed model is applied to several datasets, including TORONTO, MIT300, MIT1003, and DUT-OMRON, to illustrate its efficiency. The results of the proposed model are quantitatively and qualitatively compared to classic and state-of-the-art deep learning models. Using the proposed deep learning model, a global accuracy of up to 96.22% is achieved for the prediction of visual saliency.

Download Full-text

A Deep-Learning-Based Approach for Wheat Yellow Rust Disease Recognition from Unmanned Aerial Vehicle Images

Sensors ◽

10.3390/s21196540 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6540

Author(s):

Qian Pan ◽

Maofang Gao ◽

Pingbo Wu ◽

Jingwen Yan ◽

Shilei Li

Keyword(s):

Deep Learning ◽

Unmanned Aerial Vehicle ◽

Recognition Accuracy ◽

Yellow Rust ◽

Semantic Segmentation ◽

High Accuracy ◽

Economic Losses ◽

Support Vector ◽

Aerial Vehicle ◽

Uav Images

Yellow rust is a disease with a wide range that causes great damage to wheat. The traditional method of manually identifying wheat yellow rust is very inefficient. To improve this situation, this study proposed a deep-learning-based method for identifying wheat yellow rust from unmanned aerial vehicle (UAV) images. The method was based on the pyramid scene parsing network (PSPNet) semantic segmentation model to classify healthy wheat, yellow rust wheat, and bare soil in small-scale UAV images, and to investigate the spatial generalization of the model. In addition, it was proposed to use the high-accuracy classification results of traditional algorithms as weak samples for wheat yellow rust identification. The recognition accuracy of the PSPNet model in this study reached 98%. On this basis, this study used the trained semantic segmentation model to recognize another wheat field. The results showed that the method had certain generalization ability, and its accuracy reached 98%. In addition, the high-accuracy classification result of a support vector machine was used as a weak label by weak supervision, which better solved the labeling problem of large-size images, and the final recognition accuracy reached 94%. Therefore, the present study method facilitated timely control measures to reduce economic losses.

Download Full-text

Deep-Kcr: accurate detection of lysine crotonylation sites using deep learning method

Briefings in Bioinformatics ◽

10.1093/bib/bbaa255 ◽

2020 ◽

Author(s):

Hao Lv ◽

Fu-Ying Dao ◽

Zheng-Xing Guan ◽

Hui Yang ◽

Yan-Wen Li ◽

...

Keyword(s):

Deep Learning ◽

Large Scale ◽

Short Term Memory ◽

Information Gain ◽

Independent Set ◽

Cost Effective ◽

Cellular Regulation ◽

Proposed Model ◽

Experimental Approaches ◽

Memory Network

Abstract As a newly discovered protein posttranslational modification, histone lysine crotonylation (Kcr) involved in cellular regulation and human diseases. Various proteomics technologies have been developed to detect Kcr sites. However, experimental approaches for identifying Kcr sites are often time-consuming and labor-intensive, which is difficult to widely popularize in large-scale species. Computational approaches are cost-effective and can be used in a high-throughput manner to generate relatively precise identification. In this study, we develop a deep learning-based method termed as Deep-Kcr for Kcr sites prediction by combining sequence-based features, physicochemical property-based features and numerical space-derived information with information gain feature selection. We investigate the performances of convolutional neural network (CNN) and five commonly used classifiers (long short-term memory network, random forest, LogitBoost, naive Bayes and logistic regression) using 10-fold cross-validation and independent set test. Results show that CNN could always display the best performance with high computational efficiency on large dataset. We also compare the Deep-Kcr with other existing tools to demonstrate the excellent predictive power and robustness of our method. Based on the proposed model, a webserver called Deep-Kcr was established and is freely accessible at http://lin-group.cn/server/Deep-Kcr.

Download Full-text

3D semantic segmentation using deep learning for large-scale indoor point cloud

2019 14th IEEE International Conference on Electronic Measurement & Instruments (ICEMI) ◽

10.1109/icemi46757.2019.9101756 ◽

2019 ◽

Author(s):

Chen Hui ◽

Xu Peng ◽

Zuo Yipeng ◽

Wang Weina

Keyword(s):

Deep Learning ◽

Point Cloud ◽

Large Scale ◽

Semantic Segmentation

Download Full-text

Deep person re-identification in UAV images

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-019-0647-z ◽

2019 ◽

Vol 2019 (1) ◽

Author(s):

Aleksei Grigorev ◽

Zhihong Tian ◽

Seungmin Rho ◽

Jianxin Xiong ◽

Shaohui Liu ◽

...

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Group Learning ◽

Identification Problem ◽

Gaussian Mixture ◽

Surveillance Systems ◽

Deep Convolutional Neural Networks ◽

Recent Success ◽

Proposed Model ◽

Uav Images

AbstractThe person re-identification is one of the most significant problems in computer vision and surveillance systems. The recent success of deep convolutional neural networks in image classification has inspired researchers to investigate the application of deep learning to the person re-identification. However, the huge amount of research on this problem considers classical settings, where pedestrians are captured by static surveillance cameras, although there is a growing demand for analyzing images and videos taken by drones. In this paper, we aim at filling this gap and provide insights on the person re-identification from drones. To our knowledge, it is the first attempt to tackle this problem under such constraints. We present the person re-identification dataset, named DRone HIT (DRHIT01), which is collected by using a drone. It contains 101 unique pedestrians, which are annotated with their identities. Each pedestrian has about 500 images. We propose to use a combination of triplet and large-margin Gaussian mixture (L-GM) loss to tackle the drone-based person re-identification problem. The proposed network equipped with multi-branch design, channel group learning, and combination of loss functions is evaluated on the DRHIT01 dataset. Besides, transfer learning from the most popular person re-identification datasets is evaluated. Experiment results demonstrate the importance of transfer learning and show that the proposed model outperforms the classic deep learning approach.

Download Full-text