CED-Net: Crops and Weeds Segmentation for Smart Farming Using a Small Cascaded Encoder-Decoder Architecture

Abbas Khan; Talha Ilyas; Muhammad Umraiz; Zubaer Ibna Mannan; Hyongsuk Kim

doi:10.3390/electronics9101602

CED-Net: Crops and Weeds Segmentation for Smart Farming Using a Small Cascaded Encoder-Decoder Architecture

Electronics ◽

10.3390/electronics9101602 ◽

2020 ◽

Vol 9 (10) ◽

pp. 1602 ◽

Cited By ~ 2

Author(s):

Abbas Khan ◽

Talha Ilyas ◽

Muhammad Umraiz ◽

Zubaer Ibna Mannan ◽

Hyongsuk Kim

Keyword(s):

Agricultural Sector ◽

State Of The Art ◽

Human Life ◽

Semantic Segmentation ◽

Segmentation Method ◽

Training Time ◽

Smart Farming ◽

Decoder Architecture ◽

True Detection ◽

Coarse To Fine

Convolutional neural networks (CNNs) have achieved state-of-the-art performance in numerous aspects of human life, and the agricultural sector is no exception. One of the main objectives of deep learning for smart farming is to identify the precise location of weeds and crops on farmland. In this paper, we propose a semantic segmentation method based on a cascaded encoder-decoder network, namely CED-Net, to differentiate weeds from crops. The existing architectures for weed and crop segmentation are quite deep, with millions of parameters that require long training times. To overcome these limitations, we propose training small networks in cascade to obtain coarse-to-fine predictions, which are then combined to produce the final results. The proposed network is evaluated and compared with other state-of-the-art networks on four publicly available datasets: the rice seedling and weed dataset, the BoniRob dataset, the carrot crop vs. weed dataset, and a paddy–millet dataset. The experimental results show that the proposed network outperforms state-of-the-art architectures such as U-Net, SegNet, FCN-8s, and DeepLabv3 on the intersection over union (IoU), F1-score, sensitivity, true detection rate, and average precision metrics while using only 1/5.74 (U-Net), 1/5.77 (SegNet), 1/3.04 (FCN-8s), and 1/3.24 (DeepLabv3) of their total parameters.
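
Two of the metrics named in this abstract, IoU and F1-score, can be computed per class from a predicted and a reference label mask. The following is a minimal sketch in plain Python and is not the authors' evaluation code; the function name `iou_and_f1` and the class ids (0 = background, 1 = crop, 2 = weed) are assumptions made for illustration.

```python
def iou_and_f1(pred, target, cls):
    """Return (IoU, F1) for one class over two equally sized 2D label masks."""
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == cls and t == cls:
                tp += 1          # pixel correctly labelled as cls
            elif p == cls:
                fp += 1          # predicted cls, reference says otherwise
            elif t == cls:
                fn += 1          # reference cls missed by the prediction
    union = tp + fp + fn
    iou = tp / union if union else 1.0   # class absent from both masks
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 1.0
    return iou, f1


# Toy 2x3 masks; class ids are hypothetical: 0 = background, 1 = crop, 2 = weed.
pred = [[1, 1, 0],
        [2, 0, 0]]
target = [[1, 0, 0],
          [2, 2, 0]]
crop_iou, crop_f1 = iou_and_f1(pred, target, cls=1)
weed_iou, weed_f1 = iou_and_f1(pred, target, cls=2)
```

In this toy case each class has one true positive and one error, so both classes score an IoU of 0.5 and an F1 of 2/3.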

Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation

Sensors ◽

10.3390/s19245361 ◽

2019 ◽

Vol 19 (24) ◽

pp. 5361 ◽

Cited By ~ 6

Author(s):

Bruno Artacho ◽

Andreas Savakis

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

State Of The Art ◽

Semantic Segmentation ◽

Training Time ◽

Network Parameters ◽

Spatial Pooling ◽

Memory Footprint ◽

Accuracy Increase ◽

Spatial Pyramid

We propose a new efficient architecture for semantic segmentation, based on a “Waterfall” Atrous Spatial Pooling architecture, that achieves a considerable accuracy increase while decreasing the number of network parameters and memory footprint. The proposed Waterfall architecture leverages the efficiency of progressive filtering in the cascade architecture while maintaining multiscale fields-of-view comparable to spatial pyramid configurations. Additionally, our method does not rely on a postprocessing stage with Conditional Random Fields, which further reduces complexity and required training time. We demonstrate that the Waterfall approach with a ResNet backbone is a robust and efficient architecture for semantic segmentation obtaining state-of-the-art results with significant reduction in the number of parameters for the Pascal VOC dataset and the Cityscapes dataset.
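
To see how a cascade can keep a large field-of-view, compare the receptive fields of chained and parallel atrous (dilated) convolutions. This is a rough arithmetic sketch only; the 3×3 kernel and the dilation rates 6, 12, 18, 24 are assumptions chosen for illustration and are not taken from the abstract.

```python
def effective_kernel(kernel, rate):
    """Span covered by one atrous convolution: k + (k - 1) * (rate - 1)."""
    return kernel + (kernel - 1) * (rate - 1)


def parallel_fields(kernel, rates):
    """Receptive field of each branch when all branches read the same input."""
    return [effective_kernel(kernel, r) for r in rates]


def cascade_fields(kernel, rates):
    """Receptive field after each stage when stride-1 stages are chained."""
    fields, rf = [], 1
    for r in rates:
        rf += effective_kernel(kernel, r) - 1
        fields.append(rf)
    return fields


rates = [6, 12, 18, 24]               # assumed dilation rates
pyramid = parallel_fields(3, rates)   # [13, 25, 37, 49]
waterfall = cascade_fields(3, rates)  # [13, 37, 73, 121]
```

Under these assumptions, the intermediate outputs of the chain already span several scales, which is one way to read the claim that progressive filtering can retain multiscale fields-of-view.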

Preliminary Results on Different Text Processing Tasks Using Encoder-Decoder Networks and the Causal Feature Extractor

Applied Sciences ◽

10.3390/app10175772 ◽

2020 ◽

Vol 10 (17) ◽

pp. 5772

Author(s):

Adrián Javaloy ◽

Ginés García-Mateos

Keyword(s):

Language Processing ◽

Network Architecture ◽

State Of The Art ◽

Text Processing ◽

Short Term ◽

Training Time ◽

Preliminary Results ◽

Feature Extractor ◽

Decoder Architecture ◽

Different Types

Deep learning methods are gaining popularity in different application domains, especially in natural language processing. It is commonly believed that, given a large enough dataset and an adequate network architecture, almost any processing problem can be solved. A frequent and widely used topology is the encoder-decoder architecture, where the input data is transformed into an intermediate code by an encoder, and a decoder then takes this code to produce its output. Different types of networks can be used in the encoder and the decoder, depending on the problem of interest, such as convolutional neural networks (CNN) or long short-term memory (LSTM) networks. This paper uses a recently proposed method, the Causal Feature Extractor (CFE), for the encoder. It is based on causal convolutions (i.e., convolutions that depend only on one direction of the input), dilation (i.e., increasing the aperture size of the convolutions) and bidirectionality (i.e., independent networks in both directions). Some preliminary results are presented on three different tasks and compared with state-of-the-art methods: bilingual translation, LaTeX decompilation and audio transcription. The proposed method achieves promising results, showing that it can work with text, audio and images. Moreover, it has a shorter training time, requiring less time per iteration, and makes good use of attention mechanisms based on attention matrices.
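
The three ingredients listed in this abstract (causality, dilation, bidirectionality) can be illustrated on a 1D signal. This is a minimal sketch in plain Python and not the CFE implementation; the function names and the toy weights are hypothetical.

```python
def causal_conv1d(x, weights, dilation=1):
    """y[t] = sum_k weights[k] * x[t - k * dilation], with zeros before the start.

    The output at step t never reads inputs after t (causality); a larger
    dilation spreads the taps further apart (wider aperture).
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, w in enumerate(weights):
            idx = t - k * dilation
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


def bidirectional(x, fwd_weights, bwd_weights, dilation=1):
    """Independent causal passes left-to-right and right-to-left, paired per step."""
    forward = causal_conv1d(x, fwd_weights, dilation)
    backward = causal_conv1d(x[::-1], bwd_weights, dilation)[::-1]
    return list(zip(forward, backward))


signal = [1.0, 2.0, 3.0, 4.0]
dense = causal_conv1d(signal, [1.0, 1.0], dilation=1)    # [1, 3, 5, 7]
dilated = causal_conv1d(signal, [1.0, 1.0], dilation=2)  # [1, 2, 4, 6]
both = bidirectional(signal, [1.0, 1.0], [1.0, 1.0])
```

Changing the last input value leaves every earlier output of `causal_conv1d` unchanged, which is the property that makes the convolution causal.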

BU-Net: Brain Tumor Segmentation Using Modified U-Net Architecture

Electronics ◽

10.3390/electronics9122203 ◽

2020 ◽

Vol 9 (12) ◽

pp. 2203

Author(s):

Mobeen Ur Rehman ◽

SeungBin Cho ◽

Jee Hong Kim ◽

Kil To Chong

Keyword(s):

Brain Tumor ◽

State Of The Art ◽

Contextual Information ◽

High Grade Glioma ◽

Semantic Segmentation ◽

Tumor Segmentation ◽

Segmentation Method ◽

Brain Tumor Segmentation ◽

High Performing ◽

Treatment And Prevention

The semantic segmentation of a brain tumor is of paramount importance for its treatment and prevention. Recently, researchers have proposed various neural network-based architectures to improve the segmentation of brain tumor sub-regions. Brain tumor segmentation remains a challenging area of research in which performance still needs to improve. This paper proposes a 2D image segmentation method, BU-Net, to contribute to brain tumor segmentation research. Residual extended skip (RES) and wide context (WC) blocks are used along with a customized loss function in the baseline U-Net architecture. These modifications help find more diverse features by increasing the valid receptive field. Contextual information is extracted by aggregating features to obtain better segmentation performance. The proposed BU-Net was evaluated on the high-grade glioma (HGG) datasets of the BraTS 2017 Challenge and on the test datasets of the BraTS 2017 and 2018 Challenges. The three major labels to be segmented were tumor core (TC), whole tumor (WT), and enhancing core (EC). To compare performance quantitatively, the Dice score was used. The proposed BU-Net outperformed the existing state-of-the-art techniques. The high-performing BU-Net can make a valuable contribution to researchers in the fields of bioinformatics and medicine.
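
The Dice score for the three regions can be sketched as follows. This is an illustrative example, not the authors' evaluation code; it assumes the usual BraTS voxel labels (1 = necrotic/non-enhancing core, 2 = edema, 4 = enhancing tumor), which the abstract does not state, and the helper name `dice` is hypothetical.

```python
# Assumed BraTS label convention (not stated in the abstract):
# 1 = necrotic/non-enhancing core, 2 = edema, 4 = enhancing tumor, 0 = background.
REGIONS = {
    "WT": {1, 2, 4},  # whole tumor
    "TC": {1, 4},     # tumor core
    "EC": {4},        # enhancing core
}


def dice(pred, target, labels):
    """Dice = 2|P & T| / (|P| + |T|) for the region formed by `labels`.

    `pred` and `target` are flat lists of voxel labels of equal length.
    """
    p = [v in labels for v in pred]
    t = [v in labels for v in target]
    overlap = sum(1 for a, b in zip(p, t) if a and b)
    total = sum(p) + sum(t)
    return 2 * overlap / total if total else 1.0


pred = [0, 1, 2, 4, 4, 0]
target = [0, 1, 1, 4, 2, 0]
scores = {name: dice(pred, target, labels) for name, labels in REGIONS.items()}
```

Because the regions are nested, a prediction can score perfectly on WT (here 1.0) while still confusing the sub-regions (here 2/3 for both TC and EC).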

Multi-model Integrated Weakly Supervised Semantic Segmentation Method

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2019.17379 ◽

2019 ◽

Vol 31 (5) ◽

pp. 800

Author(s):

Changzhen Xiong ◽

Hui Zhi

Keyword(s):

Semantic Segmentation ◽

Segmentation Method ◽

Weakly Supervised

Utilizing semantic segmentation method with convolutional neural net to model a partial nephrectomy simulator for 3D printing

European Urology Supplements ◽

10.1016/s1569-9056(19)31264-3 ◽

2019 ◽

Vol 18 (1) ◽

pp. e1747-e1748

Author(s):

Y.S. Kyung ◽

B. Lim ◽

S.Y. Choi ◽

J.H. Han ◽

W. Lee ◽

...

Keyword(s):

3D Printing ◽

Partial Nephrectomy ◽

Semantic Segmentation ◽

Neural Net ◽

Segmentation Method

A progressive image semantic segmentation method using recurrent neural network

2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP) ◽

10.1109/icsp51882.2021.9408920 ◽

2021 ◽

Author(s):

Li Yi

Keyword(s):

Neural Network ◽

Recurrent Neural Network ◽

Semantic Segmentation ◽

Segmentation Method

Design and Analysis of Photovoltaic Powered Battery-Operated Computer Vision-Based Multi-Purpose Smart Farming Robot

Agronomy ◽

10.3390/agronomy11030530 ◽

2021 ◽

Vol 11 (3) ◽

pp. 530

Author(s):

Aneesh A. Chand ◽

Kushal A. Prasad ◽

Ellen Mar ◽

Sanaila Dakai ◽

Kabir A. Mamun ◽

...

Keyword(s):

Computer Vision ◽

Storage Tank ◽

Agricultural Sector ◽

Pesticide Use ◽

Vertical Orientation ◽

Power Profile ◽

Smart Farming ◽

Fully Automatic ◽

Solar Powered ◽

Spraying Process

Farm machinery such as water sprinklers (WS) and pesticide sprayers (PS) is becoming quite popular in the agricultural sector. WS and PS are two distinct types of machinery, mostly powered by conventional energy sources. Recently, battery- and solar-powered WS and PS have also emerged. The main drawback of current WS and PS is the lack of intelligence in water and pesticide use decisions and in autonomous control. This paper proposes a novel multi-purpose smart farming robot (MpSFR) that handles both water sprinkling and pesticide spraying. The MpSFR is a photovoltaic (PV) powered, battery-operated robot based on the internet of things (IoT) and computer vision (CV) that helps automate the watering and spraying process. Firstly, the PV-powered, battery-operated autonomous MpSFR is engineered, equipped with a storage tank for water and pesticide and driven by a programmed pumping device. The sprinkling and spraying mechanisms are made fully automatic with a programmed pattern that uses IoT sensors and CV to continuously monitor the soil moisture and the plant’s health with respect to pests. Two servo motors control the horizontal and vertical orientation of the spraying nozzle. An infrared device allows the sprayer to be switched remotely between water and pesticide within a 5-m range. Secondly, the operation of the developed MpSFR is experimentally verified on a test farm. The observed field-test results include the solar power profile and the battery charging and discharging conditions. The results show that the MpSFR operates effectively and that decisions on water and pesticide use are automated.
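
The kind of per-plant decision described in this abstract (soil moisture from IoT sensors, pest status from computer vision) can be sketched as a small rule. This is purely illustrative and not the MpSFR control software; the `Reading` type, the 0.30 moisture threshold, and the pest-first priority are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Reading:
    soil_moisture: float  # fraction in [0, 1] from a hypothetical moisture sensor
    pest_detected: bool   # result of a hypothetical computer-vision pest check


def choose_action(reading, moisture_threshold=0.30):
    """Return 'pesticide', 'water', or 'idle' for one plant.

    The threshold value and the choice to treat pests before watering
    are illustrative assumptions, not values from the paper.
    """
    if reading.pest_detected:
        return "pesticide"
    if reading.soil_moisture < moisture_threshold:
        return "water"
    return "idle"


actions = [
    choose_action(Reading(0.10, False)),  # dry soil, no pests
    choose_action(Reading(0.50, True)),   # moist soil, pests seen
    choose_action(Reading(0.50, False)),  # moist soil, no pests
]
```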

SketchGNN: Semantic Sketch Segmentation with Graph Neural Networks

ACM Transactions on Graphics ◽

10.1145/3450284 ◽

2021 ◽

Vol 40 (3) ◽

pp. 1-13

Author(s):

Lumin Yang ◽

Jiajie Zhuang ◽

Hongbo Fu ◽

Xiangzhi Wei ◽

Kun Zhou ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Large Scale ◽

State Of The Art ◽

Semantic Segmentation ◽

Structure Information ◽

Graph Neural Networks ◽

Node Labels ◽

Point Level

We introduce SketchGNN, a convolutional graph neural network for semantic segmentation and labeling of freehand vector sketches. We treat an input stroke-based sketch as a graph, with nodes representing the sampled points along input strokes and edges encoding the stroke structure information. To predict the per-node labels, SketchGNN uses graph convolution and a static-dynamic branching network architecture to extract features at three levels, i.e., point-level, stroke-level, and sketch-level. SketchGNN significantly improves on the accuracy of state-of-the-art methods for semantic sketch segmentation (by 11.2% in the pixel-based metric and 18.2% in the component-based metric on the large-scale, challenging SPG dataset) and has orders of magnitude fewer parameters than both image-based and sequence-based methods.
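
The graph construction described in this abstract (points as nodes, stroke structure as edges) can be sketched as follows. This is a minimal illustration and not the SketchGNN code; it assumes that "stroke structure" means linking consecutive points of the same stroke, and the function name `sketch_to_graph` is hypothetical.

```python
def sketch_to_graph(strokes):
    """Turn a list of strokes (each a list of (x, y) points) into a graph.

    Nodes are the sampled points. An undirected edge (i, i + 1) links
    consecutive points of the same stroke, so no edge crosses between strokes.
    `stroke_of[i]` records which stroke node i came from.
    """
    nodes, edges, stroke_of = [], [], []
    for stroke_id, stroke in enumerate(strokes):
        start = len(nodes)
        nodes.extend(stroke)
        stroke_of.extend([stroke_id] * len(stroke))
        edges.extend((i, i + 1) for i in range(start, start + len(stroke) - 1))
    return nodes, edges, stroke_of


strokes = [[(0, 0), (1, 0), (2, 0)], [(5, 5), (6, 6)]]
nodes, edges, stroke_of = sketch_to_graph(strokes)
```

The `stroke_of` list is what would let a model pool point-level features into stroke-level ones; how SketchGNN actually does this is not specified in the abstract.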

ART-UP: A Novel Method for Generating Scanning-Robust Aesthetic QR Codes

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3418214 ◽

2021 ◽

Vol 17 (1) ◽

pp. 1-23

Author(s):

Mingliang Xu ◽

Qingfeng Li ◽

Jianwei Niu ◽

Hao Su ◽

Xiting Liu ◽

...

Keyword(s):

State Of The Art ◽

Visual Quality ◽

Qr Code ◽

Quick Response ◽

Estimation Model ◽

Qr Codes ◽

Excellent Performance ◽

Novel Method ◽

Coarse To Fine

Quick response (QR) codes are usually scanned in different environments, so they must be robust to variations in illumination, scale, coverage, and camera angle. Aesthetic QR codes improve visual quality, but subtle changes in their appearance may cause scanning failure. In this article, a new method to generate scanning-robust aesthetic QR codes is proposed. It is based on a module-based scanning probability estimation model that can effectively balance the tradeoff between visual quality and scanning robustness. Our method locally adjusts the luminance of each module by estimating the probability of successful sampling. The approach adopts a hierarchical, coarse-to-fine strategy to enhance the visual quality of aesthetic QR codes, sequentially generating three codes: a binary aesthetic QR code, a grayscale aesthetic QR code, and the final color aesthetic QR code. Our approach can also be used to create QR codes with different visual styles by adjusting some initialization parameters. User surveys and decoding experiments comparing our method with state-of-the-art algorithms indicate that the proposed approach has excellent performance in terms of both visual quality and scanning robustness.

Towards Scalable Economic Photovoltaic Potential Analysis Using Aerial Images and Deep Learning

Energies ◽

10.3390/en14133800 ◽

2021 ◽

Vol 14 (13) ◽

pp. 3800

Author(s):

Sebastian Krapf ◽

Nils Kemmerzell ◽

Syed Khawaja Haseeb Uddin ◽

Manuel Hack Vázquez ◽

Fabian Netzler ◽

...

Keyword(s):

Deep Learning ◽

System Analysis ◽

State Of The Art ◽

Critical Role ◽

Semantic Segmentation ◽

Energy System ◽

Aerial Images ◽

Potential Analysis ◽

3D Data ◽

Challenges And Opportunities

Roof-mounted photovoltaic systems play a critical role in the global transition to renewable energy generation. An analysis of roof photovoltaic potential is an important tool for supporting decision-making and for accelerating new installations. State-of-the-art approaches use 3D data to conduct potential analyses with high spatial resolution, which limits the study area to places with available 3D data. Recent advances in deep learning allow the required roof information to be extracted from aerial images. Furthermore, most publications consider the technical photovoltaic potential, and only a few determine the economic photovoltaic potential. Therefore, this paper extends the state of the art by proposing and applying a methodology for scalable economic photovoltaic potential analysis using aerial images and deep learning. Two convolutional neural networks are trained for semantic segmentation of roof segments and superstructures and achieve Intersection over Union values of 0.84 and 0.64, respectively. We calculated the internal rate of return of each roof segment for 71 buildings in a small study area. A comparison of this paper’s methodology with a 3D-based analysis discusses its benefits and disadvantages. The proposed methodology uses only publicly available data and is potentially scalable to the global level. However, this poses a variety of research challenges and opportunities, which are summarized with a focus on the application of deep learning, economic photovoltaic potential analysis, and energy system analysis.
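
The internal rate of return mentioned in this abstract is the discount rate at which the net present value of a cash-flow series is zero. The following is a minimal sketch using bisection, not the authors' economic model; the cash flows are hypothetical and the function names are chosen for illustration.

```python
def npv(rate, cashflows):
    """Net present value of yearly cash flows, with cashflows[0] at year 0."""
    return sum(cf / (1 + rate) ** year for year, cf in enumerate(cashflows))


def irr(cashflows, low=-0.99, high=1.0, tol=1e-9):
    """Rate at which NPV is zero, found by bisection.

    Assumes NPV changes sign exactly once between `low` and `high`,
    which is typical for one upfront cost followed by positive returns.
    """
    if npv(low, cashflows) * npv(high, cashflows) > 0:
        raise ValueError("no sign change in the search interval")
    while high - low > tol:
        mid = (low + high) / 2
        if npv(low, cashflows) * npv(mid, cashflows) <= 0:
            high = mid   # the root lies in [low, mid]
        else:
            low = mid    # the root lies in [mid, high]
    return (low + high) / 2


# Hypothetical roof segment: 1000 invested in year 0, 1100 returned in year 1.
rate = irr([-1000.0, 1100.0])
```

For this toy series the rate is about 0.10, since 1100 / 1.10 exactly offsets the initial 1000.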
