A Novel Segmentation Method for Furnace Flame Using Adaptive Color Model and Hybrid-Coded HLO

Complexity ◽

10.1155/2021/3027126 ◽

2021 ◽

Vol 2021 ◽

pp. 1-16

Author(s):

Pinggai Zhang ◽

Minrui Fei ◽

Ling Wang ◽

Xian Wu ◽

Chen Peng ◽

...

Keyword(s):

State Of The Art ◽

Color Model ◽

Model Parameters ◽

Detection Accuracy ◽

Optimal Parameters ◽

Segmentation Method ◽

Safe Operation ◽

False Detection ◽

High Detection ◽

Optimal Set

In recent years, the combustion furnace has been widely applied in many different fields of industrial technology, and the accurate detection of combustion states can effectively help operators adjust combustion strategies to improve combustion utilization and ensure safe operation. However, the combustion states inside the industrial furnace change according to the production needs, which further challenges the optimal set of model parameters. To effectively segment the flame pixels, a novel segmentation method for furnace flame using adaptive color model and hybrid-coded human learning optimization (AHcHLO) is proposed. A new adaptive color model with mixed variables (NACMM) is designed for adapting to different combustion states, and the AHcHLO is developed to search for the optimal parameters of NACMM. Then, the best NACMM with optimal parameters is adopted to segment the combustion flame image more precisely and effectively. Finally, the experiment results show that the developed AHcHLO obtains the best-known overall results so far on benchmark functions and the proposed NACMM outperforms state-of-the-art flame segmentation approaches, providing a high detection accuracy and a low false detection rate.


Real-Time Scene Text Detection with Differentiable Binarization

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6812 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11474-11481 ◽

Cited By ~ 9

Author(s):

Minghui Liao ◽

Zhaoyi Wan ◽

Cong Yao ◽

Kai Chen ◽

Xiang Bai

Keyword(s):

State Of The Art ◽

Text Detection ◽

Detection Accuracy ◽

Post Processing ◽

Segmentation Method ◽

Performance Improvements ◽

Scene Text Detection ◽

Scene Text ◽

Benchmark Datasets ◽

Bounding Boxes

Recently, segmentation-based methods have become quite popular in scene text detection, as the segmentation results can more accurately describe scene text of various shapes, such as curved text. However, the post-processing of binarization is essential for segmentation-based detection, which converts probability maps produced by a segmentation method into bounding boxes/regions of text. In this paper, we propose a module named Differentiable Binarization (DB), which can perform the binarization process in a segmentation network. Optimized along with a DB module, a segmentation network can adaptively set the thresholds for binarization, which not only simplifies the post-processing but also enhances the performance of text detection. Based on a simple segmentation network, we validate the performance improvements of DB on five benchmark datasets, where it consistently achieves state-of-the-art results in terms of both detection accuracy and speed. In particular, with a lightweight backbone, the performance improvements by DB are significant, so that we can look for an ideal tradeoff between detection accuracy and efficiency. Specifically, with a backbone of ResNet-18, our detector achieves an F-measure of 82.8, running at 62 FPS, on the MSRA-TD500 dataset. Code is available at: https://github.com/MhLiao/DB.
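As a rough illustration of the idea in this abstract, the hard threshold can be replaced by a steep sigmoid so that gradients can flow through the binarization step. The sketch below uses the approximate step function B = 1 / (1 + exp(-k (P - T))) with amplification factor k = 50, as described in the DB paper; the function names and the toy maps are assumptions made for illustration and are not taken from the authors' code.

```python
import math


def hard_binarize(p, t=0.5):
    """Standard (non-differentiable) binarization of one probability value."""
    return 1.0 if p >= t else 0.0


def differentiable_binarize(p, t, k=50.0):
    """Steep sigmoid that approximates the hard step around threshold t."""
    return 1.0 / (1.0 + math.exp(-k * (p - t)))


def db_map(prob_map, thresh_map, k=50.0):
    """Apply the approximate binarization pixel by pixel.

    prob_map and thresh_map are same-sized 2D lists; in a real network both
    would be predicted, here they are plain numbers for illustration.
    """
    return [
        [differentiable_binarize(p, t, k) for p, t in zip(p_row, t_row)]
        for p_row, t_row in zip(prob_map, thresh_map)
    ]


# Toy example: a per-pixel threshold lets the same probability land on
# different sides of the decision.
probs = [[0.4, 0.4], [0.9, 0.1]]
thresholds = [[0.3, 0.5], [0.3, 0.3]]
approx_binary = db_map(probs, thresholds)
```

Because the sigmoid is smooth, the threshold map can be learned jointly with the probability map, which is the adaptive-threshold behaviour the abstract describes.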


Object Detection with the Addition of New Classes Based on the Method of RNOL

Mathematical Problems in Engineering ◽

10.1155/2020/9205373 ◽

2020 ◽

Vol 2020 ◽

pp. 1-6

Author(s):

Haiquan Fang ◽

Feijia Zhu

Keyword(s):

Object Detection ◽

State Of The Art ◽

Fine Tuning ◽

Detection Methods ◽

Detection Accuracy ◽

Important Research ◽

Training Time ◽

Tuning Method ◽

High Detection ◽

Computer Vision Applications

Object detection plays an important role in many computer vision applications. Innovative object detection methods based on deep learning such as Faster R-CNN, YOLO, and SSD have achieved state-of-the-art results in terms of detection accuracy. However, there have been few studies to date on object detection with the addition of new classes, although this problem is often encountered in industry. Therefore, this issue has important research significance and practical value. On the premise that the old class samples are available, a method of reserving nodes in advance in the output layer (RNOL) was established in this study. Experiments show that RNOL can achieve high detection accuracy in both new and old classes over a short training time while outperforming the traditional fine-tuning method.
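The abstract does not say how the reserved nodes are handled, so the following is only a conceptual sketch of one way "reserving nodes in advance in the output layer" could work: the output layer is built with spare class slots, the spare logits are masked out until new classes arrive, and the layer shape never changes. The class name `ReservedOutputLayer` and the masking strategy are assumptions for illustration, not the authors' implementation.

```python
import math


class ReservedOutputLayer:
    """Classification head with spare output nodes (hypothetical sketch)."""

    def __init__(self, num_active, num_reserved):
        self.num_active = num_active                 # classes currently in use
        self.capacity = num_active + num_reserved    # fixed layer width

    def activate(self, num_new):
        """Turn reserved nodes into real classes without resizing the layer."""
        if self.num_active + num_new > self.capacity:
            raise ValueError("not enough reserved nodes")
        self.num_active += num_new

    def probabilities(self, logits):
        """Softmax over active nodes only; reserved nodes get probability 0."""
        if len(logits) != self.capacity:
            raise ValueError("logits must match the layer width")
        active = logits[: self.num_active]
        peak = max(active)  # subtract the max for numerical stability
        exps = [math.exp(x - peak) for x in active]
        total = sum(exps)
        reserved = [0.0] * (self.capacity - self.num_active)
        return [e / total for e in exps] + reserved


head = ReservedOutputLayer(num_active=2, num_reserved=1)
before = head.probabilities([0.0, 0.0, 5.0])  # third node is still reserved
head.activate(1)                              # a new class is added
after = head.probabilities([0.0, 0.0, 5.0])   # third node now competes
```

Keeping the layer width fixed means the weights learned for the old classes stay in place when a new class is switched on, which is one plausible reason such a scheme could train faster than rebuilding and fine-tuning the head.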


Detection of Malicious Spatial-Domain Steganography over Noisy Channels Using Convolutional Neural Networks

Electronic Imaging ◽

10.2352/issn.2470-1173.2020.4.mwsf-076 ◽

2020 ◽

Vol 2020 (4) ◽

pp. 76-1-76-7

Author(s):

Swaroop Shankar Prasad ◽

Ofer Hadar ◽

Ilia Polian

Keyword(s):

State Of The Art ◽

Visual Quality ◽

Channel Noise ◽

Detection Accuracy ◽

Noisy Channel ◽

Noisy Channels ◽

Reliable Transmission ◽

Reliable Detection ◽

Natural Noise ◽

Will Force

Image steganography can have legitimate uses, for example, augmenting an image with a watermark for copyright reasons, but can also be utilized for malicious purposes. We investigate the detection of malicious steganography using neural network-based classification when images are transmitted through a noisy channel. Noise makes detection harder because the classifier must not only detect perturbations in the image but also decide whether they are due to the malicious steganographic modifications or due to natural noise. Our results show that reliable detection is possible even for state-of-the-art steganographic algorithms that insert stego bits not affecting an image’s visual quality. The detection accuracy is high (above 85%) if the payload, or the amount of the steganographic content in an image, exceeds a certain threshold. At the same time, noise critically affects the steganographic information being transmitted, both through desynchronization (destruction of the information about which bits of the image contain steganographic information) and by flipping these bits themselves. This will force the adversary to use a redundant encoding with a substantial number of error-correction bits for reliable transmission, making detection feasible even for small payloads.


Smoke recognition network based on dynamic characteristics

International Journal of Advanced Robotic Systems ◽

10.1177/1729881420925662 ◽

2020 ◽

Vol 17 (3) ◽

pp. 172988142092566

Author(s):

Dahan Wang ◽

Sheng Luo ◽

Li Zhao ◽

Xiaoming Pan ◽

Muchou Wang ◽

...

Keyword(s):

Dynamic Characteristics ◽

State Of The Art ◽

The State ◽

Detection Accuracy ◽

Static Characteristics ◽

Good Tool ◽

Early Signal ◽

Fuzzy Objects ◽

The Difference ◽

Smoke Recognition

Fire is a fierce disaster, and smoke is the early signal of fire. Since features of smoke such as chrominance, texture, and shape are very distinctive, many methods based on these features have been developed. But these static characteristics vary widely, so there are exceptions that lead to low detection accuracy. On the other hand, the motion of smoke is much more discriminating than the aforementioned features, so a time-domain neural network is proposed to extract its dynamic characteristics. This smoke recognition network has the following advantages: (1) it extracts spatiotemporal features with 3D filters, which work on dynamic and static characteristics simultaneously; (2) high accuracy, with 87.31% of samples classified correctly, which is the state of the art even in chaotic environments, and objects that are ambiguous for other methods, such as haze, fog, and climbing cars, are clearly distinguished; (3) high sensitivity, with smoke detected on average at the 23rd frame, which is also the state of the art and is meaningful for raising an early fire alarm as soon as possible; and (4) it is not based on any hypothesis, which keeps the method broadly compatible. Finally, a new metric, the difference between the first frame in which smoke is detected and the first frame in which smoke appears, is proposed to compare the sensitivity of algorithms on videos. The experiments confirm that the dynamic characteristics are more discriminating than the aforementioned static characteristics, and that the smoke recognition network is a good tool for extracting compound features.
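The sensitivity metric described at the end of this abstract can be sketched in a few lines. The function below is an illustrative reading of that definition: it takes per-frame detector decisions and the ground-truth frame where smoke first appears, and returns how many frames pass before the first detection. Ignoring detections that occur before the smoke onset (treating them as false alarms) and returning `None` for a miss are assumptions not stated in the abstract; the name `detection_delay` is hypothetical.

```python
def detection_delay(predictions, smoke_onset_frame):
    """Frames between smoke onset and the first positive detection.

    predictions: list of booleans, one per video frame (index = frame number).
    smoke_onset_frame: ground-truth index of the first frame containing smoke.
    Returns None if smoke is never detected at or after the onset.
    """
    for frame_idx in range(smoke_onset_frame, len(predictions)):
        if predictions[frame_idx]:
            return frame_idx - smoke_onset_frame
    return None


def mean_delay(delays):
    """Average delay over the videos in which smoke was detected at all."""
    found = [d for d in delays if d is not None]
    return sum(found) / len(found) if found else None
```

A smaller delay means an earlier alarm, so two detectors can be compared on the same set of videos by comparing their mean delays.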


Transcription Alignment of Historical Vietnamese Manuscripts without Human-Annotated Learning Samples

Applied Sciences ◽

10.3390/app11114894 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4894

Author(s):

Anna Scius-Bertrand ◽

Michael Jungo ◽

Beat Wolf ◽

Andreas Fischer ◽

Marc Bui

Keyword(s):

Object Detection ◽

State Of The Art ◽

Positive Impact ◽

Detection System ◽

Training Data ◽

Detection Accuracy ◽

Current State ◽

Alignment Task ◽

Scanned Image ◽

Automatic Transcription

The current state of the art for automatic transcription of historical manuscripts is typically limited by the requirement of human-annotated learning samples, which are necessary to train specific machine learning models for specific languages and scripts. Transcription alignment is a simpler task that aims to find a correspondence between text in the scanned image and its existing Unicode counterpart, a correspondence which can then be used as training data. The alignment task can be approached with heuristic methods dedicated to certain types of manuscripts, or with weakly trained systems reducing the required amount of annotations. In this article, we propose a novel learning-based alignment method based on fully convolutional object detection that does not require any human annotation at all. Instead, the object detection system is initially trained on synthetic printed pages using a font and then adapted to the real manuscripts by means of self-training. On a dataset of historical Vietnamese handwriting, we demonstrate the feasibility of annotation-free alignment as well as the positive impact of self-training on the character detection accuracy, reaching a detection accuracy of 96.4% with a YOLOv5m model without using any human annotation.


The use of remote sensing satellite using deep learning in emergency monitoring of high-level landslides disaster in Jinsha River

The Journal of Supercomputing ◽

10.1007/s11227-020-03604-4 ◽

2021 ◽

Author(s):

Leijin Long ◽

Feng He ◽

Hongjiang Liu

Keyword(s):

Remote Sensing ◽

Southwest China ◽

Influence Factors ◽

Classification Error ◽

Model Parameters ◽

Detection Accuracy ◽

Remote Sensing Images ◽

Jinsha River ◽

Detection Model ◽

High Level

In order to monitor the high-level landslides frequently occurring in the Jinsha River area of Southwest China, and to protect the lives and property of people in mountainous areas, satellite remote sensing image data are combined with various factors inducing landslides and transformed into landslide influence factors, which provide a data basis for establishing a landslide detection model. Then, based on the deep belief network (DBN) and convolutional neural network (CNN) algorithms, two landslide detection models, DBN and convolutional neural-deep belief network (CDN), are established to monitor high-level landslides in the Jinsha River area. The influence of the model parameters on the landslide detection results is analyzed, and the accuracy of the DBN and CDN models in dealing with actual landslide problems is compared. The results show that when the number of neurons in the DBN is 100, the overall error is the minimum, and when the number of learning layers is 3, the classification error is the minimum. The detection accuracy of DBN and CDN is 97.56% and 97.63%, respectively, which indicates that both DBN and CDN models are feasible in dealing with landslides from remote sensing images. This exploration provides a reference for the study of high-level landslide disasters in the Jinsha River area.


Malaria parasite detection in thick blood smear microscopic images using modified YOLOV3 and YOLOV4 models

BMC Bioinformatics ◽

10.1186/s12859-021-04036-4 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Fetulhak Abdurahman ◽

Kinde Anlay Fante ◽

Mohammed Aliy

Keyword(s):

Object Detection ◽

Malaria Parasite ◽

Blood Smear ◽

Clustering Algorithm ◽

State Of The Art ◽

Detection Accuracy ◽

Small Object ◽

Thick Blood Smear ◽

Malaria Parasites ◽

Microscopic Images

Background: Manual microscopic examination of Leishman/Giemsa-stained thin and thick blood smears is still the “gold standard” for malaria diagnosis. One of the drawbacks of this method is that its accuracy, consistency, and diagnosis speed depend on microscopists’ diagnostic and technical skills. It is difficult to get highly skilled microscopists in remote areas of developing countries. To alleviate this problem, in this paper, we propose to investigate state-of-the-art one-stage and two-stage object detection algorithms for automated malaria parasite screening from microscopic images of thick blood slides. Results: YOLOV3 and YOLOV4 models, which are state-of-the-art object detectors in accuracy and speed, are not optimized for detecting small objects such as malaria parasites in microscopic images. We modify these models by increasing feature scale and adding more detection layers to enhance their capability of detecting small objects without notably decreasing detection speed. We propose one modified YOLOV4 model, called YOLOV4-MOD, and two modified models of YOLOV3, which are called YOLOV3-MOD1 and YOLOV3-MOD2. In addition, new anchor box sizes are generated using the K-means clustering algorithm to exploit the potential of these models in small object detection. The performance of the modified YOLOV3 and YOLOV4 models was evaluated on a publicly available malaria dataset. These models have achieved state-of-the-art accuracy by exceeding the performance of their original versions, Faster R-CNN, and SSD in terms of mean average precision (mAP), recall, precision, F1 score, and average IOU. YOLOV4-MOD has achieved the best detection accuracy among all the models with a mAP of 96.32%. YOLOV3-MOD2 and YOLOV3-MOD1 have achieved mAP of 96.14% and 95.46%, respectively.
Conclusions: The experimental results of this study demonstrate that the performance of the modified YOLOV3 and YOLOV4 models is highly promising for detecting malaria parasites from images captured by a smartphone camera over the microscope eyepiece. The proposed system is suitable for deployment in low-resource settings.
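The anchor-generation step mentioned in the Results can be illustrated with a small K-means sketch over ground-truth box sizes. The abstract only says that K-means is used; the 1 - IoU style assignment below is the variant commonly used for YOLO anchors and is an assumption here, as are the function names, the deterministic seeding, and the toy box sizes.

```python
def iou_wh(box, anchor):
    """IoU of two boxes given as (width, height), both anchored at the origin."""
    inter = min(box[0], anchor[0]) * min(box[1], anchor[1])
    union = box[0] * box[1] + anchor[0] * anchor[1] - inter
    return inter / union


def kmeans_anchors(boxes, k, iters=50):
    """Cluster (width, height) pairs into k anchors using IoU as similarity."""
    # Deterministic seeding: pick k boxes spread evenly across the area ranking.
    ordered = sorted(boxes, key=lambda b: b[0] * b[1])
    step = len(ordered) / k
    anchors = [ordered[int(i * step + step / 2)] for i in range(k)]

    for _ in range(iters):
        # Assignment step: each box joins the anchor it overlaps most.
        clusters = [[] for _ in range(k)]
        for box in boxes:
            best = max(range(k), key=lambda j: iou_wh(box, anchors[j]))
            clusters[best].append(box)

        # Update step: each anchor becomes the mean size of its cluster.
        updated = []
        for j, members in enumerate(clusters):
            if members:
                mean_w = sum(w for w, _ in members) / len(members)
                mean_h = sum(h for _, h in members) / len(members)
                updated.append((mean_w, mean_h))
            else:
                updated.append(anchors[j])  # keep an empty cluster's anchor
        if updated == anchors:
            break  # converged
        anchors = updated

    return sorted(anchors, key=lambda a: a[0] * a[1])


# Toy box sizes in pixels: three small objects and three larger ones.
toy_boxes = [(10, 10), (12, 12), (11, 11), (50, 60), (52, 58), (48, 62)]
toy_anchors = kmeans_anchors(toy_boxes, k=2)
```

For small targets such as parasites, clustering on the dataset's own box sizes yields anchors much smaller than generic defaults, which is the motivation the abstract gives for regenerating them.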


Multi-resolution Visual Positioning and Navigation Technique for Unmanned Aerial System Landing Assistance

Journal of Navigation ◽

10.1017/s0373463317000327 ◽

2017 ◽

Vol 70 (6) ◽

pp. 1276-1292

Author(s):

Chong Yu ◽

Jiyuan Cai ◽

Qingyu Chen

Keyword(s):

Real World ◽

State Of The Art ◽

Unmanned Aerial System ◽

Detection Accuracy ◽

Relative Positioning ◽

Positioning Accuracy ◽

Visual Positioning ◽

Positioning Technique ◽

Technique Comparison ◽

Resolution Simulation

To achieve more accurate navigation performance in the landing process, a multi-resolution visual positioning technique is proposed for landing assistance of an Unmanned Aerial System (UAS). This technique uses a captured image of an artificial landmark (e.g. barcode) to provide relative positioning information in the X, Y and Z axes, and yaw, roll and pitch orientations. A multi-resolution coding algorithm is designed to ensure the UAS will not lose the detection of the landing target due to limited visual angles or camera resolution. Simulation and real world experiments prove the performance of the proposed technique in positioning accuracy, detection accuracy, and navigation effect. Two types of UAS are used to verify the generalisation of the proposed technique. Comparison experiments to state-of-the-art techniques are also included with the results analysis.


Analysis of State-of-the-Art Spin-Transfer-Torque Nonvolatile Flip-Flops Considering Restore Yield in the Near/Sub-Threshold Voltage Region

Electronics ◽

10.3390/electronics9122118 ◽

2020 ◽

Vol 9 (12) ◽

pp. 2118

Author(s):

Gwang Hui Choi ◽

Taehui Na

Keyword(s):

Threshold Voltage ◽

State Of The Art ◽

Spin Transfer Torque ◽

Spin Transfer ◽

Model Parameters ◽

Offset Cancellation ◽

Battery Lifetime ◽

Battery Capacity ◽

Iot Devices ◽

Voltage Region

Recently, the leakage power consumption of Internet of Things (IoT) devices has become a main issue to be tackled, because the scaling of process technology increases the leakage current in IoT devices that have limited battery capacity, which reduces battery lifetime. The most effective method to extend the battery lifetime is to shut off the device during standby mode. For this reason, the spin-transfer-torque magnetic-tunnel-junction (STT-MTJ) based nonvolatile flip-flop (NVFF) is being considered as a strong candidate to store the computing data. Since there is a risk that the MTJ resistance may change during the read operation (i.e., the read disturbance problem), an NVFF should account for the read disturbance problem to ensure reliable data restoration. To date, several NVFFs have been proposed. Even though they satisfy the target restore yield of 4σ, most of them do not take the read disturbance into account. Furthermore, several recently proposed NVFFs that rely on the offset-cancellation technique to improve the restore yield have obvious limitations as the supply voltage (VDD) decreases, because the offset-cancellation technique uses a switch operation in the critical path that can degrade the restore yield in the near/sub-threshold region. In this regard, this paper analyzes state-of-the-art STT-MTJ based NVFFs with respect to the voltage region and provides the insight that a simple circuit with no offset-cancellation technique could achieve a better restore yield in the near/sub-threshold voltage region. Monte–Carlo HSPICE simulation results, using industry-compatible 28 nm model parameters, show that at a VDD of 0.6 V, complex NVFF circuits with an offset tolerance characteristic have a better restore yield, whereas at a VDD of 0.4 V with a sizing-up strategy, a simple NVFF circuit with no offset tolerance characteristic has a better restore yield.


Research on Lightweight Infrared Pedestrian Detection Model Algorithm for Embedded Platform

Security and Communication Networks ◽

10.1155/2021/1549772 ◽

2021 ◽

Vol 2021 ◽

pp. 1-7

Author(s):

Zhaoli Wu ◽

Xin Wang ◽

Chao Chen

Keyword(s):

Real Time ◽

Target Detection ◽

Pedestrian Detection ◽

Infrared Image ◽

Far Infrared ◽

Detection Algorithm ◽

Model Parameters ◽

Detection Accuracy ◽

Detection Model ◽

Embedded Platform

Due to limits on energy and power consumption, embedded platforms cannot meet the real-time requirements of far-infrared pedestrian detection algorithms. To solve this problem, this paper proposes a new real-time infrared pedestrian detection algorithm (RepVGG-YOLOv4, Rep-YOLO). It uses RepVGG to reconstruct the YOLOv4 backbone network, which reduces the number of model parameters and calculations and improves the speed of target detection; it uses spatial pyramid pooling (SPP) to obtain information from different receptive fields, which improves the accuracy of model detection; and it uses channel pruning to reduce redundant parameters, model size, and computational complexity. The experimental results show that, compared with the YOLOv4 target detection algorithm, Rep-YOLO reduces the model size by 90% and the floating-point computation by 93.4%, increases the inference speed by 4 times, and reaches a detection accuracy of 93.25% after compression.
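To make the SPP step more concrete, the sketch below shows the YOLO-style variant of spatial pyramid pooling on a single-channel feature map: the same map is max-pooled with several window sizes at stride 1, and the results are stacked with the input so each position sees several receptive fields. The abstract does not give the configuration; the kernel sizes 5, 9, and 13 are those commonly used in YOLOv4's SPP block and are an assumption here, as are the pure-Python helper names.

```python
def max_pool_same(fmap, k):
    """Stride-1 max pooling with a k x k window that keeps the map size.

    The window is clipped at the borders, which is equivalent to padding
    with negative infinity.
    """
    height, width = len(fmap), len(fmap[0])
    radius = k // 2
    pooled = []
    for i in range(height):
        row = []
        for j in range(width):
            window = [
                fmap[y][x]
                for y in range(max(0, i - radius), min(height, i + radius + 1))
                for x in range(max(0, j - radius), min(width, j + radius + 1))
            ]
            row.append(max(window))
        pooled.append(row)
    return pooled


def spp(fmap, kernel_sizes=(5, 9, 13)):
    """Return the input map plus one pooled map per kernel size (as channels)."""
    return [fmap] + [max_pool_same(fmap, k) for k in kernel_sizes]


# Toy 7x7 map with a single strong activation in the centre.
toy_map = [[0] * 7 for _ in range(7)]
toy_map[3][3] = 1
toy_channels = spp(toy_map, kernel_sizes=(3, 5))
```

Larger windows spread the central activation further across the map, so the stacked channels describe the same location at several receptive-field sizes without changing the spatial resolution.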
