SeedSortNet: a rapid and highly effificient lightweight CNN based on visual attention for seed sorting

PeerJ Computer Science ◽

10.7717/peerj-cs.639 ◽

2021 ◽

Vol 7 ◽

pp. e639

Author(s):

Chunlei Li ◽

Huanyu Li ◽

Zhoufeng Liu ◽

Bicao Li ◽

Yun Huang

Keyword(s):

Computational Complexity ◽

Visual Attention ◽

Feature Space ◽

Learning Technology ◽

Spatial Transformation ◽

Computational Costs ◽

Fine Grained ◽

Multi Scale ◽

Accuracy Rates

Seed purity directly affects the quality of seed breeding and subsequent processing products. Seed sorting based on machine vision provides an effective solution to this problem. The deep learning technology, particularly convolutional neural networks (CNNs), have exhibited impressive performance in image recognition and classification, and have been proven applicable in seed sorting. However the huge computational complexity and massive storage requirements make it a great challenge to deploy them in real-time applications, especially on devices with limited resources. In this study, a rapid and highly efficient lightweight CNN based on visual attention, namely SeedSortNet, is proposed for seed sorting. First, a dual-branch lightweight feature extraction module Shield-block is elaborately designed by performing identity mapping, spatial transformation at higher dimensions and different receptive field modeling, and thus it can alleviate information loss and effectively characterize the multi-scale feature while utilizing fewer parameters and lower computational complexity. In the down-sampling layer, the traditional MaxPool is replaced as MaxBlurPool to improve the shift-invariant of the network. Also, an extremely lightweight sub-feature space attention module (SFSAM) is presented to selectively emphasize fine-grained features and suppress the interference of complex backgrounds. Experimental results show that SeedSortNet achieves the accuracy rates of 97.33% and 99.56% on the maize seed dataset and sunflower seed dataset, respectively, and outperforms the mainstream lightweight networks (MobileNetv2, ShuffleNetv2, etc.) at similar computational costs, with only 0.400M parameters (vs. 4.06M, 5.40M).

Download Full-text

The Spatial-Comprehensiveness (S-COM) Index: Identifying Optimal Spatial Extents in Volunteered Geographic Information Point Datasets

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9090497 ◽

2020 ◽

Vol 9 (9) ◽

pp. 497

Author(s):

Haydn Lawrence ◽

Colin Robertson ◽

Rob Feick ◽

Trisalyn Nelson

Keyword(s):

Social Media ◽

Data Quality ◽

Spatial Scales ◽

Volunteered Geographic Information ◽

Geographic Information ◽

Quality Metric ◽

Fine Grained ◽

Multi Scale ◽

Feasible Study

Social media and other forms of volunteered geographic information (VGI) are used frequently as a source of fine-grained big data for research. While employing geographically referenced social media data for a wide array of purposes has become commonplace, the relevant scales over which these data apply to is typically unknown. For researchers to use VGI appropriately (e.g., aggregated to areal units (e.g., neighbourhoods) to elicit key trend or demographic information), general methods for assessing the quality are required, particularly, the explicit linkage of data quality and relevant spatial scales, as there are no accepted standards or sampling controls. We present a data quality metric, the Spatial-comprehensiveness Index (S-COM), which can delineate feasible study areas or spatial extents based on the quality of uneven and dynamic geographically referenced VGI. This scale-sensitive approach to analyzing VGI is demonstrated over different grains with data from two citizen science initiatives. The S-COM index can be used both to assess feasible study extents based on coverage, user-heterogeneity, and density and to find feasible sub-study areas from a larger, indefinite area. The results identified sub-study areas of VGI for focused analysis, allowing for a larger adoption of a similar methodology in multi-scale analyses of VGI.

Download Full-text

Reducing the Deterioration of Sentiment Analysis Results Due to the Time Impact

Information ◽

10.3390/info9080184 ◽

2018 ◽

Vol 9 (8) ◽

pp. 184 ◽

Cited By ~ 2

Author(s):

Yuliya Rubtsova

Keyword(s):

Computational Complexity ◽

Text Classification ◽

Feature Space ◽

Sentiment Classification ◽

Text Collections ◽

Word Representation ◽

F Measure ◽

Over Time

The research identifies and substantiates the problem of quality deterioration in the sentiment classification of text collections identical in composition and characteristics, but staggered over time. It is shown that the quality of sentiment classification can drop up to 15% in terms of the F-measure over a year and a half. This paper presents three different approaches to improving text classification by sentiment in continuously-updated text collections in Russian: using a weighing scheme with linear computational complexity, adding lexicons of emotional vocabulary to the feature space and distributed word representation. All methods are compared, and it is shown which method is most applicable in certain cases. Experiments comparing the methods on sufficiently representative text collections are described. It is shown that suggested approaches could reduce the deterioration of sentiment classification results for collections staggered over time.

Download Full-text

SSAH: Semi-Supervised Adversarial Deep Hashing with Self-Paced Hard Sample Generation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6773 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11157-11164

Author(s):

Sheng Jin ◽

Shangchen Zhou ◽

Yao Liu ◽

Chao Chen ◽

Xiaoshuai Sun ◽

...

Keyword(s):

Large Scale ◽

Semantic Information ◽

Unified Framework ◽

Generative Adversarial Network ◽

Fine Grained ◽

Multi Scale ◽

Deep Hashing ◽

Adversarial Network ◽

Improve State

Deep hashing methods have been proved to be effective and efficient for large-scale Web media search. The success of these data-driven methods largely depends on collecting sufficient labeled data, which is usually a crucial limitation in practical cases. The current solutions to this issue utilize Generative Adversarial Network (GAN) to augment data in semi-supervised learning. However, existing GAN-based methods treat image generations and hashing learning as two isolated processes, leading to generation ineffectiveness. Besides, most works fail to exploit the semantic information in unlabeled data. In this paper, we propose a novel Semi-supervised Self-pace Adversarial Hashing method, named SSAH to solve the above problems in a unified framework. The SSAH method consists of an adversarial network (A-Net) and a hashing network (H-Net). To improve the quality of generative images, first, the A-Net learns hard samples with multi-scale occlusions and multi-angle rotated deformations which compete against the learning of accurate hashing codes. Second, we design a novel self-paced hard generation policy to gradually increase the hashing difficulty of generated samples. To make use of the semantic information in unlabeled ones, we propose a semi-supervised consistent loss. The experimental results show that our method can significantly improve state-of-the-art models on both the widely-used hashing datasets and fine-grained datasets.

Download Full-text

Designing and researching technology-enhanced learning for the zone of proximal implementation

Research in Learning Technology ◽

10.3402/rlt.v21i0.17374 ◽

2013 ◽

Vol 21 ◽

Cited By ~ 19

Author(s):

Susan McKenney

Keyword(s):

Teaching And Learning ◽

Design Research ◽

Value Added ◽

Technology Enhanced Learning ◽

Learning Technology ◽

Fine Grained ◽

Pupil Learning ◽

Methodological Considerations ◽

Enhanced Learning

Internationally, society is increasingly demanding that the relevance and practical applicability of research be made transparent. Despite intentions to the contrary, insights on pedagogically appropriate uses of educational technology for representative teachers in everyday school settings are severely limited. In part, this is because (design) research is often conducted at the bleeding edge of what is technologically possible – exploring innovative uses of new and emerging technologies. There is no disputing that such work is greatly needed to seek out new ways to potentially enhance the quality of teaching and learning. However, in the excitement of exploring what is possible, tomorrow, insufficient research and development work focuses on what is practical, today. This leaves a problematic gap between what could be effective technology-enhanced learning (TEL) in theory, and what can be effective TEL in practice. This paper calls for designers/researchers of TEL to devote attention to not only fine-grained issues of pupil learning and instruction but also to broader factors that determine if and how innovations are understood, adopted and used by teachers and schools, by designing innovations to align with their zone of proximal implementation. Methodological considerations are given for designing and studying interventions that are prone to implementation by being: value-added, clear, harmonious and tolerant.Keywords: learning design; implementation; innovation(Published: 16 September 2013)Citation: Research in Learning Technology Supplement 2013, 21: 17374 - http://dx.doi.org/10.3402/rlt.v21i0.17374

Download Full-text

Detection of Myocardial Infarction Using ECG and Multi-Scale Feature Concatenate

Sensors ◽

10.3390/s21051906 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1906

Author(s):

Jia-Zheng Jian ◽

Tzong-Rong Ger ◽

Han-Hua Lai ◽

Chi-Ming Ku ◽

Chiung-An Chen ◽

...

Keyword(s):

Myocardial Infarction ◽

Network Structure ◽

Class Imbalance ◽

Class Imbalance Problem ◽

Multi Scale ◽

Imbalance Problem ◽

Average Accuracy ◽

Significant Difference ◽

Electrocardiogram Ecg

Diverse computer-aided diagnosis systems based on convolutional neural networks were applied to automate the detection of myocardial infarction (MI) found in electrocardiogram (ECG) for early diagnosis and prevention. However, issues, particularly overfitting and underfitting, were not being taken into account. In other words, it is unclear whether the network structure is too simple or complex. Toward this end, the proposed models were developed by starting with the simplest structure: a multi-lead features-concatenate narrow network (N-Net) in which only two convolutional layers were included in each lead branch. Additionally, multi-scale features-concatenate networks (MSN-Net) were also implemented where larger features were being extracted through pooling the signals. The best structure was obtained via tuning both the number of filters in the convolutional layers and the number of inputting signal scales. As a result, the N-Net reached a 95.76% accuracy in the MI detection task, whereas the MSN-Net reached an accuracy of 61.82% in the MI locating task. Both networks give a higher average accuracy and a significant difference of p < 0.001 evaluated by the U test compared with the state-of-the-art. The models are also smaller in size thus are suitable to fit in wearable devices for offline monitoring. In conclusion, testing throughout the simple and complex network structure is indispensable. However, the way of dealing with the class imbalance problem and the quality of the extracted features are yet to be discussed.

Download Full-text

Ensemble-Based Out-of-Distribution Detection

Electronics ◽

10.3390/electronics10050567 ◽

2021 ◽

Vol 10 (5) ◽

pp. 567

Author(s):

Donghun Yang ◽

Kien Mai Mai Ngoc ◽

Iksoo Shin ◽

Kyong-Ha Lee ◽

Myunggwon Hwang

Keyword(s):

Detection Method ◽

State Of The Art ◽

Metric Learning ◽

Feature Space ◽

Confidence Score ◽

Distance Metric Learning ◽

Current State ◽

Overall Performance ◽

Deep Learning Model

To design an efficient deep learning model that can be used in the real-world, it is important to detect out-of-distribution (OOD) data well. Various studies have been conducted to solve the OOD problem. The current state-of-the-art approach uses a confidence score based on the Mahalanobis distance in a feature space. Although it outperformed the previous approaches, the results were sensitive to the quality of the trained model and the dataset complexity. Herein, we propose a novel OOD detection method that can train more efficient feature space for OOD detection. The proposed method uses an ensemble of the features trained using the softmax-based classifier and the network based on distance metric learning (DML). Through the complementary interaction of these two networks, the trained feature space has a more clumped distribution and can fit well on the Gaussian distribution by class. Therefore, OOD data can be efficiently detected by setting a threshold in the trained feature space. To evaluate the proposed method, we applied our method to various combinations of image datasets. The results show that the overall performance of the proposed approach is superior to those of other methods, including the state-of-the-art approach, on any combination of datasets.

Download Full-text

Non-Local and Multi-Scale Mechanisms for Image Inpainting

Sensors ◽

10.3390/s21093281 ◽

2021 ◽

Vol 21 (9) ◽

pp. 3281

Author(s):

Xu He ◽

Yong Yin

Keyword(s):

Markov Random Fields ◽

Receptive Fields ◽

Image Inpainting ◽

Long Distance ◽

Visual Appearance ◽

Fine Grained ◽

Multi Scale ◽

Non Local ◽

Relationship Of

Recently, deep learning-based techniques have shown great power in image inpainting especially dealing with squared holes. However, they fail to generate plausible results inside the missing regions for irregular and large holes as there is a lack of understanding between missing regions and existing counterparts. To overcome this limitation, we combine two non-local mechanisms including a contextual attention module (CAM) and an implicit diversified Markov random fields (ID-MRF) loss with a multi-scale architecture which uses several dense fusion blocks (DFB) based on the dense combination of dilated convolution to guide the generative network to restore discontinuous and continuous large masked areas. To prevent color discrepancies and grid-like artifacts, we apply the ID-MRF loss to improve the visual appearance by comparing similarities of long-distance feature patches. To further capture the long-term relationship of different regions in large missing regions, we introduce the CAM. Although CAM has the ability to create plausible results via reconstructing refined features, it depends on initial predicted results. Hence, we employ the DFB to obtain larger and more effective receptive fields, which benefits to predict more precise and fine-grained information for CAM. Extensive experiments on two widely-used datasets demonstrate that our proposed framework significantly outperforms the state-of-the-art approaches both in quantity and quality.

Download Full-text

Performance of deep learning technology for evaluation of positioning quality in periapical radiography of the maxillary canine

Oral Radiology ◽

10.1007/s11282-021-00538-2 ◽

2021 ◽

Author(s):

Mizuho Mori ◽

Yoshiko Ariji ◽

Motoki Fukuda ◽

Tomoya Kitano ◽

Takuma Funakoshi ◽

...

Keyword(s):

Deep Learning ◽

Characteristic Curve ◽

Classification Performance ◽

Learning Systems ◽

Learning Technology ◽

System 2 ◽

Potential Benefits ◽

System 1 ◽

Periapical Radiography

Abstract Objectives The aim of the present study was to create and test an automatic system for assessing the technical quality of positioning in periapical radiography of the maxillary canines using deep learning classification and segmentation techniques. Methods We created and tested two deep learning systems using 500 periapical radiographs (250 each of good- and bad-quality images). We assigned 350, 70, and 80 images as the training, validation, and test datasets, respectively. The learning model of system 1 was created with only the classification process, whereas system 2 consisted of both the segmentation and classification models. In each model, 500 epochs of training were performed using AlexNet and U-net for classification and segmentation, respectively. The segmentation results were evaluated by the intersection over union method, with values of 0.6 or more considered as success. The classification results were compared between the two systems. Results The segmentation performance of system 2 was recall, precision, and F measure of 0.937, 0.961, and 0.949, respectively. System 2 showed better classification performance values than those obtained by system 1. The area under the receiver operating characteristic curve values differed significantly between system 1 (0.649) and system 2 (0.927). Conclusions The deep learning systems we created appeared to have potential benefits in evaluation of the technical positioning quality of periapical radiographs through the use of segmentation and classification functions.

Download Full-text

Multi-scale mesh saliency based on low-rank and sparse analysis in shape feature space

Computer Aided Geometric Design ◽

10.1016/j.cagd.2015.03.003 ◽

2015 ◽

Vol 35-36 ◽

pp. 206-214 ◽

Cited By ~ 12

Author(s):

Shengfa Wang ◽

Nannan Li ◽

Shuai Li ◽

Zhongxuan Luo ◽

Zhixun Su ◽

...

Keyword(s):

Feature Space ◽

Low Rank ◽

Shape Feature ◽

Mesh Saliency ◽

Multi Scale

Download Full-text

A novel two-stage method of plant seedlings classification based on deep learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211507 ◽

2021 ◽

pp. 1-11

Author(s):

Tianhong Dai ◽

Shijie Cong ◽

Jianping Huang ◽

Yanwen Zhang ◽

Xinwang Huang ◽

...

Keyword(s):

Deep Learning ◽

Learning Technology ◽

Two Stage ◽

Second Stage ◽

Stage Classification ◽

Different Types ◽

Two Stages ◽

Plant Seedlings

In agricultural production, weed removal is an important part of crop cultivation, but inevitably, other plants compete with crops for nutrients. Only by identifying and removing weeds can the quality of the harvest be guaranteed. Therefore, the distinction between weeds and crops is particularly important. Recently, deep learning technology has also been applied to the field of botany, and achieved good results. Convolutional neural networks are widely used in deep learning because of their excellent classification effects. The purpose of this article is to find a new method of plant seedling classification. This method includes two stages: image segmentation and image classification. The first stage is to use the improved U-Net to segment the dataset, and the second stage is to use six classification networks to classify the seedlings of the segmented dataset. The dataset used for the experiment contained 12 different types of plants, namely, 3 crops and 9 weeds. The model was evaluated by the multi-class statistical analysis of accuracy, recall, precision, and F1-score. The results show that the two-stage classification method combining the improved U-Net segmentation network and the classification network was more conducive to the classification of plant seedlings, and the classification accuracy reaches 97.7%.

Download Full-text