Semi-Supervised Learning-Based Live Fish Identification in Aquaculture Using Modified Deep Convolutional Generative Adversarial Networks

Jian Zhao; Yihao Li; Fengdeng Zhang; Songming Zhu; Ying Liu; Huanda Lu; Zhangying Ye

doi:10.13031/trans.12684

Semi-Supervised Learning-Based Live Fish Identification in Aquaculture Using Modified Deep Convolutional Generative Adversarial Networks

Transactions of the ASABE ◽

10.13031/trans.12684 ◽

2018 ◽

Vol 61 (2) ◽

pp. 699-710 ◽

Cited By ~ 5

Author(s):

Jian Zhao ◽

Yihao Li ◽

Fengdeng Zhang ◽

Songming Zhu ◽

Ying Liu ◽

...

Keyword(s):

Supervised Learning ◽

Ground Truth ◽

Training Data ◽

Generative Adversarial Networks ◽

Low Resolution ◽

Adversarial Networks ◽

Live Fish ◽

Training Samples ◽

Spatial Pyramid Pooling ◽

Spatial Pyramid

Abstract. Aiming at live fish identification in aquaculture, a practical and efficient semi-supervised learning model, based on modified deep convolutional generative adversarial networks (DCGANs), was proposed in this study. Benefiting from the modified DCGANs structure, the presented model can be trained effectively using relatively few labeled training samples. In consideration of the complex poses of fish and the low resolution of sampling images in aquaculture, spatial pyramid pooling and some improved techniques specifically for the presented model were used to make the model more robust. Finally, in tests with two preprocessed and challenging datasets (with 5%, 10%, and 15% labeled training data in the fish recognition ground-truth dataset and 25%, 50%, and 75% labeled training data in the Croatian fish dataset), the feasibility and reliability of the presented model for live fish identification were proved with respective accuracies of 80.52%, 81.66%, and 83.07% for the ground-truth dataset and 65.13%, 78.72%, and 82.95% for the Croatian fish dataset. Keywords: Aquaculture, Deep convolutional generative adversarial networks, Few labeled training samples, Live fish identification, Semi-supervised learning, Spatial pyramid pooling.

Download Full-text

Automated Segmentation of Epithelial Tissue Using Cycle-Consistent Generative Adversarial Networks

10.1101/311373 ◽

2018 ◽

Cited By ~ 6

Author(s):

Matthias Häring ◽

Jörg Großhans ◽

Fred Wolf ◽

Stephan Eule

Keyword(s):

Ground Truth ◽

Training Data ◽

Generative Adversarial Networks ◽

Epithelial Tissue ◽

Automated Segmentation ◽

Segmentation Method ◽

Adversarial Networks ◽

Training Samples ◽

Segmentation Of Images ◽

Image Mask

AbstractA central problem in biomedical imaging is the automated segmentation of images for further quantitative analysis. Recently, fully convolutional neural networks, such as the U-Net, were applied successfully in a variety of segmentation tasks. A downside of this approach is the requirement for a large amount of well-prepared training samples, consisting of image - ground truth mask pairs. Since training data must be created by hand for each experiment, this task can be very costly and time-consuming. Here, we present a segmentation method based on cycle consistent generative adversarial networks, which can be trained even in absence of prepared image - mask pairs. We show that it successfully performs image segmentation tasks on samples with substantial defects and even generalizes well to different tissue types.

Download Full-text

Harnessing GANs for Zero-Shot Learning of New Classes in Visual Speech Recognition

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i03.5649 ◽

2020 ◽

Vol 34 (03) ◽

pp. 2645-2652 ◽

Cited By ~ 2

Author(s):

Yaman Kumar ◽

Dhruva Sahrawat ◽

Shubham Maheshwari ◽

Debanjan Mahata ◽

Amanda Stent ◽

...

Keyword(s):

Speech Recognition ◽

Classification Problem ◽

Visual Speech ◽

Training Data ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Novel Approach ◽

Visual Speech Recognition ◽

Training Samples ◽

English Training

Visual Speech Recognition (VSR) is the process of recognizing or interpreting speech by watching the lip movements of the speaker. Recent machine learning based approaches model VSR as a classification problem; however, the scarcity of training data leads to error-prone systems with very low accuracies in predicting unseen classes. To solve this problem, we present a novel approach to zero-shot learning by generating new classes using Generative Adversarial Networks (GANs), and show how the addition of unseen class samples increases the accuracy of a VSR system by a significant margin of 27% and allows it to handle speaker-independent out-of-vocabulary phrases. We also show that our models are language agnostic and therefore capable of seamlessly generating, using English training data, videos for a new language (Hindi). To the best of our knowledge, this is the first work to show empirical evidence of the use of GANs for generating training samples of unseen classes in the domain of VSR, hence facilitating zero-shot learning. We make the added videos for new classes publicly available along with our code1.

Download Full-text

Investigating the Performance of Generative Adversarial Networks for Prostate Tissue Detection and Segmentation

Journal of Imaging ◽

10.3390/jimaging6090083 ◽

2020 ◽

Vol 6 (9) ◽

pp. 83 ◽

Cited By ~ 1

Author(s):

Ufuk Cem Birbiri ◽

Azam Hamidinekoo ◽

Amélie Grall ◽

Paul Malcolm ◽

Reyer Zwiggelaar

Keyword(s):

Region Of Interest ◽

Training Data ◽

Prostate Tissue ◽

Generative Adversarial Networks ◽

Correct Identification ◽

Adversarial Networks ◽

Training Samples ◽

Mri Scans ◽

Magnetic Resonance Imaging Mri ◽

Public Datasets

The manual delineation of region of interest (RoI) in 3D magnetic resonance imaging (MRI) of the prostate is time-consuming and subjective. Correct identification of prostate tissue is helpful to define a precise RoI to be used in CAD systems in clinical practice during diagnostic imaging, radiotherapy and monitoring the progress of disease. Conditional GAN (cGAN), cycleGAN and U-Net models and their performances were studied for the detection and segmentation of prostate tissue in 3D multi-parametric MRI scans. These models were trained and evaluated on MRI data from 40 patients with biopsy-proven prostate cancer. Due to the limited amount of available training data, three augmentation schemes were proposed to artificially increase the training samples. These models were tested on a clinical dataset annotated for this study and on a public dataset (PROMISE12). The cGAN model outperformed the U-Net and cycleGAN predictions owing to the inclusion of paired image supervision. Based on our quantitative results, cGAN gained a Dice score of 0.78 and 0.75 on the private and the PROMISE12 public datasets, respectively.

Download Full-text

Generalizable fully automated multi-label segmentation of four-chamber view echocardiograms based on deep convolutional adversarial networks

Journal of The Royal Society Interface ◽

10.1098/rsif.2020.0267 ◽

2020 ◽

Vol 17 (169) ◽

pp. 20200267

Author(s):

Arghavan Arafati ◽

Daisuke Morisawa ◽

Michael R. Avendi ◽

M. Reza Amini ◽

Ramin A. Assadi ◽

...

Keyword(s):

Automatic Segmentation ◽

Ground Truth ◽

Training Data ◽

Generative Adversarial Networks ◽

Pixel Classification ◽

Systolic Volume ◽

Chamber View ◽

Adversarial Networks ◽

Generalization Problem ◽

Fully Automatic

A major issue in translation of the artificial intelligence platforms for automatic segmentation of echocardiograms to clinics is their generalizability. The present study introduces and verifies a novel generalizable and efficient fully automatic multi-label segmentation method for four-chamber view echocardiograms based on deep fully convolutional networks (FCNs) and adversarial training. For the first time, we used generative adversarial networks for pixel classification training, a novel method in machine learning not currently used for cardiac imaging, to overcome the generalization problem. The method's performance was validated against manual segmentations as the ground-truth. Furthermore, to verify our method's generalizability in comparison with other existing techniques, we compared our method's performance with a state-of-the-art method on our dataset in addition to an independent dataset of 450 patients from the CAMUS (cardiac acquisitions for multi-structure ultrasound segmentation) challenge. On our test dataset, automatic segmentation of all four chambers achieved a dice metric of 92.1%, 86.3%, 89.6% and 91.4% for LV, RV, LA and RA, respectively. LV volumes' correlation between automatic and manual segmentation were 0.94 and 0.93 for end-diastolic volume and end-systolic volume, respectively. Excellent agreement with chambers’ reference contours and significant improvement over previous FCN-based methods suggest that generative adversarial networks for pixel classification training can effectively design generalizable fully automatic FCN-based networks for four-chamber segmentation of echocardiograms even with limited number of training data.

Download Full-text

GANDaLF: GAN for Data-Limited Fingerprinting

Proceedings on Privacy Enhancing Technologies ◽

10.2478/popets-2021-0029 ◽

2021 ◽

Vol 2021 (2) ◽

pp. 305-322

Author(s):

Se Eun Oh ◽

Nate Mathews ◽

Mohammad Saidur Rahman ◽

Matthew Wright ◽

Nicholas Hopper

Keyword(s):

Deep Learning ◽

Training Data ◽

Generative Adversarial Networks ◽

Large Set ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Closed World ◽

Training Samples ◽

A Site

Abstract We introduce Generative Adversarial Networks for Data-Limited Fingerprinting (GANDaLF), a new deep-learning-based technique to perform Website Fingerprinting (WF) on Tor traffic. In contrast to most earlier work on deep-learning for WF, GANDaLF is intended to work with few training samples, and achieves this goal through the use of a Generative Adversarial Network to generate a large set of “fake” data that helps to train a deep neural network in distinguishing between classes of actual training data. We evaluate GANDaLF in low-data scenarios including as few as 10 training instances per site, and in multiple settings, including fingerprinting of website index pages and fingerprinting of non-index pages within a site. GANDaLF achieves closed-world accuracy of 87% with just 20 instances per site (and 100 sites) in standard WF settings. In particular, GANDaLF can outperform Var-CNN and Triplet Fingerprinting (TF) across all settings in subpage fingerprinting. For example, GANDaLF outperforms TF by a 29% margin and Var-CNN by 38% for training sets using 20 instances per site.

Download Full-text

Improved SinGAN Integrated with an Attentional Mechanism for Remote Sensing Image Classification

Remote Sensing ◽

10.3390/rs13091713 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1713

Author(s):

Songwei Gu ◽

Rui Zhang ◽

Hongxia Luo ◽

Mengyao Li ◽

Huamei Feng ◽

...

Keyword(s):

Remote Sensing ◽

Real Life ◽

Attention Mechanism ◽

Training Data ◽

Generative Adversarial Networks ◽

Natural Image ◽

Remote Sensing Images ◽

Training Time ◽

Adversarial Networks ◽

Remote Sensing Image Classification

Deep learning is an important research method in the remote sensing field. However, samples of remote sensing images are relatively few in real life, and those with markers are scarce. Many neural networks represented by Generative Adversarial Networks (GANs) can learn from real samples to generate pseudosamples, rather than traditional methods that often require more time and man-power to obtain samples. However, the generated pseudosamples often have poor realism and cannot be reliably used as the basis for various analyses and applications in the field of remote sensing. To address the abovementioned problems, a pseudolabeled sample generation method is proposed in this work and applied to scene classification of remote sensing images. The improved unconditional generative model that can be learned from a single natural image (Improved SinGAN) with an attention mechanism can effectively generate enough pseudolabeled samples from a single remote sensing scene image sample. Pseudosamples generated by the improved SinGAN model have stronger realism and relatively less training time, and the extracted features are easily recognized in the classification network. The improved SinGAN can better identify sub-jects from images with complex ground scenes compared with the original network. This mechanism solves the problem of geographic errors of generated pseudosamples. This study incorporated the generated pseudosamples into training data for the classification experiment. The result showed that the SinGAN model with the integration of the attention mechanism can better guarantee feature extraction of the training data. Thus, the quality of the generated samples is improved and the classification accuracy and stability of the classification network are also enhanced.

Download Full-text

Intrusion detection of railway clearance from infrared images using generative adversarial networks

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-192141 ◽

2020 ◽

pp. 1-13

Author(s):

Yundong Li ◽

Yi Liu ◽

Han Dong ◽

Wei Hu ◽

Chen Lin

Keyword(s):

Intrusion Detection ◽

Synthetic Data ◽

Generative Adversarial Networks ◽

Generation Model ◽

Single Shot ◽

Data Generation ◽

Infrared Images ◽

Adversarial Networks ◽

Training Samples ◽

Rgb Images

The intrusion detection of railway clearance is crucial for avoiding railway accidents caused by the invasion of abnormal objects, such as pedestrians, falling rocks, and animals. However, detecting intrusions using deep learning methods from infrared images captured at night remains a challenging task because of the lack of sufficient training samples. To address this issue, a transfer strategy that migrates daytime RGB images to the nighttime style of infrared images is proposed in this study. The proposed method consists of two stages. In the first stage, a data generation model is trained on the basis of generative adversarial networks using RGB images and a small number of infrared images, and then, synthetic samples are generated using a well-trained model. In the second stage, a single shot multibox detector (SSD) model is trained using synthetic data and utilized to detect abnormal objects from infrared images at nighttime. To validate the effectiveness of the proposed method, two groups of experiments, namely, railway and non-railway scenes, are conducted. Experimental results demonstrate the effectiveness of the proposed method, and an improvement of 17.8% is achieved for object detection at nighttime.

Download Full-text

Optimizing Generative Adversarial Networks for Low-Resolution Image Enhancement

2020 SoutheastCon ◽

10.1109/southeastcon44009.2020.9368265 ◽

2020 ◽

Author(s):

Justin Hall ◽

Maria Gonzalez Bocanegra ◽

Rami J. Haddad

Keyword(s):

Image Enhancement ◽

Generative Adversarial Networks ◽

Low Resolution ◽

Resolution Image ◽

Adversarial Networks

Download Full-text

Automatic Target Recognition for Low Resolution Foliage Penetrating SAR Images Using CNNs and GANs

Remote Sensing ◽

10.3390/rs13040596 ◽

2021 ◽

Vol 13 (4) ◽

pp. 596

Author(s):

David Vint ◽

Matthew Anderson ◽

Yuhao Yang ◽

Christos Ilioudis ◽

Gaetano Di Caterina ◽

...

Keyword(s):

Target Recognition ◽

Automatic Target Recognition ◽

Generative Adversarial Networks ◽

Low Resolution ◽

Sar Images ◽

Adversarial Networks ◽

Technological Advances ◽

Dataset Size ◽

Resolution Imaging ◽

High Level

In recent years, the technological advances leading to the production of high-resolution Synthetic Aperture Radar (SAR) images has enabled more and more effective target recognition capabilities. However, high spatial resolution is not always achievable, and, for some particular sensing modes, such as Foliage Penetrating Radars, low resolution imaging is often the only option. In this paper, the problem of automatic target recognition in Low Resolution Foliage Penetrating (FOPEN) SAR is addressed through the use of Convolutional Neural Networks (CNNs) able to extract both low and high level features of the imaged targets. Additionally, to address the issue of limited dataset size, Generative Adversarial Networks are used to enlarge the training set. Finally, a Receiver Operating Characteristic (ROC)-based post-classification decision approach is used to reduce classification errors and measure the capability of the classifier to provide a reliable output. The effectiveness of the proposed framework is demonstrated through the use of real SAR FOPEN data.

Download Full-text

Linear electromagnetic inverse scattering via generative adversarial networks

International Journal of Microwave and Wireless Technologies ◽

10.1017/s1759078721001331 ◽

2021 ◽

pp. 1-9

Author(s):

Huilin Zhou ◽

Huimin Zheng ◽

Qiegen Liu ◽

Jian Liu ◽

Yuhao Wang

Keyword(s):

Inverse Scattering ◽

Optimization Methods ◽

Training Data ◽

Generative Adversarial Networks ◽

Scattering Problems ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Highly Nonlinear ◽

Electromagnetic Inverse Scattering

Abstract Electromagnetic inverse-scattering problems (ISPs) are concerned with determining the properties of an unknown object using measured scattered fields. ISPs are often highly nonlinear, causing the problem to be very difficult to address. In addition, the reconstruction images of different optimization methods are distorted which leads to inaccurate reconstruction results. To alleviate these issues, we propose a new linear model solution of generative adversarial network-based (LM-GAN) inspired by generative adversarial networks (GAN). Two sub-networks are trained alternately in the adversarial framework. A linear deep iterative network as a generative network captures the spatial distribution of the data, and a discriminative network estimates the probability of a sample from the training data. Numerical results validate that LM-GAN has admirable fidelity and accuracy when reconstructing complex scatterers.

Download Full-text