The Additive Input-Doubling Method Based on the SVR with Nonlinear Kernels: Small Data Approach

Ivan Izonin; Roman Tkachenko; Nataliya Shakhovska; Nataliia Lotoshynska

doi:10.3390/sym13040612

The Additive Input-Doubling Method Based on the SVR with Nonlinear Kernels: Small Data Approach

Symmetry ◽

10.3390/sym13040612 ◽

2021 ◽

Vol 13 (4) ◽

pp. 612

Author(s):

Ivan Izonin ◽

Roman Tkachenko ◽

Nataliya Shakhovska ◽

Nataliia Lotoshynska

Keyword(s):

Axial Symmetry ◽

Prediction Accuracy ◽

Data Augmentation ◽

Materials Science ◽

Small Data ◽

Training Procedure ◽

Optimal Parameters ◽

Doubling Method ◽

Nonlinear Kernels ◽

Augmentation Procedure

The problem of effective intellectual analysis in the case of handling short datasets is topical in various application areas. Such problems arise in medicine, economics, materials science, science, etc. This paper deals with a new additive input-doubling method designed by the authors for processing short and very short datasets. The main steps of the method should include the procedure of data augmentation within the existing dataset both in rows and columns (without training), the use of nonlinear SVR to implement the training procedure, and the formation of the result based on the author’s procedure. The authors show that the developed data augmentation procedure corresponds to the principles of axial symmetry. The training and application procedures of the method developed are described in detail, and two algorithmic implementations are presented. The optimal parameters of the method operation were selected experimentally. The efficiency of its work during the processing of short datasets for solving the prediction task was established experimentally by comparison with other methods of this class. The highest prediction accuracy based on both proposed algorithmic implementations of a method among all of the investigated ones was defined. The main areas of application of the developed method are described, and its shortcomings and prospects of further research are given.

Download Full-text

Automatic Evaluation of the Lung Condition of COVID-19 Patients Using X-ray Images and Convolutional Neural Networks

Journal of Personalized Medicine ◽

10.3390/jpm11010028 ◽

2021 ◽

Vol 11 (1) ◽

pp. 28

Author(s):

Ivan Lorencin ◽

Sandi Baressi Šegota ◽

Nikola Anđelić ◽

Anđela Blagojević ◽

Tijana Šušteršić ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Clinical Picture ◽

Data Augmentation ◽

Lower Amount ◽

Training Procedure ◽

X Ray ◽

Severe Clinical Picture ◽

Lung Condition

COVID-19 represents one of the greatest challenges in modern history. Its impact is most noticeable in the health care system, mostly due to the accelerated and increased influx of patients with a more severe clinical picture. These facts are increasing the pressure on health systems. For this reason, the aim is to automate the process of diagnosis and treatment. The research presented in this article conducted an examination of the possibility of classifying the clinical picture of a patient using X-ray images and convolutional neural networks. The research was conducted on the dataset of 185 images that consists of four classes. Due to a lower amount of images, a data augmentation procedure was performed. In order to define the CNN architecture with highest classification performances, multiple CNNs were designed. Results show that the best classification performances can be achieved if ResNet152 is used. This CNN has achieved AUCmacro¯ and AUCmicro¯ up to 0.94, suggesting the possibility of applying CNN to the classification of the clinical picture of COVID-19 patients using an X-ray image of the lungs. When higher layers are frozen during the training procedure, higher AUCmacro¯ and AUCmicro¯ values are achieved. If ResNet152 is utilized, AUCmacro¯ and AUCmicro¯ values up to 0.96 are achieved if all layers except the last 12 are frozen during the training procedure.

Download Full-text

Combining a convolutional neural network with autoencoders to predict the survival chance of COVID-19 patients

Scientific Reports ◽

10.1038/s41598-021-93543-8 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Fahime Khozeimeh ◽

Danial Sharifrazi ◽

Navid Hoseini Izadi ◽

Javad Hassannataj Joloudari ◽

Afshin Shoeibi ◽

...

Keyword(s):

Clinical Data ◽

Data Augmentation ◽

Clinical Information ◽

Ct Images ◽

Classification Performance ◽

Survival Chance ◽

Average Accuracy ◽

Novel Method ◽

Aided Diagnosis ◽

Augmentation Procedure

AbstractCOVID-19 has caused many deaths worldwide. The automation of the diagnosis of this virus is highly desired. Convolutional neural networks (CNNs) have shown outstanding classification performance on image datasets. To date, it appears that COVID computer-aided diagnosis systems based on CNNs and clinical information have not yet been analysed or explored. We propose a novel method, named the CNN-AE, to predict the survival chance of COVID-19 patients using a CNN trained with clinical information. Notably, the required resources to prepare CT images are expensive and limited compared to those required to collect clinical data, such as blood pressure, liver disease, etc. We evaluated our method using a publicly available clinical dataset that we collected. The dataset properties were carefully analysed to extract important features and compute the correlations of features. A data augmentation procedure based on autoencoders (AEs) was proposed to balance the dataset. The experimental results revealed that the average accuracy of the CNN-AE (96.05%) was higher than that of the CNN (92.49%). To demonstrate the generality of our augmentation method, we trained some existing mortality risk prediction methods on our dataset (with and without data augmentation) and compared their performances. We also evaluated our method using another dataset for further generality verification. To show that clinical data can be used for COVID-19 survival chance prediction, the CNN-AE was compared with multiple pre-trained deep models that were tuned based on CT images.

Download Full-text

A sparse modeling for small data: Case studies in controlled syntheses of 2D materials

Digital Discovery ◽

10.1039/d1dd00010a ◽

2022 ◽

Author(s):

Yuri Haraguchi ◽

Yasuhiko Igarashi ◽

Hiroaki Imai ◽

Yuya Oaki

Keyword(s):

Experimental Data ◽

Case Studies ◽

Materials Science ◽

2D Materials ◽

Small Data ◽

Sparse Modeling

Data-scientific approaches have permeated in chemistry and materials science. In general, these approaches are not easily applied to small data, such as experimental data in laboratories. Our group has focused...

Download Full-text

Cross-property deep transfer learning framework for enhanced predictive analytics on small materials data

Nature Communications ◽

10.1038/s41467-021-26921-5 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Vishu Gupta ◽

Kamal Choudhary ◽

Francesca Tavazza ◽

Carelyn Campbell ◽

Wei-keng Liao ◽

...

Keyword(s):

Transfer Learning ◽

Predictive Models ◽

Materials Science ◽

Predictive Analytics ◽

Large Datasets ◽

Small Data ◽

Widespread Application ◽

Learning Framework ◽

Large Databases ◽

Physical Attributes

AbstractArtificial intelligence (AI) and machine learning (ML) have been increasingly used in materials science to build predictive models and accelerate discovery. For selected properties, availability of large databases has also facilitated application of deep learning (DL) and transfer learning (TL). However, unavailability of large datasets for a majority of properties prohibits widespread application of DL/TL. We present a cross-property deep-transfer-learning framework that leverages models trained on large datasets to build models on small datasets of different properties. We test the proposed framework on 39 computational and two experimental datasets and find that the TL models with only elemental fractions as input outperform ML/DL models trained from scratch even when they are allowed to use physical attributes as input, for 27/39 (≈ 69%) computational and both the experimental datasets. We believe that the proposed framework can be widely useful to tackle the small data challenge in applying AI/ML in materials science.

Download Full-text

Dynamic Ferromagnetic Hysteresis Modelling Using a Preisach-Recurrent Neural Network Model

Materials ◽

10.3390/ma13112561 ◽

2020 ◽

Vol 13 (11) ◽

pp. 2561 ◽

Cited By ~ 1

Author(s):

Christian Grech ◽

Marco Buzio ◽

Mariano Pentella ◽

Nicholas Sammut

Keyword(s):

Neural Network ◽

Network Model ◽

Recurrent Neural Network ◽

Neural Network Model ◽

Pure Iron ◽

Materials Science ◽

Particle Accelerator ◽

Magnetic Flux Density ◽

Training Procedure ◽

Dynamic Hysteresis

In this work, a Preisach-recurrent neural network model is proposed to predict the dynamic hysteresis in ARMCO pure iron, an important soft magnetic material in particle accelerator magnets. A recurrent neural network coupled with Preisach play operators is proposed, along with a novel validation method for the identification of the model’s parameters. The proposed model is found to predict the magnetic flux density of ARMCO pure iron with a Normalised Root Mean Square Error (NRMSE) better than 0.7%, when trained with just six different hysteresis loops. The model is evaluated using ramp-rates not used in the training procedure, which shows the ability of the model to predict data which has not been measured. The results demonstrate that the Preisach model based on a recurrent neural network can accurately describe ferromagnetic dynamic hysteresis when trained with a limited amount of data, showing the model’s potential in the field of materials science.

Download Full-text

FASTensor: A tensor framework for spatiotemporal description

10.5753/sibgrapi.est.2019.8298 ◽

2019 ◽

Author(s):

Virgı́nia F. Mota ◽

Jefersson A. dos Santos ◽

Arnaldo De A. Araújo

Keyword(s):

Cancer Cell ◽

Data Augmentation ◽

Human Action Recognition ◽

Human Action ◽

Research Field ◽

Small Data ◽

Learning Tools ◽

Melanoma Cancer ◽

Orientation Tensors ◽

Spatiotemporal Representation

Spatiotemporal description is a research field with applications in various areas such as video indexing, surveillance, human-computer interfaces, among others. Big Data problems in large databases are now being treated with Deep Learning tools, however we still have room for improvement in spatiotemporal handcraft description. Moreover, we still have problems that involve small data in which data augmentation and other techniques are not valid. The main contribution of this Ph.D. Thesis 1 is the development of a framework for spatiotemporal representation using orientation tensors enabling dimension reduction and invariance. This is a multipurpose framework called Features As Spatiotemporal Tensors (FASTensor). We evaluate this framework in three different applications: Human Action recognition, Video Pornography classification and Cancer Cell classification. The latter one is also a contribution of this work, since we introduce a new dataset called Melanoma Cancer Cell dataset (MCC). It is a small data that cannot be artificially augmented due the difficulty of extraction and the nature of motion. The results were competitive, while also being fast and simple to implement. Finally, our results in the MCC dataset can be used in other cancer cell treatment analysis.

Download Full-text

Segmentation Neural Network Incorporating Scale-Space in the Application of Cardiac MRI

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2020.3041 ◽

2020 ◽

Vol 10 (7) ◽

pp. 1494-1505

Author(s):

Hyo-Hun Kim ◽

Byung-Woo Hong

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Cardiac Mri ◽

Network Architecture ◽

Data Augmentation ◽

Scale Space ◽

Segmentation Algorithm ◽

Training Data ◽

Training Procedure ◽

Whole Heart

In this work, we present an image segmentation algorithm based on the convolutional neural network framework where the scale space theory is incorporated in the course of training procedure. The construction of data augmentation is designed to apply the scale space to the training data in order to effectively deal with the variability of regions of interest in geometry and appearance such as shape and contrast. The proposed data augmentation algorithm via scale space is aimed to improve invariant features with respect to both geometry and appearance by taking into consideration of their diffusion process. We develop a segmentation algorithm based on the convolutional neural network framework where the network architecture consists of encoding and decoding substructures in combination with the data augmentation scheme via the scale space induced by the heat equation. The quantitative analysis using the cardiac MRI dataset indicates that the proposed algorithm achieves better accuracy in the delineation of the left ventricles, which demonstrates the potential of the algorithm in the application of the whole heart segmentation as a compute-aided diagnosis system for the cardiac diseases.

Download Full-text

A Self-Improving Photosensitizer Discovery System via Bayesian Optimization and Quantum Chemical Calculation

10.26434/chemrxiv.14757975 ◽

2021 ◽

Author(s):

Shidang Xu ◽

Jiali Li ◽

Pengfei Cai ◽

Xiaoli Liu ◽

Bin Liu ◽

...

Keyword(s):

Artificial Intelligence ◽

Quantum Chemical ◽

Prediction Accuracy ◽

High Performance ◽

Materials Science ◽

Bayesian Optimization ◽

Average Error ◽

Chemical Calculation ◽

Discovery System ◽

Self Learning

Artificial intelligence (AI) based self-learning or self-improving material discovery system is the holy grail of next-generation material discovery and materials science. Herein, we demonstrate how to combine accurate prediction of material performance via quantum chemical calculations and Bayesian optimization-based active learning to realize a self-improving discovery system for high-performance photosensitizers (PS). Through self-improving cycles, such a system can improve the model prediction accuracy (best mean average error of 0.09 eV for singlet-triplet spitting) and high-performance PS search ability, realizing the efficient discovery of PS. From a molecular space with more than 7 million molecules, 5950 potential high-performance PSs were discovered.

Download Full-text

MIGAN: Malware Image Synthesis Using GANs

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.330110033 ◽

2019 ◽

Vol 33 ◽

pp. 10033-10034 ◽

Cited By ~ 1

Author(s):

Abhishek Singh ◽

Debojyoti Dutta ◽

Amit Saha

Keyword(s):

Language Processing ◽

Domain Knowledge ◽

Data Augmentation ◽

Image Synthesis ◽

Substantial Improvement ◽

Training Data ◽

Malware Analysis ◽

Training Procedure ◽

Original Dataset ◽

Augmentation Techniques

Majority of the advancement in Deep learning (DL) has occurred in domains such as computer vision, and natural language processing, where abundant training data is available. A major obstacle in leveraging DL techniques for malware analysis is the lack of sufficiently big, labeled datasets. In this paper, we take the first steps towards building a model which can synthesize labeled dataset of malware images using GAN. Such a model can be utilized to perform data augmentation for training a classifier. Furthermore, the model can be shared publicly for community to reap benefits of dataset without sharing the original dataset. First, we show the underlying idiosyncrasies of malware images and why existing data augmentation techniques as well as traditional GAN training fail to produce quality artificial samples. Next, we propose a new method for training GAN where we explicitly embed prior domain knowledge about the dataset into the training procedure. We show improvements in training stability and sample quality assessed on different metrics. Our experiments show substantial improvement on baselines and promise for using such a generative model for malware visualization systems.

Download Full-text

Mass agnostic jet taggers

SciPost Physics ◽

10.21468/scipostphys.8.1.011 ◽

2020 ◽

Vol 8 (1) ◽

Cited By ~ 8

Author(s):

Layne Bradshaw ◽

Rashmish K. Mishra ◽

Andrea Mitridate ◽

Bryan Ostdiek

Keyword(s):

Data Augmentation ◽

Large Data ◽

New Physics ◽

Large Data Sets ◽

Data Sets ◽

Training Procedure ◽

Signal Identification ◽

Training Techniques ◽

Quantitative Metrics ◽

Augmentation Techniques

Searching for new physics in large data sets needs a balance between two competing effects—signal identification vs background distortion. In this work, we perform a systematic study of both single variable and multivariate jet tagging methods that aim for this balance. The methods preserve the shape of the background distribution by either augmenting the training procedure or the data itself. Multiple quantitative metrics to compare the methods are considered, for tagging 2-, 3-, or 4-prong jets from the QCD background. This is the first study to show that the data augmentation techniques of Planing and PCA based scaling deliver similar performance as the augmented training techniques of Adversarial NN and uBoost, but are both easier to implement and computationally cheaper.

Download Full-text