Training time reduction for network architecture search using genetic algorithm

Author(s):  
Yuki Matsuoka ◽  
Hiroaki Aizawa ◽  
Junya Sato ◽  
Kunihito Kato
2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Changyan Zhu ◽  
Eng Aik Chan ◽  
You Wang ◽  
Weina Peng ◽  
Ruixiang Guo ◽  
...  

AbstractMultimode fibers (MMFs) have the potential to carry complex images for endoscopy and related applications, but decoding the complex speckle patterns produced by mode-mixing and modal dispersion in MMFs is a serious challenge. Several groups have recently shown that convolutional neural networks (CNNs) can be trained to perform high-fidelity MMF image reconstruction. We find that a considerably simpler neural network architecture, the single hidden layer dense neural network, performs at least as well as previously-used CNNs in terms of image reconstruction fidelity, and is superior in terms of training time and computing resources required. The trained networks can accurately reconstruct MMF images collected over a week after the cessation of the training set, with the dense network performing as well as the CNN over the entire period.


2010 ◽  
Vol 39 ◽  
pp. 247-252
Author(s):  
Sheng Xu ◽  
Zhi Juan Wang ◽  
Hui Fang Zhao

A two-stage neural network architecture constructed by combining potential support vector machines (P-SVM) with genetic algorithm (GA) and gray correlation coefficient analysis (GCCA) is proposed for patent innovation factors evolution. The enterprises patent innovation is complex to conduct due to its nonlinearity of influenced factors. It is necessary to make a trade off among these factors when some of them conflict firstly. A novel way about nonlinear regression model with the potential support vector machines (P-SVM) is presented in this paper. In the model development, the genetic algorithm is employed to optimize P-SVM parameters selection. After the selected key factors by the PSVM with GA model, the main factors that affect patent innovation generation have been quantitatively studied using the method of gray correlation coefficient analysis. Using a set of real data in China, the results show that the methods developed in this paper can provide valuable information for patent innovation management and related municipal planning projects.


2020 ◽  
pp. 104-117
Author(s):  
O.S. Amosov ◽  
◽  
S.G. Amosova ◽  
D.S. Magola ◽  
◽  
...  

The task of multiclass network classification of computer attacks is given. The applicability of deep neural network technology in problem solving has been considered. Deep neural network architecture was chosen based on the strategy of combining a set of convolution and recurrence LSTM layers. Op-timization of neural network parameters based on genetic algorithm is proposed. The presented results of modeling show the possibility of solving the network classification problem in real time.


2020 ◽  
Vol 2020 ◽  
pp. 1-11 ◽  
Author(s):  
Li-li Li ◽  
Kun Chen ◽  
Jian-min Gao ◽  
Hui Li

Aiming at the problems of the lack of abnormal instances and the lag of quality anomaly discovery in quality database, this paper proposed the method of recognizing quality anomaly from the quality control chart data by probabilistic neural network (PNN) optimized by improved genetic algorithm, which made up deficiencies of SPC control charts in practical application. Principal component analysis (PCA) reduced the dimension and extracted the feature of the original data of a control chart, which reduced the training time of PNN. PNN recognized successfully both single pattern and mixed pattern of control charts because of its simple network structure and excellent recognition effect. In order to eliminate the defect of experience value, the key parameter of PNN was optimized by the improved (SGA) single-target optimization genetic algorithm, which made PNN achieve a higher rate of recognition accuracy than PNN optimized by standard genetic algorithm. Finally, the above method was validated by a simulation experiment and proved to be the most effective method compared with traditional BP neural network, single PNN, PCA-PNN without parameters optimized, and SVM optimized by particle swarm optimization algorithm.


Complexity ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-13
Author(s):  
Yong Tian ◽  
Lina Ma ◽  
Songtao Yang ◽  
Qian Wang

Reliable assessment on the environmental impact of aircraft operation is vital for the performance evaluation and sustainable development of civil aviation. A new methodology for calculating the greenhouse effect of aircraft cruise is proposed in this paper. With respect to both cruise strategies and wind factors, a genetic algorithm-optimized wavelet neural network topology is designed to model the fuel flow-rate and developed using the real flight records data. Validation tests demonstrate that the proposed model with preferred network architecture can outperform others investigated in this paper in terms of accuracy and stability. Numerical examples are illustrated using 9 flights from Beijing Capital International Airport to Shanghai Hongqiao International Airport operated by Boeing 737–800 aircraft on October 2, 2019, and the generated fuel consumption, CO2 and NOx emissions as well as temperature change for different time horizons can be effectively given through the proposed methodology, which helps in the environmental performance evaluation and future trajectory planning for aircraft cruise.


2018 ◽  
Vol 15 (4) ◽  
pp. 172988141879299 ◽  
Author(s):  
Zhiyu Zhou ◽  
Hanxuan Guo ◽  
Yaming Wang ◽  
Zefei Zhu ◽  
Jiang Wu ◽  
...  

This article presents an intelligent algorithm based on extreme learning machine and sequential mutation genetic algorithm to determine the inverse kinematics solutions of a robotic manipulator with six degrees of freedom. This algorithm is developed to minimize the computational time without compromising the accuracy of the end effector. In the proposed algorithm, the preliminary inverse kinematics solution is first computed by extreme learning machine and the solution is then optimized by an improved genetic algorithm based on sequential mutation. Extreme learning machine randomly initializes the weights of the input layer and biases of the hidden layer, which greatly improves the training speed. Unlike classical genetic algorithms, sequential mutation genetic algorithm changes the order of the genetic codes from high to low, which reduces the randomness of mutation operation and improves the local search capability. Consequently, the convergence speed at the end of evolution is improved. The performance of the extreme learning machine and sequential mutation genetic algorithm is also compared with that of a hybrid intelligent algorithm, and the results showed that there is significant reduction in the training time and computational time while the solution accuracy is retained. Based on the experimental results, the proposed extreme learning machine and sequential mutation genetic algorithm can greatly improve the time efficiency while ensuring high accuracy of the end effector.


2017 ◽  
Vol 28 (02) ◽  
pp. 1750018
Author(s):  
Troy Shinbrot ◽  
Miguel Vivar Lazo ◽  
Theo Siu

We examine a dynamical network model of visual processing that reproduces several aspects of a well-known optical illusion, including subtle dependencies on curvature and scale. The model uses a genetic algorithm to construct the percept of an image, and we show that this percept evolves dynamically so as to produce the illusions reported. We find that the perceived illusions are hardwired into the model architecture and we propose that this approach may serve as an archetype to distinguish behaviors that are due to nature (i.e. a fixed network architecture) from those subject to nurture (that can be plastically altered through learning).


Electronics ◽  
2021 ◽  
Vol 10 (16) ◽  
pp. 1879
Author(s):  
Zahid Ali Siddiqui ◽  
Unsang Park

In this paper, we present a novel incremental learning technique to solve the catastrophic forgetting problem observed in the CNN architectures. We used a progressive deep neural network to incrementally learn new classes while keeping the performance of the network unchanged on old classes. The incremental training requires us to train the network only for new classes and fine-tune the final fully connected layer, without needing to train the entire network again, which significantly reduces the training time. We evaluate the proposed architecture extensively on image classification task using Fashion MNIST, CIFAR-100 and ImageNet-1000 datasets. Experimental results show that the proposed network architecture not only alleviates catastrophic forgetting but can also leverages prior knowledge via lateral connections to previously learned classes and their features. In addition, the proposed scheme is easily scalable and does not require structural changes on the network trained on the old task, which are highly required properties in embedded systems.


Sign in / Sign up

Export Citation Format

Share Document