Gradient descent-based training of deep neural networks (DNNs) learns only the weights and biases. The remaining parameters, the hyperparameters, have a strong influence on model quality, but finding optimal values for them is not a trivial task. The hyperparameter space grows exponentially with the number of hyperparameters, which also exhibit non-linear effects and interactions. A network with twelve hyperparameters, each taking five potential values, yields roughly 244 million unique hyperparameter combinations. If training on each configuration takes 6 minutes, exhaustively training on all potential combinations to find the optimal values would take almost 3,000 years. Expert knowledge and random selection are alternative options, but they are neither scalable nor consistently reliable. Metaheuristics such as evolutionary algorithms are well suited to combinatorial optimization problems like hyperparameter optimization. While other researchers have applied evolutionary algorithms such as the standard genetic algorithm (GA), we introduce additional nature-inspired enhancements to the GA for better exploration of the hyperparameter solution space when optimizing the DNN architecture. The training is complemented with a Monte Carlo-based variance reduction method, importance sampling. We demonstrate that these refinements improve network accuracy on the MNIST and CIFAR-10 datasets, outperforming the standard use of the genetic algorithm.
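As a rough check of the combinatorial argument above (assuming, as stated, twelve hyperparameters with five candidate values each and a 6-minute training run per configuration), the cost of an exhaustive search works out as:

\[
5^{12} = 244{,}140{,}625 \ \text{configurations}, \qquad
\frac{244{,}140{,}625 \times 6\ \text{min}}{60 \times 24 \times 365} \approx 2{,}787\ \text{years}.
\]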