JOURNAL ARTICLE

Optimizing Deep Neural Network Architecture with Enhanced Genetic Algorithm

Abstract

Only weights and biases are learned by gradient descent-based training of Deep Neural Networks (DNN). The other parameters, i.e., hyperparameters have a huge influence on the quality of the model but finding optimal values for them is not a trivial solution. The hyperparameter space grows exponentially and they also display non-linearity and interactions. A network with twelve hyperparameters, each with five potential values can have about 250 million unique combinations of hyperparameter sequences. If training on each set takes 6 minutes, exhaustive training on all potential combinations of the hyperparameters to find the optimal values will take almost 3000 years. Expert knowledge or random selection are some alternate options, but they are not scalable or consistently reliable. Metaheuristics such as evolutionary algorithms are a great choice for solving combinatorial optimization problems like hyperparameter optimization. While other researchers have used evolutionary algorithms such as standard implementation of genetic algorithm (GA), we introduce additional nature-inspired enhancements to GA for better exploration of the hyperparameter solution space to optimize the DNN architecture. The training is complemented with Monte Carlo based variance reduction method called importance sampling. We demonstrate that these fine-tunings result in improvements in the network accuracy on MNIST and CIFAR-10 datasets that outperforms standard use of genetic algorithm.

Keywords:
Hyperparameter MNIST database Computer science Machine learning Artificial intelligence Artificial neural network Genetic algorithm Evolutionary algorithm Algorithm

Metrics

25
Cited By
1.84
FWCI (Field Weighted Citation Impact)
61
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Metaheuristic Optimization Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.