Using swarm intelligence to optimize neural network hyperparameters: comparative analysis on MNIST and CIFAR-10

Authors

DOI:

https://doi.org/10.47813/2782-2818-2024-4-2-0291-0297

Keywords:

swarm intelligence, hyperparameter optimization, neural networks, particle swarm optimization, grid search, MNIST, CIFAR-10, algorithm efficiency, machine learning methods.

Abstract

Swarm Intelligence offers powerful methods for solving optimization problems used in configuring hyperparameters of neural networks. This article examines the performance of the particle swarm optimization algorithm compared to Grid Search on two different datasets: MNIST and CIFAR-10. Experimental results show that the effectiveness of optimization methods varies depending on the complexity of the task and the data.

Author Biographies

А. А. Inkizhekov

Anatolii Inkizhekov, PhD student at the Faculty of Digital Technologies and Design of N.F. Katanov Khakass State University, Abakan, Russia

A. S. Dulesov

Alexander Dulesov, Doctor of Technical Sciences, Associate Professor, Professor of the Department of Digital Technologies and Design, Katanov Khakass State University, Abakan, Russia

References

Kennedy J., Eberhart R. Particle Swarm Optimization. Proceedings of IEEE International Conference on Neural Networks. 1995; 4: 1942-1948. DOI: 10.1109/ICNN.1995.488968 DOI: https://doi.org/10.1109/ICNN.1995.488968

Poli R., Kennedy J., Blackwell T. Particle swarm optimization: An overview. Swarm Intelligence. 2007; 1(1): 33-57. DOI: 10.1007/s11721-007-0002-0 DOI: https://doi.org/10.1007/s11721-007-0002-0

Shi Y., Eberhart R.C. A modified particle swarm optimizer. 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence. 1998; Cat.No.98TH8360: 69-73. DOI: 10.1109/ICEC.1998.699146 DOI: https://doi.org/10.1109/ICEC.1998.699146

Goodfellow I., Bengio Y., Courville A. Deep Learning. MIT Press; 2016.

LeCun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE. 1998; 86(11): 2278-2324. DOI: 10.1109/5.726791 DOI: https://doi.org/10.1109/5.726791

Krizhevsky A., Hinton G. Learning multiple layers of features from tiny images. Technical Report. University of Toronto; 2009. Link. http://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf

Published

2024-06-27

How to Cite

Inkizhekov А. А., & Dulesov, A. S. (2024). Using swarm intelligence to optimize neural network hyperparameters: comparative analysis on MNIST and CIFAR-10. Modern Innovations, Systems and Technologies, 4(2), 0291–0297. https://doi.org/10.47813/2782-2818-2024-4-2-0291-0297

Issue

Section

IT and informatics