Using swarm intelligence to optimize neural network hyperparameters: comparative analysis on MNIST and CIFAR-10
DOI:
https://doi.org/10.47813/2782-2818-2024-4-2-0291-0297Keywords:
swarm intelligence, hyperparameter optimization, neural networks, particle swarm optimization, grid search, MNIST, CIFAR-10, algorithm efficiency, machine learning methods.Abstract
Swarm Intelligence offers powerful methods for solving optimization problems used in configuring hyperparameters of neural networks. This article examines the performance of the particle swarm optimization algorithm compared to Grid Search on two different datasets: MNIST and CIFAR-10. Experimental results show that the effectiveness of optimization methods varies depending on the complexity of the task and the data.
References
Kennedy J., Eberhart R. Particle Swarm Optimization. Proceedings of IEEE International Conference on Neural Networks. 1995; 4: 1942-1948. DOI: 10.1109/ICNN.1995.488968 DOI: https://doi.org/10.1109/ICNN.1995.488968
Poli R., Kennedy J., Blackwell T. Particle swarm optimization: An overview. Swarm Intelligence. 2007; 1(1): 33-57. DOI: 10.1007/s11721-007-0002-0 DOI: https://doi.org/10.1007/s11721-007-0002-0
Shi Y., Eberhart R.C. A modified particle swarm optimizer. 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence. 1998; Cat.No.98TH8360: 69-73. DOI: 10.1109/ICEC.1998.699146 DOI: https://doi.org/10.1109/ICEC.1998.699146
Goodfellow I., Bengio Y., Courville A. Deep Learning. MIT Press; 2016.
LeCun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE. 1998; 86(11): 2278-2324. DOI: 10.1109/5.726791 DOI: https://doi.org/10.1109/5.726791
Krizhevsky A., Hinton G. Learning multiple layers of features from tiny images. Technical Report. University of Toronto; 2009. Link. http://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 A. A. Inkizhekov, A. S. Dulesov
This work is licensed under a Creative Commons Attribution 4.0 International License.
The journal MIST - "Modern Innovations, Systems and Technologies" publishes materials under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) license, hosted on the official website of the non-profit corporation Creative Commons:
This work is licensed under a Creative Commons Attribution 4.0 International License.
This means that users can copy and distribute materials in any medium and in any format, adapt and transform texts, use content for any purpose, including commercial ones. At the same time, the terms of use must be observed - an indication of the author of the original work and the source: you should indicate the output of the articles, provide a link to the source, and also indicate what changes have been made