An application of the whale optimization algorithm with Levy flight strategy for clustering of medical datasets
DOI:
https://doi.org/10.11121/ijocta.01.2021.001091Keywords:
Clustering, Whale optimization algorithm , Levy flight , K-means , K-medoids , Fuzzy c-meansAbstract
Clustering, which is handled by many researchers, is separating data into clusters without supervision. In clustering, the data are grouped using similarities or differences between them. Many traditional and heuristic algorithms are used in clustering problems and new techniques continue to be developed today. In this study, a new and effective clustering algorithm was developed by using the Whale Optimization Algorithm (WOA) and Levy flight (LF) strategy that imitates the hunting behavior of whales. With the developed WOA-LF algorithm, clustering was performed using ten medical datasets taken from the UCI Machine Learning Repository database. The clustering performance of the WOA-LF was compared with the performance of k-means, k-medoids, fuzzy c-means and the original WOA clustering algorithms. Application results showed that WOA-LF has more successful clustering performance in general and can be used as an alternative algorithm in clustering problems.
Downloads
References
Evirgen, F. (2016). Analyze the optimal solutions of optimization problems by means of fractional gradient based system using VIM. An International Journal of Optimization and Control: Theories & Applications (IJOCTA), 6(2), 75-83.
Evirgen, F. (2017). Conformable fractional gradient based dynamic system for constrained optimization problem. Special issue of the 3rd International Conference on Computational and Experimental Science and Engineering (ICCESEN 2016),1066-1069.
Evirgen, F. and Yavuz, M. (2018). An alternative approach for nonlinear optimization problem with Caputo-Fabrizio derivative. ITM Web of Conferences,01009.
Cui, D. (2017). Application of whale optimization algorithm in reservoir optimal operation. Advances in Science and Technology of Water Resources, 37(3), 72-79.
Zhang, C., Ouyang, D., and Ning, J. (2010). An artificial bee colony approach for clustering. Expert systems with applications, 37(7), 4761-4767.
Selim, S. Z. and Alsultan, K. (1991). A simulated annealing algorithm for the clustering problem. Pattern recognition, 24(10), 1003-1008.
Maulik, U. and Mukhopadhyay, A. (2010). Simulated annealing based automatic fuzzy clustering combined with ANN classification for analyzing microarray data. Computers & operations research, 37(8), 1369-1380.
Maulik, U. and Bandyopadhyay, S. (2000). Genetic algorithm-based clustering technique. Pattern recognition, 33(9), 1455-1465.
Shelokar, P., Jayaraman, V. K., and Kulkarni, B. D. (2004). An ant colony approach for clustering. Analytica Chimica Acta, 509(2), 187-195.
Van der Merwe, D. and Engelbrecht, A. P. (2003). Data clustering using particle swarm optimization. The 2003 Congress on Evolutionary Computation, 2003. CEC'03.,215-220.
Cura, T. (2012). A particle swarm optimization approach to clustering. Expert Systems with Applications, 39(1), 1582-1588.
Karaboga, D. and Ozturk, C. (2011). A novel clustering approach: Artificial Bee Colony (ABC) algorithm. Applied soft computing, 11(1), 652-657.
Armano, G. and Farmani, M. R. (2014). Clustering analysis with combination of artificial bee colony algorithm and k-means technique. International Journal of Computer Theory and Engineering, 6, 141-145.
Karthikeyan, S. and Christopher, T. (2014). A hybrid clustering approach using artificial bee colony (ABC) and particle swarm optimization. International Journal of Computer Applications, 100(15).
Mane, S. U. and Gaikwad, P. G. (2014). Hybrid particle swarm optimization (HPSO) for data clustering. International Journal of Computer Applications, 97(19).
Mirjalili, S. and Lewis, A. (2016). The whale optimization algorithm. Advances in engineering software, 95, 51-67.
Kennedy, J. and Eberhart, R. (1995). Particle swarm optimization. Proceedings of ICNN'95-international conference on neural networks,1942-1948.
Storn, R. and Price, K. (1997). Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces. Journal of global optimization, 11(4), 341-359.
Rashedi, E., Nezamabadi-Pour, H., and Saryazdi, S. (2009). GSA: a gravitational search algorithm. Information sciences, 179(13), 2232-2248.
Yao, X., Liu, Y., and Lin, G. (1999). Evolutionary programming made faster. IEEE Transactions on Evolutionary computation, 3(2), 82-102.
Nasiri, J. and Khiyabani, F. M. (2018). A whale optimization algorithm (WOA) approach for clustering. Cogent Mathematics & Statistics, 5(1), 1483565.
Canayaz, M. and Özda?, R. (2017). Data clustering based on the whale optimization. Middle East Journal of Technic, 2(2), 178-187.
Al-Temeemy, A. A., Spencer, J., and Ralph, J. (2010). Levy flights for improved ladar scanning. 2010 IEEE International Conference on Imaging Systems and Techniques,225-228.
Chen, Y. (2010). Research and simulation on Levy flight model for DTN. 2010 3rd International Congress on Image and Signal Processing,4421-4423.
Pereyra, M. A. and Batatia, H. (2010). A Levy flight model for ultrasound in skin tissues. 2010 IEEE International Ultrasonics Symposium,2327-2331.
Terdik, G. and Gyires, T. (2008). Lévy flights and fractal modeling of internet traffic. IEEE/ACM Transactions on Networking, 17(1), 120-129.
Sutantyo, D. K., Kernbach, S., Levi, P., and Nepomnyashchikh, V. A. (2010). Multi-robot searching algorithm using Lévy flight and artificial potential field. 2010 IEEE Safety Security and Rescue Robotics,1-6.
Rhee, I., Shin, M., Hong, S., Lee, K., Kim, S. J., and Chong, S. (2011). On the levy-walk nature of human mobility. IEEE/ACM transactions on networking, 19(3), 630-643.
Edwards, A. M., Phillips, R. A., Watkins, N. W., Freeman, M. P., Murphy, E. J., Afanasyev, V., et al. (2007). Revisiting Lévy flight search patterns of wandering albatrosses, bumblebees and deer. Nature, 449(7165), 1044-1048.
Viswanathan, G. M., Afanasyev, V., Buldyrev, S., Murphy, E., Prince, P., and Stanley, H. E. (1996). Lévy flight search patterns of wandering albatrosses. Nature, 381(6581), 413-415.
Yang, X.-S. and Deb, S. (2013). Multiobjective cuckoo search for design optimization. Computers & Operations Research, 40(6), 1616-1624.
Yang, X.-S. (2010). Firefly algorithm, Levy flights and global optimization. Research and development in intelligent systems XXVI. Springer, 209-218.
Murphy, P. and Aha, D. (1994). UCI repository of machine learning. University of California, Department of Information and Computer Science.
Ozdamar, K. (2002). Paket programlari ile istatistiksel veri analizi-1. Kaan Kitabevi, Eskisehir.
Tatl?dil, H. (1996). Uygulamal? Çok De?i?kenli ?statistiksel Analiz. Cem Ofset Ltd. ?ti, Ankara.
Hair Jr, J., Anderson, R., and Tatham, R. (1998). Multivariate data analysis.NJ: PrenticeYHall Inc, Upper Saddle River.
Lorr, M. (1983). Cluster analysis for social scientists.Jossey-Bass Incorporated Pub.
Boushaki, S. I., Kamel, N., and Bendjeghaba, O. (2018). A new quantum chaotic cuckoo search algorithm for data clustering. Expert Systems with Applications, 96, 358-372.
Frigui, H. and Krishnapuram, R. (1999). A robust competitive clustering algorithm with applications in computer vision. Ieee transactions on pattern analysis and machine intelligence, 21(5), 450-465.
Karakoyun, M. (2015). Kurba?a s?çrama algoritmas?n?n kümeleme problemlerine uygulanmas?. Master Thesis. Selçuk University.
Sharma, S. (1996). Applied multivariate techniques.
Tabak, J. (2014). Geometry: the language of space and form. Infobase Publishing.
MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. Proceedings of the fifth Berkeley symposium on mathematical statistics and probability,281-297.
Han, J. and Kamber, M. (2010). Data mining: concepts and techniques. [Nachdr.], Amsterdam: Elsevier/Morgan Kaufmann, 11, 6.
Tan, P.-N., Steinbach, M., and Kumar, V. (2006). Classification: basic concepts, decision trees, and model evaluation. Introduction to data mining, 1, 145-205.
Xu, R. and Wunsch, D. (2005). Survey of clustering algorithms. IEEE Transactions on neural networks, 16(3), 645-678.
Dodge, Y. (2012). Statistical data analysis based on the L1-norm and related methods. Birkhäuser.
Dinçer, ?. E. (2006). Veri madencili?inde K-means algoritmas? ve t?p alan?nda uygulanmas?. Master Thesis. Kocaeli University.
Karakoyun, M., Saglam, A., Baykan, N. A., and Altun, A. A. (2017). Non-locally color image segmentation for remote sensing images in different color spaces by using data-clustering methods. 5th International Conference on Advanced Technology & Sciences (ICAT'17), 6-12.
Höppner, F., Klawonn, F., Kruse, R., and Runkler, T. (1999). Fuzzy cluster analysis: methods for classification, data analysis and image recognition. John Wiley & Sons.
Moertini, V. (2002). Introduction to Five DataClustering Algorithms Clustering Algorithm. Integral, 7(2).
Goldbogen, J. A., Friedlaender, A. S., Calambokidis, J., Mckenna, M. F., Simon, M., and Nowacek, D. P. (2013). Integrative approaches to the study of baleen whale diving behavior, feeding performance, and foraging ecology. BioScience, 63(2), 90-100.
Tany?ld?z?, E. and Cigal?, T. (2017). Kaotik Harital? Balina Optimizasyon Algoritmalar?. F?rat Üniversitesi Mühendislik Bilimleri Dergisi, 29(1), 307-317.
Pavlyukevich, I. (2007). Lévy flights, non-local search and simulated annealing. Journal of Computational Physics, 226(2), 1830-1844.
Reynolds, A. M. and Frye, M. A. (2007). Free-flight odor tracking in Drosophila is consistent with an optimal intermittent scale-free search. PloS one, 2(4), e354.
Shlesinger, M. F. (2006). Search research. Nature, 443(7109), 281-282.
Chechkin, A. V., Metzler, R., Klafter, J., and Gonchar, V. Y. (2008). Introduction to the theory of Lévy flights. Anomalous transport, 1, 129.
Yang, X.-S. (2010). Engineering optimization: an introduction with metaheuristic applications. John Wiley & Sons.
Lee, C.-Y. and Yao, X. (2001). Evolutionary algorithms with adaptive lévy mutations. Proceedings of the 2001 congress on evolutionary computation (IEEE Cat. No. 01TH8546), 568-575.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 Ay?e Nagehan Mat, Onur ?nan, Murat Karakoyun
This work is licensed under a Creative Commons Attribution 4.0 International License.
Articles published in IJOCTA are made freely available online immediately upon publication, without subscription barriers to access. All articles published in this journal are licensed under the Creative Commons Attribution 4.0 International License (click here to read the full-text legal code). This broad license was developed to facilitate open access to, and free use of, original works of all types. Applying this standard license to your work will ensure your right to make your work freely and openly available.
Under the Creative Commons Attribution 4.0 International License, authors retain ownership of the copyright for their article, but authors allow anyone to download, reuse, reprint, modify, distribute, and/or copy articles in IJOCTA, so long as the original authors and source are credited.
The readers are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material
- for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
under the following terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
This work is licensed under a Creative Commons Attribution 4.0 International License.