Comparative Studies of Different Fuzzy-C-Means Clustering Algorithms for Machine Learning

Abstract

A common machine learning technique for grouping data into clusters according to similarity is fuzzy C-Means (FCM) clustering, which permits each data point to belong to numerous clusters with differing degrees of membership. Because of its adaptability, FCM is a desirable option for applications including anomaly detection, pattern identification, and image segmentation. To overcome certain drawbacks including initialization sensitivity, computational cost, and managing data noise, several iterations and adaptations of the FCM algorithm have been put forth. In the context of machine learning applications, this work compares a number of enhanced and updated FCM algorithms. The study highlights the theoretical underpinnings, advantages, and disadvantages of the basic FCM algorithm as well as more sophisticated variants including Weighted FCM, Kernelized FCM, and Possibilistic FCM. Performance parameters such as resilience to noise, convergence speed, computing economy, and clustering accuracy are used in the analysis. The influence of these algorithms in other fields, such as picture clustering, medical diagnosis, and customer segmentation, is also examined in this research. The main conclusions show that although the standard FCM technique is popular because it is straightforward and efficient, more sophisticated variants, such Kernelized FCM, perform better in intricate, non-linear datasets. While possibilistic FCM delivers increases in noise tolerance and fuzzy membership interpretability, weighted FCM is superior at handling outliers.

Country : India

1 Moksud Alam Mallik2 Hafsa Yasmeen3 Nousheen Begum4 Md Saiful Islam5 Sheik Jamil Ahmed

  1. Dean R&D and Associate Professor, Department of CSE (Data Science), Lords Institute of Engineering and Technology, Hyderabad, India
  2. UG Student, Department of CSE (Data Science), Lords Institute of Engineering and Technology, Hyderabad, India
  3. UG Student, Department of CSE (Data Science), Lords Institute of Engineering and Technology, Hyderabad, India
  4. Assistant Professor, Department of Electronics and Communication Engineering, Sphoorthy Engineering College, Nadergul, Hyderabad, India
  5. Assistant Professor, Department of CSE (Data Science), Madanapalle Institute of Technology & Science, Madanapalle, Andhra Pradesh, India

IRJIET, Volume 9, Special Issue of INSPIRE’25 April 2025 pp. 400-406

doi.org/10.47001/IRJIET/2025.INSPIRE65

References

  1. Oyewole, G.J. and Thopil, G.A. (2022) ‘Data clustering: Application and Trends’, Artificial Intelligence Review, 56(7), pp. 6439–6475. doi:10.1007/s10462-022-10325-y.
  2. Sikarwar, N. and Tomar, R.S. (2024) ‘A new approach for wireless sensor networks based on tree-based routing using hybrid fuzzy c-means with genetic algorithm’, Engineering, Technology & Applied Science Research, 14(3), pp. 14141–14147. doi:10.48084/etasr.7078.
  3. Kusumadewi, S., Rosita, L. and Wahyuni, E. (2023) Performance of fuzzy c-means and fuzzy subtractive clustering on medical data imputation [Preprint]. doi:10.2139/ssrn.4473319.
  4. (No date a) Figure 13: K-nearest neighbour (KNN). [Preprint]. doi:10.7717/peerj-cs.437/fig-13.
  5. Yadav, H., Singh, J. and Gosain, A. (2023) ‘Experimental analysis of fuzzy clustering techniques for outlier detection’, Procedia Computer Science, 218, pp. 959–968. doi:10.1016/j.procs.2023.01.076.
  6. Cai, Q. and Liu, W. (2009a) ‘TSK fuzzy model using kernel-based fuzzy C-means clustering’, 2009 IEEE International Conference on Fuzzy Systems, pp. 308–312. doi:10.1109/fuzzy.2009.5277146.
  7. Singh, P. et al. (2023) ‘Comparative study of rough set-based FCM and K-means clustering for tumor segmentation from brain MRI images’, Revue d’IntelligenceArtificielle, 37(4), pp. 921–927. doi:10.18280/ria.370412.
  8. Mrugalska, B. (no date) Clustering of cities based on their smart performances: A comparative approach of fuzzy c-means, k-means, and K-Medoids_supp2-3333753.Docx [Preprint]. doi:10.1109/access.2023.3333753/mm1.
  9. Rani, Y., Manju and Rohil, H. (2014) ‘Comparative analysis of birch and cure hierarchical clustering algorithm using WEKA 3.6.9’, The SIJ Transactions on Computer Science Engineering & its Applications (CSEA), 02(01), pp. 25–29. doi:10.9756/sijcsea/v2i1/0201080201.
  10. Hashemi, S.E., Gholian-Jouybari, F. and Hajiaghaei-Keshteli, M. (2023) ‘A fuzzy c-means algorithm for optimizing data clustering’, Expert Systems with Applications, 227, p. 120377. doi:10.1016/j.eswa.2023.120377.
  11. Almomany, A., Jarrah, A. and Al Assaf, A. (2022) ‘FCM clustering approach optimization using parallel high-speed Intel FPGA technology’, Journal of Electrical and Computer Engineering, 2022, pp. 1–11. doi:10.1155/2022/8260283.
  12. Mehidi, I., Belkhiat, D.E. and Jabri, D. (2022) Comparative analysis of improved of FCM algorithms in the segmentation of retinal blood vessels [Preprint]. doi:10.21203/rs.3.rs-886041/v1.
  13. Eftekhari, S.H. et al. (2024) ‘Employing statistical algorithms and clustering techniques to assess lithological facies for identifying Optimal Reservoir Rocks: A case study of the Mansouri oilfields, SW Iran’, Minerals, 14(3), p. 233. doi:10.3390/min14030233.
  14. ‘Fuzzy clustering and genetic algorithms’ (2010) Biomedical Signal Analysis [Preprint]. doi:10.7551/mitpress/7558.003.0009.
  15. Alqhtani, S.M. et al. (2024a) ‘Improved brain tumor segmentation and classification in brain MRI with FCM-SVM: A diagnostic approach’, IEEE Access, 12, pp. 61312–61335. doi:10.1109/access.2024.3394541.
  16. Bharadiya, J.P. (2023) ‘The role of machine learning in Transforming Business Intelligence’, International Journal of Computing and Artificial Intelligence, 4(1), pp. 16–24. doi:10.33545/27076571.2023.v4.i1a.60.