oalogo2  

AUTHOR(S):

Gokhan Zorluoglu, Mustafa Agaoglu

 

TITLE

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods

pdf PDF

ABSTRACT

Breast cancer is a very serious malignant tumor originating from the breast cells. The disease occurs generally in women, but also men can rarely have it. During the prognosis of breast cancer, abnormal growth of cells in breast takes place and this growth can be in two types which are benign (non-cancerous) and malignant (cancerous). In this study, the aim is to diagnose the breast cancer using various intelligent techniques including Decision Trees (DT), Support Vector Machines (SVM), Artificial Neural Network (ANN) and also the ensemble of these techniques. Experimental studies were done using SPSS Clementine software and the results show that the ensemble model is better than the individual models according to the evaluation metric which is the accuracy. In order to increase the efficiency of the models, feature selection technique is applied. Moreover, models are also analyzed in terms of other error measures like sensitivity and specificity.

KEYWORDS

Artificial Neural Network, breast cancer, cross validation, C5.0, data mining, decision tree, support vector machine

REFERENCES

[1] Makinacı, M., Güneşer, C. (no date). GöğüsKanseriVerilerininSınıflandırılması.

[2] Mangasarian, O.L., Street, W.N., Wolberg, W.H. (1995). Breast cancer diagnosis and prognosis via linear programming. Operations Research, 43(4), pp. 570-577.

[3] Wolberg, W.H., Street, W.N, Mangasarian, O.L. (1994). Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Letters vol.77, 163-171.

[4] Wolberg, W.H., Street, W.N., Mangasarian, O.L. (1995a). Image analysis and machine learning applied to breast cancer diagnosis and prognosis. Analytical and Quantitative Cytology and Histology, Vol. 17 No. 2, pp. 77-87.

[5] Wolberg, W.H., Street, W.N., Heisey, D.M., Mangasarian, O.L. (1995b). Computerized breast cancer diagnosis and prognosis from fine needle aspirates. Archives of Surgery; 130:511-516.

[6] Wolberg, W.H., Street, W.N., Heisey, D.M., Mangasarian, O.L. (1995c) Computer-derived nuclear features distinguish malignant from benign breast cytology. Human Pathology, 26:792—796.

[7] Bache, K., Lichman, M. (2013). UCI Machine Learning Repository

[http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

[8] Bennett, K.P. (1992). Decision tree construction via linear programming. In Proceedings of the 4th Midwest Artificial Intelligence and Cognitive Science Society Conference, pp. 97-101.

[9] Bennett, K.P., Mangasarian, O.L (1992). Robust linear programming discrimination of two linearly inseparable sets. Optimization Methods and Software, 1:23-34.

[10] Nicholas, E. (2008). Introduction to Clementine and Data Mining. Brigham Young University

[11] Salama, G.I., Abdelhalim, M.B., Zeid, M.A. (2012). Breast Cancer Diagnosis on Three Different Datasets Using Multi-Classifiers. International Journal of Computer and Information Technology (2277 – 0764), Volume 01– Issue 01,

[12] Frank, A., Asuncion, A. (2010). UCI Machine Learning Repository

[http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

Cite this paper

Gokhan Zorluoglu, Mustafa Agaoglu. (2017) Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods. International Journal of Oncology and Cancer Therapy, 2, 24-27

 

cc.png
Copyright © 2017 Author(s) retain the copyright of this article.
This article is published under the terms of the Creative Commons Attribution License 4.0