Hybrid and Ensemble Learning Approaches for Accurate Breast Cancer Detection and Classification

Main Article Content

Sridhar Siripurapu
Sateesh Gudla
Sridhar Bokka
Ramarao Bonula
Nataraj Dasari

Abstract

Breast cancer is a leading cause of mortality among women worldwide, underscoring the need for accurate differentiation between malignant and benign tumors to support early diagnosis and timely treatment. Malignant tumors are invasive and often require aggressive therapy, while benign tumors are non-cancerous and localized. Advances in Machine Learning (ML) and Deep Learning (DL) have significantly enhanced diagnostic performance in healthcare. This study explores hybrid and ensemble learning approaches -including Bagging, Boosting (AdaBoost, Gradient Boosting, and XGBoost), and Stacking—combined with traditional ML classifiers such as Logistic Regression, Support Vector Machine, K-Nearest Neighbor, Decision Tree, and Random Forest, alongside DL models including Convolutional Neural Networks and Long Short-Term Memory networks for breast cancer detection. Experiments were conducted on the Wisconsin Diagnostic Breast Cancer dataset. The workflow incorporated preprocessing, feature selection, and SMOTE-based class imbalance handling applied to training folds only. Model robustness was ensured through 10-fold stratified cross-validation with GridSearchCV hyperparameter tuning. Results show that ensemble and hybrid models outperform individual classifiers, with SVM-KNN and boosting methods demonstrating particularly strong and clinically relevant performance.

Article Details

How to Cite
Sridhar Siripurapu, Sateesh Gudla, Sridhar Bokka, Ramarao Bonula, & Nataraj Dasari. (2026). Hybrid and Ensemble Learning Approaches for Accurate Breast Cancer Detection and Classification. Science & Technology Asia, 31(1), 125–144. retrieved from https://ph02.tci-thaijo.org/index.php/SciTechAsia/article/view/258753
Section
Engineering

References

Singh I, Sanwal K, Praveen S. Breast cancer detection using two-fold genetic evolution of neural network ensembles. In: 2016 International Conference on Data Science and Engineering (ICDSE). 2016.

Eman N, Cabatuan MK, Dadios EP, Gan Lim LA. Detecting and tracking female breasts using neural network in real-time. In: 2013 IEEE Region 10 Conference (TENCON). 2013.

Gupta N, Gupta HK, Srivastava R, Saxena C, Surjeete. Design and implementation of artificial neural network classifiers based on hypertuning parameters for breast cancer diagnosis. Procedia Comput Sci. 2024.

Tariq N. Breast cancer detection using artificial neural networks. J Mol Biomark Diagn.

Higa A. Diagnosis of breast cancer using decision tree and artificial neural network algorithms. Int J Comput Appl Technol Res.

Wadhwa G, Mathur M. A convolutional neural network approach for the diagnosis of breast cancer. In: 2020 Sixth International Conference on Parallel, Distributed and Grid Computing (PDGC). 2020.

Liang Y, Yang J, Quan X, Zhang H. Metastatic breast cancer recognition in histopathology images using convolutional neural network with attention mechanism. In: 2019 Chinese Automation Congress (CAC). 2019.

Gonçalves CB, Souza JR, Fernandes H. CNN architecture optimization using bioinspired algorithms for breast cancer detection in infrared images. Comput Biol Med. 2021.

Wang Z, Li M, Wang H, Jiang H, Yao Y, Zhang H, et al. Breast cancer detection using extreme learning machine based on feature fusion with CNN deep features. IEEE Access. 2019.

Wu Y, Zhu H. Investigation of long short-term memory based ultrawide band microwave breast tumor size prediction. In: 2019 12th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISPBMEI). 2019.

Begum A, Kumar VD, Asghar J, Hemalatha D, Arulkumaran G. A combined deep CNN-LSTM with a random forest approach for breast cancer diagnosis. 2022.

Liew XY, Hameed N, Clos J. An investigation of XGBoost-based algorithm for breast cancer classification. Mach Learn Appl. 2021.

Maleki A, Raahemi M, Nasiri H. Breast cancer diagnosis from histopathology images using deep neural network and XGBoost. Biomed Signal Process Control. 2023.

Gorishniy Y, et al. Revisiting deep learning models for tabular data. In: Advances in Neural Information Processing Systems (NeurIPS). 2021.

Arik SO, Pfister T. TabNet: Attentive interpretable tabular learning. In: AAAI Conference on Artificial Intelligence. 2021.

Huang H, et al. Transformer-based models for tabular data: A survey. IEEE Access. 2023.

Hutter F, Kotthoff L, Vanschoren J, editors. Automated machine learning: Methods, systems, challenges. Springer; 2022.

Shwartz-Ziv R, et al. Tabular deep learning: A review. ACM Comput Surv. 2024.

Author(s). AutoML for clinical decision support systems. IEEE J Biomed Health Inform. 2023–5.

Pranatha MDA, Pramaita N, Sudarma M, Widyantara IMO. Filtering outlier data using box whisker plot method for fuzzy time series rainfall forecasting. In: 2018 4th International Conference on Wireless and Telematics (ICWT). IEEE; 2018. p.1–4.

Shah D, Patel S, Bharti SK. Heart disease prediction using machine learning techniques. SN Comput Sci. 2020;1:1–6.

Nti IK, Boateng ON, Aning J. Performance of ML algorithms with different K values in KCV. Int J Inf Technol Comput Sci. 2021;6:61–71.

Fernandez A, et al. SMOTE for learning from imbalanced data: Progress and challenges, marking the 15-year anniversary. J Artif Intell Res. 2018;61:863–905.

Blagus R, Lusa L. SMOTE for highdimensional class-imbalanced data. BMC Bioinformatics. 2013;14:106.

Bergstra J, et al. Algorithms for hyperparameter optimization. In: Advances in Neural Information Processing Systems. 2011;24.

JaiKrishna SV, Chantarakasemchit O, Meesad P. A breakup machine learning approach for breast cancer prediction. In: 2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE). 2019.

Dhungel N, Carneiro G, Bradley AP. Automated mass detection in mammograms using cascaded deep learning and random forests. In: 2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA). 2015.

Botlagunta M, et al. Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms. Sci Rep. 2023;13(1):1–17.

Binsaif N. Application of machine learning models to the detection of breast cancer. Mobile Inf Syst. 2022;2022.

Al-Fahaidy FAK, Al-Fuhaidi B, Al-Darouby I, Al-Abady F, Al-Qadry M, Al-Gamal A. A diagnostic model of breast cancer based on digital mammogram images using machine learning techniques. Appl Comput Intell Soft Comput. 2022;2022:1–17.

Ghosh P. Breast cancer Wisconsin (diagnostic) dataset. UCI Machine Learning Repository; 2022. Available from: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(diagnostic)

Akbulut S, Cicek IB, Colak C. Classification of breast cancer on the strength of potential risk factors with boosting models: A public health informatics application. Med Bull Haseki. 2022;60:196–203.

Dey N, Rajinikanth V, Hassanien AE. An examination system to classify breast thermal images into early/acute DCIS class. In: International Conference on Data Science and Applications. Springer; 2021. p.209–20.

Ara S, Das A, Dey A. Malignant and benign breast cancer classification using machine learning algorithms. In: 2021 International Conference on Artificial Intelligence (ICAI). IEEE; 2021. p.97–101.

Sengar PP, Gaikwad MJ, Nagdive AS. Comparative study of machine learning algorithms for breast cancer prediction. In: 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT). IEEE; 2020. p.796–801.

Ak MF. A comparative analysis of breast cancer detection and diagnosis using data visualization and machine learning applications. Healthcare. 2020;8:111.

Abdar M, Zomorodi-Moghadam M, Zhou X, Gururajan R, Tao X, Barua PD, et al. A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recognit Lett. 2020;132:123–31.