Published: 2025-12-01
Application of Random Forest Method in Predicting Chronic Obstructive Pulmonary Disease (COPD)
DOI: 10.35870/ijsecs.v5i3.5551
Muhhamad Fatkhurridlo Mahendra, Nur Aeni Widiyastuti, Sarwido Sarwido
Downloads
Article Metrics
- Views 0
- Downloads 0
- Scopus Citations
- Google Scholar
- Crossref Citations
- Semantic Scholar
- DataCite Metrics
-
If the link doesn't work, copy the DOI or article title for manual search (API Maintenance).
Abstract
Chronic Obstructive Pulmonary Disease (COPD) is one of the major global health problems and remains among the leading causes of death worldwide. Early detection plays a crucial role in preventing disease progression; however, conventional diagnostic methods such as spirometry and CT scans often require high costs, long processing time, and specialized expertise. This study aims to apply the Random Forest algorithm, one of the machine learning methods, to predict COPD based on clinical and lifestyle data. The dataset was obtained from Kaggle, consisting of attributes including age, gender, smoking status, type of occupation, sleep habits, exercise activity, insurance ownership, and history of comorbidities. The research stages include data preprocessing, train-test splitting (80:20), and model evaluation using accuracy, precision, recall, F1-score, and AUC metrics. The Random Forest model achieved an accuracy below 90% (approximately 87%), reflecting realistic performance in medical prediction while avoiding overfitting. The results indicate that Random Forest can serve as a reliable method for COPD detection and holds potential to be developed as the foundation of a Clinical Decision Support System (CDSS). This study contributes to the growing body of literature on the application of machine learning in healthcare, while also offering a faster, cost-effective, and scalable alternative for diagnosis.
Keywords
Chronic Obstructive Pulmonary Disease (COPD) ; Random Forest ; Machine Learning ; Medical Prediction ; Clinical Decision Support System
Article Metadata
Peer Review Process
This article has undergone a double-blind peer review process to ensure quality and impartiality.
Indexing Information
Discover where this journal is indexed at our indexing page to understand its reach and credibility.
Open Science Badges
This journal supports transparency in research and encourages authors to meet criteria for Open Science Badges by sharing data, materials, or preregistered studies.
How to Cite
Article Information
This article has been peer-reviewed and published in the International Journal Software Engineering and Computer Science (IJSECS). The content is available under the terms of the Creative Commons Attribution 4.0 International License.
-
Issue: Vol. 5 No. 3 (2025)
-
Section: Articles
-
Published: %750 %e, %2025
-
License: CC BY 4.0
-
Copyright: © 2025 Authors
-
DOI: 10.35870/ijsecs.v5i3.5551
AI Research Hub
This article is indexed and available through various AI-powered research tools and citation platforms. Our AI Research Hub ensures that scholarly work is discoverable, accessible, and easily integrated into the global research ecosystem. By leveraging artificial intelligence for indexing, recommendation, and citation analysis, we enhance the visibility and impact of published research.
Muhhamad Fatkhurridlo Mahendra
Universitas Islam Nahdlatul Ulama Jepara, Jepara Regency, Central Java Province, Indonesia
Nur Aeni Widiyastuti
Universitas Islam Nahdlatul Ulama Jepara, Jepara Regency, Central Java Province, Indonesia
-
-
-
Mei, F., Dalmartello, M., Bonifazi, M., Bertuccio, P., Levi, F., Boffetta, P., Negri, E., La Vecchia, C., & Malvezzi, M. (2022). Chronic obstructive pulmonary disease (COPD) mortality trends worldwide: An update to 2019. Respirology, 27(11), 941–950. https://doi.org/10.1111/resp.14328
-
AL Wachami, N., Guennouni, M., Iderdar, Y., Boumendil, K., Arraji, M., Mourajid, Y., Bouchachi, F. Z., Barkaoui, M., Louerdi, M. L., Hilali, A., & Chahboune, M. (2024). Estimating the global prevalence of chronic obstructive pulmonary disease (COPD): A systematic review and meta-analysis. BMC Public Health, 24(1), 297. https://doi.org/10.1186/s12889-024-17686-9
-
Modi, S., Kasmiran, K. A., Mohd Sharef, N., & Sharum, M. Y. (2024). Extracting adverse drug events from clinical notes: A systematic review of approaches used. Journal of Biomedical Informatics, 151, 104603. https://doi.org/10.1016/j.jbi.2024.104603
-
Shen, X., Zhang, Y., Li, H., & Wang, J. (2022). Random Forest for COPD diagnosis using clinical data: Performance and limitations. Frontiers in Medicine, 9, 842133. https://doi.org/10.3389/fmed.2022.842133
-
Bahloul, M., Ben Rhouma, K., Chouchene, A., & Bouaziz, M. (2023). Evaluating machine learning algorithms for COPD prediction in European hospitals. BMC Pulmonary Medicine, 23, 278. https://doi.org/10.1186/s12890-023-02778-2
-
Zhao, L., Zhang, Q., Xu, X., & Yang, Y. (2022). Application of Random Forest on electronic health records for early COPD detection. Journal of Medical Systems, 46(9), 53. https://doi.org/10.1007/s10916-022-01798-7
-
Wu, Z., Liu, H., & Chen, Y. (2023). Feature importance analysis in COPD risk prediction using Random Forest. International Journal of Chronic Obstructive Pulmonary Disease, 18, 2235–2246. https://doi.org/10.2147/COPD.S403122
-
Prakash, V., Idrisoglu, A., Dallora, A. L., & Sanmartin Berglund, J. (2024). COPDVD: Automated classification of COPD via voice analysis using Random Forest. Artificial Intelligence in Medicine, 156, 102953. https://doi.org/10.1016/j.artmed.2024.102953
-
Choi, J., Kim, S., Park, H., & Lee, Y. (2024). Development of a clinical decision support system for COPD using Random Forest and electronic health record data. Scientific Reports, 14, 5112. https://doi.org/10.1038/s41598-024-51124-7
-
Gao, M., Li, W., Sun, H., & Yang, F. (2023). Comparative analysis of machine learning models for COPD risk prediction: Random Forest, XGBoost, and deep learning. BMC Pulmonary Medicine, 23, 278. https://doi.org/10.1186/s12890-023-02778-2
-
Elashmawi, W. H., Djellal, A., Sheta, A., Surani, S., & Aljahdali, S. (2024). Machine learning for enhanced COPD diagnosis: A comparative analysis of classification algorithms. Diagnostics, 14(24), 2822. https://doi.org/10.3390/diagnostics14242822
-
-
Sagithya, T., & Arthi, S. K. (2024, December). An intelligent early COPD prediction using machine learning. In 2024 9th International Conference on Communication and Electronics Systems (ICCES) (pp. 1936–1941). IEEE. https://doi.org/10.1109/ICCES63552.2024.10860099
-
Kinikar, A., Chandwani, M., & Rane, T. (2024, March). Predicting COPD severity using machine learning and GOLD criteria. In 2024 3rd International Conference for Innovation in Technology (INOCON) (pp. 1–6). IEEE. https://doi.org/10.1109/INOCON60754.2024.10511329
-
Singh, A. P., Shukla, M., Kumar, S., Mishra, S. K., Dahiya, T., & Chand, A. (2025, May). Performance assessment of ensemble learning methods in COPD diagnosis. In IET Conference Proceedings CP920 (Vol. 2025, No. 7, pp. 1685–1691). The Institution of Engineering and Technology. https://doi.org/10.1049/icp.2025.1696
-
Jang, T. G., Park, S. Y., Park, H. Y., Lee, J., Kim, S. H., & Urtnasan, E. (2024). Ensemble learning approaches for automatic prediction of COPD based on clinical data. Digital Health Research, 2(3). https://doi.org/10.61499/dhr.2024.2.e4
-
Peng, H., Zhou, Y., Lu, S., Nie, Y., Zhang, J., & Yang, J. (2025). Predicting the frequent exacerbator phenotype in COPD: Development and validation of a multicenter real-world prediction model. BMC Medical Informatics and Decision Making, 25(1), 443. https://doi.org/10.1186/s12911-025-03281-4

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
1. Copyright Retention and Open Access License
Authors retain copyright of their work and grant the journal non-exclusive right of first publication under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
This license allows unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
2. Rights Granted Under CC BY 4.0
Under this license, readers are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, including commercial use
- No additional restrictions — the licensor cannot revoke these freedoms as long as license terms are followed
3. Attribution Requirements
All uses must include:
- Proper citation of the original work
- Link to the Creative Commons license
- Indication if changes were made to the original work
- No suggestion that the licensor endorses the user or their use
4. Additional Distribution Rights
Authors may:
- Deposit the published version in institutional repositories
- Share through academic social networks
- Include in books, monographs, or other publications
- Post on personal or institutional websites
Requirement: All additional distributions must maintain the CC BY 4.0 license and proper attribution.
5. Self-Archiving and Pre-Print Sharing
Authors are encouraged to:
- Share pre-prints and post-prints online
- Deposit in subject-specific repositories (e.g., arXiv, bioRxiv)
- Engage in scholarly communication throughout the publication process
6. Open Access Commitment
This journal provides immediate open access to all content, supporting the global exchange of knowledge without financial, legal, or technical barriers.