A Federated Learning Approach to Secure AI-Based Patient Outcome Prediction Across Hospitals

Sarah Mavire¹*, Kumbirai Bernard Muhwati², Carrol Donna Kudaro³, & Joy Awoleye⁴
¹Department of Computer Science, Yeshiva University, USA
²Department of Computer Science, Yeshiva University, USA
³Department of Computer Science, Yeshiva University, USA
⁴Department of Computer Science, Yeshiva University, USA
DOI – http://doi.org/10.37502/IJSMR.2025.8806

FULL TEXT – PDF

Abstract

The potential of artificial intelligence to transform healthcare is increasingly realized through patient outcome prediction models. However, traditional centralized training methods for such models pose significant privacy risks, particularly when sensitive patient data must be shared across institutions. This paper proposes a federated learning (FL) framework for developing robust and secure patient outcome prediction models across hospitals while ensuring data privacy and regulatory compliance. Using synthetic and real-world datasets such as MIMIC- III and eICU, it creates a multi-hospital-based environment, where models are trained locally and then aggregated in a centralized manner without sharing raw patient data. The LSTM- based and transformer-based architectures are being applied in experiments to time-series health record data, and the accuracy of prediction is statistically significant in the outcomes of ICU mortality and readmission. The FL model achieves competitive performance compared to centralized training, with less than 3% performance degradation and full compliance with privacy-preserving standards. Differential privacy and secure aggregation enhancements was also explored to improve robustness against adversarial participants. Our findings indicate that federated learning presents a scalable, secure, and practical approach to collaborative AI in healthcare, bridging the gap between innovation and privacy protection.

Keywords: Federated Learning (FL), Artificial Intelligence (AI), patient outcome prediction, healthcare, privacy, Electronic Health Records (EHRs), Differential Privacy (DP), Secure Aggregation, LSTM, transformer-based architectures, MIMIC-III, eICU.

References

Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H. B., Patel, S., … & Seth, K. (2017). Practical secure aggregation for privacy-preserving machine learning. Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 1175–1191. https://doi.org/10.1145/3133956.3133982
Confidential Computing Zoo. (n.d.). Vertical federated learning. In Confidential Computing Zoo solutions. Retrieved June 21, 2025, from https://cczoo.readthedocs.io/en/latest/Solutions/vertical-federated-learning/vfl.html
Dayan, I., Roth, H. R., Zhong, A., Harouni, A., Gentili, A., Abidin, A. Z., … & Xu, D. (2021). Federated learning for predicting clinical outcomes in patients with COVID-19. Nature Medicine, 27(10), 1735–1743. https://doi.org/10.1038/s41591-021-01506-3
Esteva, A., Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., Cui, C., Corrado, G., Thrun, S., & Dean, J. (2019). A guide to deep learning in healthcare. Nature Medicine, 25(1), 24–29. https://doi.org/10.1038/s41591-018-0316-z
Hitaj, B., Ateniese, G., & Perez-Cruz, F. (2017). Deep models under the GAN: Information leakage from collaborative deep learning. Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 603–618. https://doi.org/10.1145/3133956.3134012
Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. W. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://doi.org/10.1038/sdata.2016.35
Kaissis, G. A., Makowski, M. R., Rückert, D., & Braren, R. F. (2020). Secure, privacy- preserving and federated machine learning in medical imaging. Nature Machine Intelligence, 2(6), 305–311. https://doi.org/10.1038/s42256-020-0186-1
Lee, C. M., Delgado Fernandez, J., Potenciano Menci, S., & Rieger, A. (2023). Federated learning for credit risk assessment. In T. X. Bui (Ed.), Proceedings of the 56th Hawaii International Conference on System Sciences (HICSS) (pp. 422–431). University of Hawai ‘i at Mānoa. https://doi.org/10.24251/HICSS.2023.048
Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2019, November 12). Federated learning: Challenges, methods, and future directions. Carnegie Mellon University School of Machine Learning. Retrieved from https://blog.ml.cmu.edu/2019/11/12/federated-learning-challenges-methods-and-future-directions/
McMahan, H.B., Moore, E., Ramage, D., Hampson, S. and y Arcas, B.A., (2017). Communication-efficient learning of deep networks from decentralized data. In: AISTATS 2017 – Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. PMLR, pp.1273–1282.
Miotto, R., Wang, F., Wang, S., Jiang, X., & Dudley, J. T. (2018). Deep learning for healthcare: Review, opportunities and challenges. Briefings in Bioinformatics, 19(6), 1236–1246. https://doi.org/10.1093/bib/bbx044
Nasr, M., Shokri, R., & Houmansadr, A. (2019). Comprehensive privacy analysis of deep learning: Passive and active white-box inference attacks against centralized and federated learning. IEEE Symposium on Security and Privacy (SP), 739–753. https://doi.org/10.1109/SP.2019.00065
Nguyen, P., Tran, T., Wickramasinghe, N., & Venkatesh, S. (2021). Artificial intelligence in healthcare: A review of datasets and methods. Journal of Biomedical Informatics, 118, 103752. https://doi.org/10.1016/j.jbi.2021.103752
Pati, S., Kumar, S., Varma, A., Edwards, B., Lu, C., Qu, L., Wang, J. J., Lakshminarayanan, A., Wang, S.-H., Sheller, M. J., Chang, K., Singh, P., Rubin, D. L., Kalpathy-Cramer, J., & Bakas, S. (2024). Privacy preservation for federated learning in health care. Patterns, 5(7), 100974. https://doi.org/10.1016/j.patter.2024.100974
Poplin, R., Chang, P.-C., Alexander, D., Schwartz, S., Colthurst, T., Ku, A., Newburger, D., Dijamco, J., Nguyen, N., Afshar, P. T., Gross, S. S., Dorfman, L., McLean, C. Y., & DePristo, M. A. (2018). A universal SNP and small-indel variant caller using deep neural networks. Nature Biotechnology, 36(10), 983–987. https://doi.org/10.1038/nbt.4235
Rajkomar, A., Oren, E., Chen, K., Dai, A. M., Hajaj, N., Hardt, M., Liu, P. J., Liu, X., Marcus, J., Sun, M., Sundberg, P., Yee, H., Zhang, K., Zhang, Y., Flores, G., Duggan, G. E., Irvine, J., Le, Q., Litsch, K., Mossin, A., Tansuwan, J., Wang, D., Wexler, J., Wilson, J., Ludwig, D., Volchenboum, S. L., Chou, K., Pearson, M., Madabushi, S., Shah, N. H., Butte, A. J., Howell, M. D., Cui, C., Corrado, G. S., & Dean, J. (2018). Scalable and accurate deep learning with electronic health records. npj Digital Medicine, 1(1), 18. https://doi.org/10.1038/s41746-018-0029-1
Rauniyar, A., Hagos, D. H., Jha, D., Håkegård, J. E., Bagci, U., Rawat, D. B., & Vlassov, V. (2023). Federated learning for medical applications: A taxonomy, current trends, challenges, and future research directions. IEEE Access, 11, 38679–38699. https://doi.org/10.1109/ACCESS.2023.3265587
Rieke, N., Hancox, J., Li, W., Milletarì, F., Roth, H. R., Albarqouni, S., Bakas, S., Galtier, M., Landman, B. A., Maier-Hein, K., Ourselin, S., Sheller, M. J., Summers, R. M., Trask, A., Xu, D., Baust, M., Cardoso, M. J., & Makropoulos, A. (2020). The future of digital health with federated learning. npj Digital Medicine, 3(1), 119. https://doi.org/10.1038/s41746-020-00323-1
Robai, M. P. (2024). Federated learning for secure and privacy-preserving data analytics in heterogeneous networks. GSC Advanced Research and Reviews, 21(2), 527–555. https://doi.org/10.30574/gscarr.2024.21.2.0451
Teo, Z. L., Jin, L., Liu, N., Li, S., Miao, D., Zhang, X., Ng, W. Y., Tan, T. F., Lee, D. M., Chua, K. J., Heng, J., Liu, Y., Goh, R. S. M., & Ting, D. S. W. (2024). Federated machine learning in healthcare: A systematic review on clinical applications and technical architecture. Cell Reports Medicine, 5(3), 101481. https://doi.org/10.1016/j.xcrm.2024.101481
What is Federated Learning? (2024, May 2024). GeeksforGeeks. Retrieved from https://www.geeksforgeeks.org/machine-learning/collaborative-learning-federated- learning/
Yang, Q. (2021). Toward responsible AI: An overview of federated learning for user- centered privacy-preserving computing. Nature Machine Intelligence, 3(7), 566–573. https://doi.org/10.1038/s42256-021-00303-4
Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10(2), 12:1–12:19. https://doi.org/10.1145/3298981
Ye, M., Fang, X., Du, B., Yuen, P. C., & Tao, D. (2024). Heterogeneous federated learning: State-of-the-art and research challenges. ACM Computing Surveys, 56(3), Article 79, 1–44. https://doi.org/10.1145/3625558
Zhang, F., Kreuter, D., Chen, Y., Dittmer, S., Tull, S., Shadbahr, T., & BloodCounts! consortium. (2024). Recent methodological advances in federated learning for healthcare. Patterns, 5(6), 101006. https://doi.org/10.1016/j.patter.2024.101006