A Federated Learning Approach to Secure AI-Based Patient Outcome Prediction Across Hospitals

Sarah Mavire1*, Kumbirai Bernard Muhwati2, Carrol Donna Kudaro3, & Joy Awoleye4
1Department of Computer Science, Yeshiva University, USA
2Department of Computer Science, Yeshiva University, USA
3Department of Computer Science, Yeshiva University, USA
4Department of Computer Science, Yeshiva University, USA
DOI
– http://doi.org/10.37502/IJSMR.2025.8806

FULL TEXT – PDF

Abstract

The potential of artificial intelligence to transform healthcare is increasingly realized through patient outcome prediction models. However, traditional centralized training methods for such models pose significant privacy risks, particularly when sensitive patient data must be shared across institutions. This paper proposes a federated learning (FL) framework for developing robust and secure patient outcome prediction models across hospitals while ensuring data privacy and regulatory compliance. Using synthetic and real-world datasets such as MIMIC- III and eICU, it creates a multi-hospital-based environment, where models are trained locally and then aggregated in a centralized manner without sharing raw patient data. The LSTM- based and transformer-based architectures are being applied in experiments to time-series health record data, and the accuracy of prediction is statistically significant in the outcomes of ICU mortality and readmission. The FL model achieves competitive performance compared to centralized training, with less than 3% performance degradation and full compliance with privacy-preserving standards. Differential privacy and secure aggregation enhancements was also explored to improve robustness against adversarial participants. Our findings indicate that federated learning presents a scalable, secure, and practical approach to collaborative AI in healthcare, bridging the gap between innovation and privacy protection.

Keywords: Federated Learning (FL), Artificial Intelligence (AI), patient outcome prediction, healthcare, privacy, Electronic Health Records (EHRs), Differential Privacy (DP), Secure Aggregation, LSTM, transformer-based architectures, MIMIC-III, eICU.

References

  • Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H. B., Patel, S., … & Seth, K. (2017). Practical secure aggregation for privacy-preserving machine learning. Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 1175–1191. https://doi.org/10.1145/3133956.3133982
  • Confidential Computing Zoo. (n.d.). Vertical federated learning. In Confidential Computing Zoo solutions. Retrieved June 21, 2025, from https://cczoo.readthedocs.io/en/latest/Solutions/vertical-federated-learning/vfl.html
  • Dayan, I., Roth, H. R., Zhong, A., Harouni, A., Gentili, A., Abidin, A. Z., … & Xu, D. (2021). Federated learning for predicting clinical outcomes in patients with COVID-19. Nature Medicine, 27(10), 1735–1743. https://doi.org/10.1038/s41591-021-01506-3
  • Esteva, A., Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., Cui, C., Corrado, G., Thrun, S., & Dean, J. (2019). A guide to deep learning in healthcare. Nature Medicine, 25(1), 24–29. https://doi.org/10.1038/s41591-018-0316-z
  • Hitaj, B., Ateniese, G., & Perez-Cruz, F. (2017). Deep models under the GAN: Information leakage from collaborative deep learning. Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 603–618. https://doi.org/10.1145/3133956.3134012
  • Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. W. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://doi.org/10.1038/sdata.2016.35
  • Kaissis, G. A., Makowski, M. R., Rückert, D., & Braren, R. F. (2020). Secure, privacy- preserving and federated machine learning in medical imaging. Nature Machine Intelligence, 2(6), 305–311. https://doi.org/10.1038/s42256-020-0186-1
  • Lee, C. M., Delgado Fernandez, J., Potenciano Menci, S., & Rieger, A. (2023). Federated learning for credit risk assessment. In T. X. Bui (Ed.), Proceedings of the 56th Hawaii International Conference on System Sciences (HICSS) (pp. 422–431). University of Hawai ‘i at Mānoa. https://doi.org/10.24251/HICSS.2023.048
  • Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2019, November 12). Federated learning: Challenges, methods, and future directions. Carnegie Mellon University School of Machine Learning. Retrieved from https://blog.ml.cmu.edu/2019/11/12/federated-learning-challenges-methods-and-future-directions/
  • McMahan, H.B., Moore, E., Ramage, D., Hampson, S. and y Arcas, B.A., (2017). Communication-efficient learning of deep networks from decentralized data. In: AISTATS 2017 – Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. PMLR, pp.1273–1282.
  • Miotto, R., Wang, F., Wang, S., Jiang, X., & Dudley, J. T. (2018). Deep learning for healthcare: Review, opportunities and challenges. Briefings in Bioinformatics, 19(6), 1236–1246. https://doi.org/10.1093/bib/bbx044
  • Nasr, M., Shokri, R., & Houmansadr, A. (2019). Comprehensive privacy analysis of deep learning: Passive and active white-box inference attacks against centralized and federated learning. IEEE Symposium on Security and Privacy (SP), 739–753. https://doi.org/10.1109/SP.2019.00065
  • Nguyen, P., Tran, T., Wickramasinghe, N., & Venkatesh, S. (2021). Artificial intelligence in healthcare: A review of datasets and methods. Journal of Biomedical Informatics, 118, 103752. https://doi.org/10.1016/j.jbi.2021.103752
  • Pati, S., Kumar, S., Varma, A., Edwards, B., Lu, C., Qu, L., Wang, J. J., Lakshminarayanan, A., Wang, S.-H., Sheller, M. J., Chang, K., Singh, P., Rubin, D. L., Kalpathy-Cramer, J., & Bakas, S. (2024). Privacy preservation for federated learning in health care. Patterns, 5(7), 100974. https://doi.org/10.1016/j.patter.2024.100974
  • Poplin, R., Chang, P.-C., Alexander, D., Schwartz, S., Colthurst, T., Ku, A., Newburger, D., Dijamco, J., Nguyen, N., Afshar, P. T., Gross, S. S., Dorfman, L., McLean, C. Y., & DePristo, M. A. (2018). A universal SNP and small-indel variant caller using deep neural networks. Nature Biotechnology, 36(10), 983–987. https://doi.org/10.1038/nbt.4235
  • Rajkomar, A., Oren, E., Chen, K., Dai, A. M., Hajaj, N., Hardt, M., Liu, P. J., Liu, X., Marcus, J., Sun, M., Sundberg, P., Yee, H., Zhang, K., Zhang, Y., Flores, G., Duggan, G. E., Irvine, J., Le, Q., Litsch, K., Mossin, A., Tansuwan, J., Wang, D., Wexler, J., Wilson, J., Ludwig, D., Volchenboum, S. L., Chou, K., Pearson, M., Madabushi, S., Shah, N. H., Butte, A. J., Howell, M. D., Cui, C., Corrado, G. S., & Dean, J. (2018). Scalable and accurate deep learning with electronic health records. npj Digital Medicine, 1(1), 18. https://doi.org/10.1038/s41746-018-0029-1
  • Rauniyar, A., Hagos, D. H., Jha, D., Håkegård, J. E., Bagci, U., Rawat, D. B., & Vlassov, V. (2023). Federated learning for medical applications: A taxonomy, current trends, challenges, and future research directions. IEEE Access, 11, 38679–38699. https://doi.org/10.1109/ACCESS.2023.3265587
  • Rieke, N., Hancox, J., Li, W., Milletarì, F., Roth, H. R., Albarqouni, S., Bakas, S., Galtier, M., Landman, B. A., Maier-Hein, K., Ourselin, S., Sheller, M. J., Summers, R. M., Trask, A., Xu, D., Baust, M., Cardoso, M. J., & Makropoulos, A. (2020). The future of digital health with federated learning. npj Digital Medicine, 3(1), 119. https://doi.org/10.1038/s41746-020-00323-1
  • Robai, M. P. (2024). Federated learning for secure and privacy-preserving data analytics in heterogeneous networks. GSC Advanced Research and Reviews, 21(2), 527–555. https://doi.org/10.30574/gscarr.2024.21.2.0451
  • Teo, Z. L., Jin, L., Liu, N., Li, S., Miao, D., Zhang, X., Ng, W. Y., Tan, T. F., Lee, D. M., Chua, K. J., Heng, J., Liu, Y., Goh, R. S. M., & Ting, D. S. W. (2024). Federated machine learning in healthcare: A systematic review on clinical applications and technical architecture. Cell Reports Medicine, 5(3), 101481. https://doi.org/10.1016/j.xcrm.2024.101481
  • What is Federated Learning? (2024, May 2024). GeeksforGeeks. Retrieved from https://www.geeksforgeeks.org/machine-learning/collaborative-learning-federated- learning/
  • Yang, Q. (2021). Toward responsible AI: An overview of federated learning for user- centered privacy-preserving computing. Nature Machine Intelligence, 3(7), 566–573. https://doi.org/10.1038/s42256-021-00303-4
  • Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10(2), 12:1–12:19. https://doi.org/10.1145/3298981
  • Ye, M., Fang, X., Du, B., Yuen, P. C., & Tao, D. (2024). Heterogeneous federated learning: State-of-the-art and research challenges. ACM Computing Surveys, 56(3), Article 79, 1–44. https://doi.org/10.1145/3625558
  • Zhang, F., Kreuter, D., Chen, Y., Dittmer, S., Tull, S., Shadbahr, T., & BloodCounts! consortium. (2024). Recent methodological advances in federated learning for healthcare. Patterns, 5(6), 101006. https://doi.org/10.1016/j.patter.2024.101006