Federated Edge-Centric Observability for Multi-Cloud Healthcare Data Pipelines
DOI:
https://doi.org/10.63282/3050-9262.IJAIDSML-V7I1P134Keywords:
Multi-Cloud Healthcare, Data Observability, Federated Architecture, Edge Computing, Quality-As-A-Service, Chaos Engineering, Data Pipelines, Finops, PHI, EHR AnalyticsAbstract
Healthcare organizations increasingly operate data pipelines that span multiple public clouds and on-premise environments, integrating electronic health records (EHRs), claims, and Internet of Medical Things (IoMT) telemetry into shared analytic platforms. In this context, traditional centralized monitoring and data quality approaches—which assume that logs and data can be aggregated into a single platform—are costly and fragile. Moving protected health information (PHI) across cloud boundaries for inspection inflates egress spend and raises regulatory concerns, while platform-specific observability tools provide limited visibility into how failures propagate end-to-end across heterogeneous environments. This paper proposes a federated, edge-centric observability architecture for multi-cloud healthcare data pipelines. ssssLightweight observability agents are deployed near data sources in each cloud and on-premise region to instrument pipelines, compute local health and anomaly metrics, and evaluate policies. Above them, a cloud-agnostic Quality-as-a-Service (QaaS) control plane aggregates derived signals, manages models and policies, and orchestrates human-in-the-loop remediation using secure pointers rather than PHI replication. The architecture explicitly decouples the data plane from a quality and observability plane, enabling cross-cloud visibility while preserving data sovereignty. We evaluate the design on a synthetic, FHIR-compatible multi-cloud healthcare workload comprising 47 pipelines across four environments over 30 simulated days. Using chaos engineering, we inject realistic failure modes, including schema changes, delayed or partial loads, cross-cloud link failures, and backlogged batch processing. We compare the proposed federated approach to a representative centralized monitoring baseline along three dimensions: time-to-detection and localization of failures, monitoring-related inter-cloud data transfer costs, and resource overhead of agents and control plane. Across 216 chaos injections, the edge-centric architecture reduces median time-to-detection and localization while lowering monitoring-related egress volume by roughly 25% in this synthetic workload, with agent overhead of about 4–7% CPU and ~3% memory on batch pipelines. Given the limitations of synthetic data and simplified cost modeling, these results should be interpreted as early-stage validation of a promising architectural pattern rather than definitive benchmarks.
References
[1] J. A. Walonoski et al., “Synthea: An approach, method, and software mechanism for generating synthetic patients and the synthetic electronic health care record,” J. Am. Med. Inform. Assoc., vol. 25, no. 3, pp. 230–238, Mar. 2018.
[2] Health Level Seven International, "FHIR Release 4 (R4): Fast Healthcare Interoperability Resources," 2019. [Online]. Available: http://hl7.org/fhir/R4/
[3] C. N. Vorisek et al., “Fast Healthcare Interoperability Resources (FHIR) for interoperability in health research: A systematic review,” JMIR Med. Inform., vol. 10, no. 7, p. e35724, Jul. 2022.
[4] A. E. Lewis et al., “Electronic health record data quality assessment and tools: A systematic review,” J. Am. Med. Inform. Assoc., vol. 30, no. 10, pp. 1730–1742, Oct. 2023.
[5] F. T. Liu, K. M. Ting, and Z.-H. Zhou, “Isolation forest,” in Proc. 8th IEEE Int. Conf. Data Mining (ICDM), Pisa, Italy, 2008, pp. 413–422.
[6] P. Kairouz et al., “Advances and open problems in federated learning,” Found. Trends Mach. Learn., vol. 14, nos. 1–2, pp. 1–210, 2021.
[7] T. Li, A. K. Sahu, A. Talwalkar, and V. Smith, “Federated learning: Challenges, methods, and future directions,” IEEE Signal Process. Mag., vol. 37, no. 3, pp. 50–60, May 2020.
[8] A. A. Sinaci et al., “Privacy-preserving federated machine learning on FAIR health data: A real-world application,” Comput. Struct. Biotechnol. J., vol. 24, pp. 136–145, 2024.
[9] M. Stonebraker and U. Çetintemel, “‘One size fits all’: An idea whose time has come and gone,” in Proc. 21st Int. Conf. Data Eng. (ICDE), Tokyo, Japan, 2005, pp. 2–11.
[10] Google Cloud, "Multicloud database management: Architectures, use cases, and best practices," Google Cloud Architecture Center, 2023. Available: https://cloud.google.com/architecture
[11] Monte Carlo Data, "What is data observability? A complete guide," Monte Carlo Blog, 2022. Available: https://www.montecarlodata.com/blog
[12] B. Moses, The Rise of Data Observability: Architecting the Future of Data Trust. Sebastopol, CA, USA: O’Reilly Media, 2021.
[13] N. Forsgren, J. Humble, and G. Kim, Accelerate: The Science of Lean Software and DevOps. Portland, OR, USA: IT Revolution Press, 2018.
[14] A. Basiri et al., “Chaos engineering,” IEEE Softw., vol. 33, no. 3, pp. 35–41, May/Jun. 2016.
[15] Z. Dehghani, Data Mesh: Delivering Data-Driven Value at Scale. Sebastopol, CA, USA: O’Reilly Media, 2022.
[16] Gartner, "Top 10 data and analytics trends for 2021," Gartner, 2021. Available: https://www.gartner.com/smarterwithgartner/gartner-top-10-data-and-analytics-trends-for-2021
[17] U.S. Dept. of Health and Human Services, Office of the National Coordinator for Health IT, “Health Data, Technology, and Interoperability: Certification Program Updates, Algorithm Transparency, and Information Sharing (HTI-1) Final Rule,” Fed. Regist., vol. 89, no. 7, pp. 1192–1279, Jan. 9, 2024.
[18] N. Yaraghi, A. A. Seixas, and F. Zizi, "How ONC can strengthen its HTI-1 rule to ensure transparency, fairness, and equity in AI," Health Affairs—Forefront, Jun. 26, 2024.
[19] S. K. Y. Battula, “Adaptive data quality management for multi-cloud healthcare warehouses: FHIR-aware semantics and unsupervised thresholding,” *Int. J. Artif. Intell., Data Sci. Mach. Learn. (IJAIDSML)*, vol. 6, no. 4, pp. 218–226, Dec. 2025, doi: 10.63282/3050-9262.IJAIDSML-V6I4P130.










