The ‘Bootstrap’ Methodology: An Architecture for Incremental Integration and Normalization of Heterogeneous Healthcare Big Data in Near Real-Time Analytical SystemsJoshi Mehulkumar Citation: Joshi Mehulkumar, "The ‘Bootstrap’ Methodology: An Architecture for Incremental Integration and Normalization of Heterogeneous Healthcare Big Data in Near Real-Time Analytical Systems", Universal Library of Medical and Health Sciences, Volume 04, Issue 02. Copyright: This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. AbstractThe methodology examines the ‘Bootstrap’ methodology as a reproducible approach to the incremental integration and normalization of heterogeneous healthcare big data within analytical systems operating in near-real-time. The relevance of the study is driven by the rapid growth in the volume, arrival velocity, and structural heterogeneity of healthcare data, which render traditional full-reload METL processes operationally infeasible, impose excessive strain on infrastructure, and fail to ensure the required freshness of analytics. The purpose of the methodology is to formalize a two-phase algorithm for transitioning from full refresh to delta loading, based on capturing an initial data snapshot, subsequently extracting only changed records, and reliably merging them with the historical dataset. The scientific novelty lies in combining timestamp-based incremental loading, a preliminary standardization module for dirty medical data, and a dedicated state-tracking subsystem that ensures idempotency and recovery after failures. It is shown that applying the methodology enables reducing processing time by 75–80%, lowering computational and network load, and moving from retrospective reporting to daily or intra-day analytics. The article will be useful for data architects, ETL/METL engineers, IT executives in healthcare organizations, and researchers in digital health. Keywords: ‘Bootstrap’ Methodology, Healthcare, Big Data, Incremental Loading, Delta Integration. Download |
|---|