AI-driven and Non-AI Methods for Electronic Health Records Duplication Remediation for Healthcare Organizations

Authors

  • Aleksandr Borodich Vice President, Head of Business Applications Management, Citibank Moscow, Russia

Keywords:

Electronic Health Records (EHR), Patient Data Management, Record Deduplication, Machine Learning (ML), Data Governance, Health Informatics, AI-Driven Solutions

Abstract

Duplicate Electronic Health Records (EHRs) represent a critical challenge for healthcare organizations, leading to incomplete patient data, potential medical errors, increased operational costs, and compromised quality of care. Traditional methods such as deterministic and probabilistic matching, combined with Enterprise Master Patient Index (EMPI) systems and robust data governance, have long been the cornerstone in tackling duplicates. However, these approaches face limitations when handling large-scale, heterogeneous patient data. In response, AI-driven techniques—particularly Machine Learning (ML)—have emerged as powerful alternatives, enhancing record linkage accuracy, automation, and adaptive capabilities. This article provides an in-depth review of current non-AI (deterministic and probabilistic) and AI-based deduplication strategies, including advanced ML algorithms, biometric patient identification, and real-time re-checking services. We analyze case studies from leading healthcare systems, demonstrating a reduction of duplicate rates from over 20% to under 2%. Additionally, the paper explores key management and organizational factors for successful adoption of deduplication solutions, emphasizing the need for adequate training, policy development, and continuous monitoring. Concluding with practical recommendations and future directions, this research serves as a comprehensive resource for healthcare IT and data management executives aiming to ensure high-quality patient records, strengthen compliance, and support value-based care initiatives.

References

. Bess, O. (2024, Jan 12). The problem with duplicate and mismatched patient records. Physicians Practice. Retrieved from https://www.physicianspractice.com/view/the-problem-with-duplicate-and-mismatched-patient-records

. Church, G. (2023, Nov 5). The deadly cost of duplicate patient records. Chief Healthcare Executive. Retrieved from https://www.chiefhealthcareexecutive.com/view/the-deadly-cost-of-duplicate-patient-records-viewpoint

. Verato. (2021, Feb 10). Three hidden costs of duplicate records. [Blog post]. Retrieved from https://verato.com/blog/three-hidden-costs-of-duplicate-records/#:~:text=staff%20is%2C%20your%20EHR%20or,12%25%20just%20ten%20years%20ago.%5B2

. Church, G. (2023, Apr 23). Retrieved from Duplicate records hurt hospital finances as well as patients. https://www.chiefhealthcareexecutive.com/view/duplicate-records-hurt-hospital-finances-as-well-as-patients-gregg-church

. Black Book Market Research. (2021, Aug 27). Improving the Patient Identification Process and Interoperability to decrease Patient Record Error Rates. [Blog post]. Retrieved from https://www.blackbookmarketresearch.com/blog/improving-the-patient-identification-process-and-interoperability-to-decrease-patient-record-error-rates#:~:text=A%20continued%20evaluation%20of%20Black,lost%20revenue%20and%20increased%20costs

. Nelson, W., Khanna, N., Ibrahim, M., et al. (2023). Optimizing Patient Record Linkage in a Master Patient Index Using Machine Learning: Algorithm Development and Validation. JMIR Formative Research, 7, e44331. DOI: 10.2196/44331.

. Nagels, J., Wu, S., & Gorokhova, V. (2019). Deterministic vs. probabilistic: best practices for patient matching based on a comparison of two implementations. Journal of Digital Imaging, 32(6), 919–924. DOI: 10.1007/s10278-019-00223-x.

. Redfield, C., Tlimat, A., Halpern, Y., et al. (2020). Derivation and validation of a machine learning record linkage algorithm between emergency medical services and the emergency department. Journal of the American Medical Informatics Association, 27(1), 147–153. DOI: 10.1093/jamia/ocz176.

. Ross, M. K., Sanz, J., Tep, B., et al. (2020). Accuracy of an electronic health record patient linkage module evaluated between neighboring academic health care centers. Applied Clinical Informatics, 11(5), 725–732. DOI: 10.1055/s-0040-1718374.

. Mandy Roth (2018, June 07). Northwell on Mission to Annihilate Duplicate Patient Records. HealthLeaders Media. Retrieved from https://www.healthleadersmedia.com/innovation/northwell-mission-annihilate-duplicate-patient-records#:~:text=The%20pilot%20project%20assessed%20the,300%20mismatched%20records%20each%20day

Downloads

Published

2025-05-05

How to Cite

Aleksandr Borodich. (2025). AI-driven and Non-AI Methods for Electronic Health Records Duplication Remediation for Healthcare Organizations. American Scientific Research Journal for Engineering, Technology, and Sciences, 101(1), 358–370. Retrieved from https://asrjetsjournal.org/index.php/American_Scientific_Journal/article/view/11572

Issue

Section

Articles