Open Access Open Access  Restricted Access Subscription Access

Semantic Search in Electronic Health Records: A Case Study Using SemEHR System

Anushree Raj SR, Gajanan M Naik

Abstract


Electronic Health Records (EHRs) contain a blend of structured fields and unstructured clinical notes that together capture the complete story of a patient’s health. While structured data can be processed easily, the rich medical insights hidden in free-text notes often remain inaccessible to traditional keyword-based search methods. These limitations arise from variations in medical terminology, frequent use of abbreviations, and contextual expressions such as negation and temporality. This paper examines how semantic search, an approach that interprets the meaning and context of medical concepts can enhance information retrieval in healthcare settings. Using SemEHR, an open-source semantic search and analytics system deployed in multiple UK hospitals, as a case study, the work demonstrates how clinical notes can be transformed into ontology-linked concepts, temporally structured patient views, and clinically meaningful summaries. The evaluation of SemEHR across real hospital datasets shows substantial improvement in retrieval accuracy and efficiency compared to traditional search approaches. These findings highlight the potential of semantic search to support clinical research, patient identification, and medical decision-making by making unstructured EHR data more accessible, interpretable, and actionable.


Full Text:

PDF

References


Wu, H., Toti, G., Morley, K. I., Ibrahim, Z. M., Folarin, A., Jackson, R., Kartoglu, I., Agrawal, A., Stringer, C., Gale, D., et al.SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research. Journal of the American Medical Informatics Association, 2018.

Tamine, L., & Goeuriot, L.Semantic Information Retrieval on Medical Texts: Research Challenges, Survey, and Open Issues. ACM Computing Surveys, 2021.

Kejriwal, M., Haidarian, H., Chiu, M., Xiang, A., Shrestha, D., & Javed, F.A Semantic Search Engine for Helping Patients Find Doctors and Locations in a Large Healthcare Organization. SIGIR ’24, 2024.

Hsieh, S.-L., Chang, W.-Y., Chen, C.-H., & Weng, Y.-C.Semantic Similarity Measures in the Biomedical Domain by Leveraging a Web Search Engine. IEEE Journal of Biomedical and Health Informatics, 2013.

Bodenreider, O.The Unified Medical Language System (UMLS): Integrating Biomedical Terminology. Nucleic Acids Research, 2004.

Chapman, W. W., Bridewell, W., Hanbury, P., Cooper, G. F., & Buchanan, B. G.A Simple Algorithm for Identifying Negated Findings in Clinical Reports. Journal of Biomedical Informatics, 2001.

Aronson, A. R., & Lang, F. M.An Overview of MetaMap: Mapping Biomedical Text to the UMLS Metathesaurus. Proceedings of AMIA Symposium, 2010.

Savova, G. K., Masanz, J. J., Ogren, P. V., Zheng, J., Sohn, S., Kipper-Schuler, K. C., & Chute, C. G.Mayo Clinical Text Analysis and Knowledge Extraction System (cTAKES): Architecture and Applications. Journal of the American Medical Informatics Association, 2010.

Aronson, A. R.Effective Mapping of Biomedical Text to the UMLS Metathesaurus: The MetaMap Program. Proceedings of the AMIA Symposium, 2001.

Savova, G. K., Masanz, J. J., Ogren, P. V., et al.Mayo Clinical Text Analysis and Knowledge Extraction System (cTAKES): Architecture and Applications. Journal of the American Medical Informatics Association, 2010.


Refbacks

  • There are currently no refbacks.