Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Dec 28, 2022
Open Peer Review Period: Dec 28, 2022 - Feb 22, 2023
Date Accepted: Aug 3, 2023
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Developing a Semantically Based Query Recommendation for an Electronic Medical Record Search Engine: Query Log Analysis and Design Implications

Wu DT, Hanauer D, Murdock P, Vydiswaran VV, Mei Q, Zheng K

Developing a Semantically Based Query Recommendation for an Electronic Medical Record Search Engine: Query Log Analysis and Design Implications

JMIR Form Res 2023;7:e45376

DOI: 10.2196/45376

PMID: 37713239

PMCID: 10541636

Developing a Semantically-based Query Recommendation (SBQR) for an Electronic Medical Record Search Engine (EMERSE): Query Log Analysis and Design Implications

  • Danny T.Y. Wu; 
  • David Hanauer; 
  • Paul Murdock; 
  • VG Vinod Vydiswaran; 
  • Qiaozhu Mei; 
  • Kai Zheng

ABSTRACT

Background:

An effective and scalable information retrieval (IR) system can help clinicians and researchers utilize free-text information in electronic health records. In our previous study, a prototype medical IR system, with a semantically-based query recommendation (SBQR) feature was developed and empirically evaluated, demonstrating high perceived system performance by end users. This follow-up study explored potential factors contributing to the perceived performance through query log analysis.

Objective:

A common challenge of IR is that users may have limited knowledge of what they are searching for. An IR system needs to be carefully designed, especially its user interface, to help users work through this “berry-picking” process where user information needs and queries change when relevant documents are found in the search process. Our solution to addressing these issues is to use “query recommendation”, which has been an integral component in modern IR systems such as Google and Microsoft Bing.

Methods:

The query log data analyzed in the present paper was collected from our previous experimental study where an electronic medical record search engine (EMERSE) with the SBQR feature was developed. A logging mechanism was developed to keep track of the user query behaviors and the IR system output (retrieved documents). In this study, the initial query that a user entered was compared with the query achieved with the assistance of the SBQR. The results of this comparison can determine whether the use of the SBQR helped in constructing improved queries that were different from the original query.

Results:

The results show that the first SBQR-off query and last SBQR-on query were 77% similar, suggesting that the perceived positive performance of the system was more likely due to the automatic query expansion by the SBQR, rather than the manual manipulation of the queries by the users. Moreover, the Entropy analysis showed that search results converged in the scenarios with medium difficulty, and the level of convergence was highly correlated with the perceived performance.

Conclusions:

These findings suggest that the SBQR can contribute to participants’ positive perceptions of performance, contingent upon the scenario difficulty. Medical IR systems should consider designing an SBQR as a user-controlled option or semi-automated feature. Future work involves redesigning the experiment in a more controlled manner and conduct multi-site studies to demonstrate the effectiveness of EMERSE with SBQR for patient cohort identification.


 Citation

Please cite as:

Wu DT, Hanauer D, Murdock P, Vydiswaran VV, Mei Q, Zheng K

Developing a Semantically Based Query Recommendation for an Electronic Medical Record Search Engine: Query Log Analysis and Design Implications

JMIR Form Res 2023;7:e45376

DOI: 10.2196/45376

PMID: 37713239

PMCID: 10541636

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.