Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 30, 2022
Date Accepted: Jul 1, 2022

The final, peer-reviewed published version of this preprint can be found here:

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

Noor K, Roguski L, Bai X, Handy A, Klapaukh R, Folarin A, Romao L, Matteson J, Lea N, Zhu L, Asselbergs FW, Wong WK, Shah A, Dobson RJ

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

JMIR Med Inform 2022;10(8):e38122

DOI: 10.2196/38122

PMID: 36001371

PMCID: 9453582

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

  • Kawsar Noor; 
  • Lukasz Roguski; 
  • Xi Bai; 
  • Alex Handy; 
  • Roman Klapaukh; 
  • Amos Folarin; 
  • Luis Romao; 
  • Joshua Matteson; 
  • Nathan Lea; 
  • Leilei Zhu; 
  • Folkert W Asselbergs; 
  • Wai Keong Wong; 
  • Anoop Shah; 
  • Richard JB Dobson

ABSTRACT

Background:

As more healthcare organizations transition to using electronic health record (EHR) systems it is important for these organizations to maximize the secondary use of their data to support service improvement and clinical research. These organizations will find it challenging to have systems capable of harnessing the unstructured data fields in the record (clinical notes, letters etc) and more practically have such systems interact with all of the hospitals data systems (legacy and current).

Objective:

We describe the deployment of the EHR interfacing information extraction and retrieval platform Cogstack at University College London Hospital (UCLH).

Methods:

At UCLH we have deployed the CogStack platform; an information retrieval platform with natural language processing capabilities. The platform addresses the problem of data ingestion and harmonization from multiple data sources using Apache Nifi module for managing complex data flows. The platform also facilitates the extraction of structured data from the free text records through use of the MedCAT natural language processing library. Finally data science tools are made available to support data scientists and /or development of downstream applications dependent upon Cogstack ingested and analyzed data.

Results:

The platform has been deployed at the hospital and in particular how it has facilitated a number of research and service evaluation projects. To date we have processed over 30 million records and the insights produced from CogStack have informed a number of clinical research use cases at the hospitals.

Conclusions:

The CogStack platform can be configured to handle the data ingestion and harmonization challenges faced by a hospital. More importantly the platform enables the hospital to unlock important clinical information from the unstructured portion of the record using NLP technology.


 Citation

Please cite as:

Noor K, Roguski L, Bai X, Handy A, Klapaukh R, Folarin A, Romao L, Matteson J, Lea N, Zhu L, Asselbergs FW, Wong WK, Shah A, Dobson RJ

Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals

JMIR Med Inform 2022;10(8):e38122

DOI: 10.2196/38122

PMID: 36001371

PMCID: 9453582

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.