When did the patient arrive, when was a CT scan performed, when was the first puncture, when could the blood flow be restored... During mechanical thrombectomy, a range of data must be recorded in the patient report and then manually transferred to various registers for the clinical outcome and for prospective studies. "This is a labour-intensive task that is also prone to transcription errors," says Dr Nils Lehnen, who also conducts research at the University of Bonn. "We therefore asked ourselves whether an AI such as ChatGPT could perform this transfer faster and possibly even more reliably."
In radiology, ChatGPT is already being tested for various tasks, for example simplifying reports or answering patient questions on breast cancer screening. However, whether ChatGPT can correctly extract data from free-text reports of a mechanical thrombectomy for a database and simultaneously generate clinical data was previously unexplored and was the research objective of this new study.
Dr Lehnen's research group first created a German prompt for ChatGPT and tested it on 20 reports in order to identify errors and subsequently adapt the prompt. After this correction, data extraction using ChatGPT was tested on 100 internal reports from the University Hospital Bonn (UKB). As a basis for comparison, an experienced neuroradiologist compiled the same data entries without seeing the ChatGPT evaluation. The researchers then compared the two sets of results and found that ChatGPT had correctly extracted 94 per cent of data entries with no post-processing required. Only ChatGPT data entries that exactly matched those of the expert were considered correct. Any deviations, such as additional symbols, punctuation marks or synonyms, were categorised as incorrect.
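The strict scoring rule described above can be illustrated with a minimal Python sketch. This is not the researchers' code: the function name, the field names and the toy values are all hypothetical and chosen only to show how an exact-match comparison treats extra punctuation or a synonym as an error.

```python
def exact_match_accuracy(extracted, reference):
    """Return the share of reference entries reproduced exactly.

    Both arguments map a field name to a string value. A field counts as
    correct only if the extracted string is identical to the reference
    string; a missing field counts as incorrect.
    """
    if not reference:
        raise ValueError("reference must contain at least one entry")
    correct = sum(
        1 for field, expected in reference.items()
        if extracted.get(field) == expected
    )
    return correct / len(reference)


# Hypothetical toy entries, NOT data from the study.
reference = {
    "arrival_time": "14:05",
    "ct_time": "14:20",
    "puncture_time": "15:02",
    "occlusion_site": "M1",
}
extracted = {
    "arrival_time": "14:05",
    "ct_time": "14:20.",             # extra punctuation: counted as incorrect
    "puncture_time": "15:02",
    "occlusion_site": "M1 segment",  # different wording: counted as incorrect
}

accuracy = exact_match_accuracy(extracted, reference)
print(f"{accuracy:.0%}")
```

Under such a rule, an entry that a human reader would accept as equivalent still lowers the score, so the reported percentages are conservative with respect to formatting differences.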
To validate these results, the researchers tested a further 30 external reports with the same prompt. ChatGPT achieved 90 per cent correct data entries.
"This suggests that ChatGPT could be an alternative to manually retrieving this data," says Dr Lehnen. "However, the reports and the prompt were only created by us in German, so the results of our study may need to be confirmed for other languages. In addition, we still observed poor results for certain data points, which shows that human supervision is still needed. However, we expect that further optimisation of the prompt will further improve the results and that ChatGPT can make work easier in this area in the future."
Lehnen NC, Dorn F, Wiest IC, Zimmermann H, Radbruch A, Kather JN, Paech D.
Data Extraction from Free-Text Reports on Mechanical Thrombectomy in Acute Ischemic Stroke Using ChatGPT: A Retrospective Analysis.
Radiology. 2024 Apr;311(1):e232741. doi: 10.1148/radiol.232741