Automatic Adverse Drug Reaction Extraction from Electronic Health Records

Patients' electronic health records convey crucial information. The application of natural language processing techniques to these records may be an effective means of extracting information that may improve clinical decision making, clinical documentation and billing, disease prediction and the detection of adverse drug reactions. Adverse drug reactions are a major health problem, resulting in hospital re-admissions and even the death of thousands of patients. An automatic detection system can highlight said reactions in a document, summarise them and automatically report them.

In this context, the Basurto University Hospital and the Galdakao Hospital 'were interested in creating a system that would use natural language processing techniques to analyse patient health records in order to automatically identify any adverse effects' explains the engineer Sara Santiso, who also holds a PhD in Computer Science. After the hospitals contacted the IXA group at the UPV/EHU, several researchers started working to build a robust model with which to extract adverse drug reactions from electronic health records written in Spanish, based on clinical text mining.

To this end 'not only have we used techniques based on traditional machine learning algorithms, we have also explored deep learning techniques, reaching the conclusion that these are better able to detect adverse reactions' explains Santiso, one of the authors of the study. Machine learning and deep learning imitate the way the human brain learns, although they use different types of algorithms to do so.

Difficulties finding a corpus in Spanish

Santiso underscores the difficulties the team encountered when trying to find a large enough corpus with which to work: 'At first, we started with only a few health records, because they are difficult to obtain due to privacy issues; you have to sign confidentiality agreements in order to work with them' she explains. The research team has found that 'having a larger corpus helps the system learn the examples contained in it more effectively, thereby giving rise to better results'.

Through this study, which was carried out with health records written in Spanish, 'we are contributing to closing the gap between clinical text mining in English and that carried out in other languages, which accounts for less than 5% of all papers published in the field. Indeed, the extraction of clinical information is not yet fully developed due (among other things) to the potential for extracting information from other hospitals and in other languages' claims the researcher.

Although natural language processing has been of inestimable help in the computer-aided detection of adverse drug reactions, there is still room for improvement: 'To date, systems have tended to focus on detecting drug-disease pairs located in the same sentence.

However, health records contain implicit information that might reveal underlying relations (for example, information about antecedents might be relevant for determining the causes of an adverse event). In other words, future research should strive to detect both explicitly and implicitly-stated inter-sentence relationships'. Moreover, another issue that should be the subject of future research is the lack of electronic health records written in Spanish.

Sara Santiso, Alicia Pérez, Arantza Casillas.
Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants.
Computer Methods and Programs in Biomedicine, Volume 199, 2021. doi: 10.1016/j.cmpb.2020.105891

Most Popular Now

Research Shows AI Technology Improves Pa…

Existing research indicates that the accuracy of a Parkinson's disease diagnosis hovers between 55% and 78% in the first five years of assessment. That's partly because Parkinson's sibling movement disorders...

Who's to Blame When AI Makes a Medi…

Assistive artificial intelligence technologies hold significant promise for transforming health care by aiding physicians in diagnosing, managing, and treating patients. However, the current trend of assistive AI implementation could actually...

First Therapy Chatbot Trial Shows AI can…

Dartmouth researchers conducted the first clinical trial of a therapy chatbot powered by generative AI and found that the software resulted in significant improvements in participants' symptoms, according to results...

DMEA sparks: The Future of Digital Healt…

8 - 10 April 2025, Berlin, Germany. Digitalization is considered one of the key strategies for addressing the shortage of skilled workers - but the digital health sector also needs qualified...

DeepSeek: The "Watson" to Doct…

DeepSeek is an artificial intelligence (AI) platform built on deep learning and natural language processing (NLP) technologies. Its core products include the DeepSeek-R1 and DeepSeek-V3 models. Leveraging an efficient Mixture...

Stepping Hill Hospital Announced as SPAR…

Stepping Hill Hospital, part of Stockport NHS Foundation Trust, has replaced its bedside units with state-of-the art devices running a full range of information, engagement, communications and productivity apps, to...

DMEA 2025: Digital Health Worldwide in B…

8 - 10 April 2025, Berlin, Germany. From the AI Act, to the potential of the European Health Data Space, to the power of patient data in Scandinavia - DMEA 2025...