Open Access Peer-reviewed
Mining the data contained within Electronic Health Records (EHRs) can potentially generate a greater understanding of medication effects in the real world, complementing what we know from Randomised control trials (RCTs). We Propose a text mining approach to detect adverse events and medication episodes from the clinical text to enhance our understanding of adverse effects related to Clozapine, the most effective antipsychotic drug for the management of treatment-resistant schizophrenia, but underutilised due to concerns over its side effects.
We used data from de-identified EHRs of three mental health trusts in the UK (>50 million documents, over 500,000 patients, 2835 of which were prescribed Clozapine). We explored the prevalence of 33 adverse effects by age, gender, ethnicity, smoking status and admission type three months before and after the patients started Clozapine treatment. Where possible, we compared the prevalence of adverse effects with those reported in the Side Effects Resource (SIDER).
Sedation, fatigue, agitation, dizziness, hypersalivation, weight gain, tachycardia, headache, constipation and confusion were amongst the highest recorded Clozapine adverse effect in the three months following the start of treatment. Higher percentages of all adverse effects were found in the first month of Clozapine therapy. Using a significance level of (p< 0.05) our chi-square tests show a significant association between most of the ADRs and smoking status and hospital admission, and some in gender, ethnicity and age groups in all trusts hospitals. Later we combined the data from the three trusts hospitals to estimate the average effect of ADRs in each monthly interval. In gender and ethnicity, the results show significant association in 7 out of 33 ADRs, smoking status shows significant association in 21 out of 33 ADRs and hospital admission shows the significant association in 30 out of 33 ADRs.
A better understanding of how drugs work in the real world can complement clinical trials. Citation: Iqbal E, Govind R, Romero A, Dzahini O, Broadbent M, Stewart R, et al. (2020) The side effect profile of Clozapine in real world data of three large mental health hospitals. PLoS ONE 15(12): e0243437. https://doi.org/10.1371/journal.pone.0243437 Editor: Vincenzo De Luca, University of Toronto, CANADA Received: February 18, 2020; Accepted: November 22, 2020; Published: December 8, 2020 Copyright: © 2020 Iqbal et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Data Availability: SLAM: The ethical approval to access CRIS data requires the data to be stored behind the hospital’s firewall with access monitored by strict guidelines via a patient-led oversight and governance committee. It is due to this restriction that the data cannot be made available within the manuscript, supporting files or via a public repository. However, data access for research purposes is possible to subject to approval from the oversight committee and can be initiated by contacting the CRIS oversight committee (). Camden & Islington: The data used in this work has been obtained from the Clinical Record Interactive Search (CRIS), a system which has been implemented at the Camden & Islington NHS Foundation Trust (C&I). It provides authorised researchers with regulated access to anonymised information extracted from patient electronic health records. CRIS is governed by a strict information governance scheme which forbids anyone except for authorised researchers from accessing its records. Access to CRIS is restricted to 1) C&I employees or 2) those having an honorary contract or letter of access from the Trust. Once an honorary contract is established, researchers can only access CRIS once they submit a research project proposal through the CRIS Project Application form. The form is available here: http://www.candi.nhs.uk/health-professionals/research/ci-research-database/researchers-and-clinicians For further details, contact: . Oxford: CRIS provides a means of analysing anonymised data from the Oxford Health NHS Foundation Trust electronic case records. Ethical approval for such analyses was provided by Oxfordshire REC C National Research Ethics Service in July 2015. Access to clinical information is clearly a sensitive issue, and a security model was developed which has been considered and approved by the Oxford Health NHS Foundation Trust Caldicott Guardian and the Trust Executive, as well as forming part of the ethics application. Funding: This paper represents independent research funded by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London, and by the NIHR Oxford Health Biomedical Research Centre at Oxford University and Oxford Health NHS Foundation Trust (grant BRC-1215-20005). This research was supported by researchers at the National Institute for Health Research University College London Hospitals Biomedical Research Centre, and by awards establishing the Farr Institute of Health Informatics Research at UCL Partners, from the Medical Research Council, Arthritis Research UK, British Heart Foundation, Cancer Research UK, Chief Scientist Office, Economic and Social Research Council, Engineering and Physical Sciences Research Council, National Institute for Health Research, National Institute for Social Care and Health Research, and Wellcome Trust (grant MR/K006584/1). This study was supported by the Case Record Interactive Search (CRIS) system funded and developed by the National Institute for Health Research (NIHR) Mental Health Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London using the system within the NIHR Oxford Health Biomedical Research Centre (Oxford University and Oxford Health NHS Foundation Trust). Competing interests: The authors have declared that no competing interests exist. Adverse Drugs Reactions (ADRs) are troublesome, potentially life-threatening, and associated with more extended hospital stays. Improving our ability to detect these unwanted effects can lead to a greater understanding of individual risks and better patient management, which would help reduce the burden on healthcare providers [1–3]. Clinical trials provide the first insight into ADR profiles before a drug is available to the public. Once the drug is on the market, new ADRs and medication errors are spontaneously reported by physicians and pharmacists through a number of schemes (e.g. the YellowCard scheme used by the UK Medicine Health Regulatory Agency, MHRA, and the Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) in the US [4]). However, these spontaneous reporting systems have limitations. There can be under-reporting of observed ADRs, as well as no reporting of ADRs that are perceived as non-serious. It is possible that many novel ADRs are never entered into the system because it is challenging to establish a causal link between the drug and adverse events. The data collected in Electronic Health Records (EHRs) as part of routine care may provide additional insight into adverse events in real-world settings as well as potentially identifying new positive indications of drugs [5]. Clozapine is a atypical, also known as second-generation, antipsychotic drug. It is widely recognised as the gold standard in the treatment of schizophrenia and the most effective antipsychotic drug in the management of treatment-resistant schizophrenia [6, 7]. Nevertheless, Clozapine is an underutilised medication [8] with only 54% of all eligible patients being prescribed Clozapine in the UK [9]. One of the primary reasons for its limited use is the concern over its side effects, some of which are potentially fatal and require frequent monitoring [10]. Two common side effects associated with Clozapine are weight gain and hypoglycaemia, which together can lead to type II diabetes [11, 12]. Others include abdominal pain, agitation, akathisia, amnesia, blurred vision, confusion, constipation, convulsions, delirium, delusion, diarrhoea, dizziness, dry mouth, enuresis, fatigue, fever, headache, heartburn, hallucination, hyperkinesia, hypersalivation, hypertension, hypotension, insomnia, nausea, rash, restlessness, seizures, sleeplessness, sweating, syncope, tachycardia, tremor, vomiting and decrease in White blood cells (WBC) [13–16]. These side effects can be dose-related [17, 18]. A more acute side effect is agranulocytosis, a severe and dangerous lowered WBC count. Therefore, as soon as the patient starts taking Clozapine, blood monitoring begins to determine whether or not the patient is at risk of agranulocytosis [10, 19–21]. Other severe but rare side effects include myocarditis, neutropenia, cardiomyopathy, Creatinine Phosphokinase (CPK) increase, hepatic narcosis and Steven Johnson Syndrome (SJS) [22]. Several studies have report Clozapine-induced ADRs, but those are generally limited by the duration of the study (6 weeks to 2 years), cohort size (31 to 110 patients) and the number of the ADRs discussed (1 to 18) [23–28]. Limited work has been done to characterise a large population of patients experiencing Clozapine-induced ADRs. In addition, little work has been done to understand the relationship between age and Clozapine-induced ADRs. The literature suggests that there is a positive relationship between age and Clozapine-induced ADRs such as weight gain and cardiovascular risks, mainly myocarditis and cardiomyopathy [29, 30]. In young patients, several short-term and long-term studies have reported Clozapine-induced ADRs in different hospital settings. However, those only cover a single or a few ADRs at a time, rather than a range of ADRs [31–39]. Schizophrenia patients are more likely to be smokers as compared to patients with other psychiatric disorders [40]. They are more likely to be heavy smokers (25 or more cigarettes daily) compared with only 11% of the general population of smokers [41, 42]. The chemicals in cigarette smoke induce enzymes that accelerate the metabolism of antipsychotic drugs [43, 44], requiring a higher dosage of Clozapine in smokers compared to non-smokers. There is no study showing comprehensive profiling of Clozapine-induced ADRs in the smoker vs non-smoker patient populations. Males and Females exhibit different responses to drug treatment [45, 46]. Studies suggest that women are at a higher risk of ADRs than men, and ADR-related hospital admissions are more common in females and older patients [47, 48]. Some studies suggest that weight gain, hypertension, dyslipidemia and other metabolic synderomes are more prevalent in female Clozapine patients, whereas cardiac-related ADRs such as arrhythmia, QT prolongation are more prevalent in males [49, 50]. Although often ignored, ethnicity plays a significant role in response to psychotropic medication [51, 52]. Studies suggest Clozapine is used much less in the black population compared to any other ethnicity due to the lower normal range of WBC count [52, 53]. Clozapine is also known to cause Type II diabetes in the black and Asian population [12, 54, 55]. We developed a natural language processing (NLP) pipeline, ADEPt [56] to detect and validate mentions of adverse drug events in free-text mental health clinical notes. The ADEPt is equipped with an improved rule-based, ADE dictionary and modified ConText algorithm and distinguish from positive to negative mentions of ADEs, from current to past ADEs occurrences by applying temporal reasoning and from general to patient-specific ADEs mentions. In this paper, we combine information about adverse drug events using the ADEPt pipeline with medication episodes mined from clinical text to enhance the understanding of adverse effects related to Clozapine. We also characterise the population experiencing ADRs with Clozapine and compare the results with the Side Effects Resource (SIDER) [57], complementing what is already known about demographics, smoking status and hospital admissions, and to find out how different subgroups are most likely to experience ADRs when administered Clozapine.
We obtained data from the Clinical Record Interactive Search (CRIS) system, a de-identified version of the EHR of three large mental health National Health Services (NHS) Foundation Trusts in the UK, made available for research through a patient-led model of governance and oversight [58]: i) the South London and Maudsley (SLAM) NHS Foundation Trust is one of the largest mental health care providers in Europe serving a population over 1.4 million residents in four major hospitals London [59], ii) the Camden and Islington (C&I) NHS Foundation Trust provides mental health services in North London [60, 61] and iii) Oxford Health NHS Foundation Trust, which covers a geographic catchment area containing 1.9 million UK residents [62]. The Trust is a group or network of hospitals that work together to deliver a range of services to their catchement area. Both SLAM and C&I trusts represent large, mixed, multicultural areas of London, UK. SLAM data came from patients who received care from January 2007 to December 2016; Camden & Islington data came from patients who received care from June 2009 to December 2015, and Oxford data came from patients who received care from January 2010 to December 2014. The combined data of the three trusts comprises over 500,000 patient records and over 50 million documents. We used both the unstructured and structured data within the EHRs to identify periods of on-off Clozapine medication, and then to identify subsequent Clozapine-induced ADRs.
Using the General Architecture for Text Engineering (GATE) medication NLP application [59], a Natural Language processing system that has been repeatedly used in drug-related studies [59, 63, 64], we developed an algorithm to infer periods of ‘on-treatment’ medication episodes. We created a drug dictionary using 260 drugs commonly used in the psychiatric hospitals for Mental, Behavioural and Neurodevelopmental Disorders. The drug dictionary consists of 11 drug categories: Antidepressants, Antidiabetics, Antiepileptics, Antihypertensives, Antipsychotics, Anti-Dementia, Hypnotics & Anxiolytics, Lipid Regulatory, Mood Stabilizers, Non-Steroidal Anti-Inflammatory and Anti-Parkinson. We used the British National Formulary (BNF) and electronic Medicines Compendium (eMC) [65, 66] to generate a robust list of all available brand names. In order to increase coverage, we included brand names that have been discontinued, as these may be present in the clinical notes. The algorithm has been previously used in a dementia study to identify trajectories of cognitive decline in the SLAM EHR [67]. The algorithm works in two stages. In the first stage, it maps all brand names onto the generic names of the drugs. In the second stage, the algorithm sorts the records by date and measures the length of the interval between consecutive dates. These dates include prescription dates and positive mentions of a drug in the clinical notes, indicating that a patient is taking/continuing a particular drug. We extracted dated of positive mentions of medications using a specialised GATE NLP application [59, 63, 64]. We set the threshold for determining a medication episode between two consecutive prescribing dates to 42 days (6 weeks). Although the common practice for prescribing psychotropic drugs is 28 days (4 weeks), they can be prescribed for 42 days based on patient availability. If the gap between two consecutive dates is less than 42 days (6 weeks), the algorithm searches for the next date until it finds the date where the difference between two dates is greater than 42 days, or no data point is available. In each episode, the algorithm counts the number of data points it used to conclude an episode. Once the difference between two dates is over 42 days, the algorithm concludes the episode using the last available date, and subsequently starts a new episode. The duration threshold can be changed according to the prescribing practices associated with the medication being studied.
We used our Adverse Drug Event annotation Pipeline ADEPt to mine ADEs from the free-text data. We applied the pipeline to extract 66 ADEs on the Oxford data and 110 ADEs from the SLAM and C&I data (more recent work used a larger ADE dictionary).
We set out to uncover associations between medication episodes and adverse events by combining the medication timeline with the adverse events mined by the ADEPt pipeline. This is done in a number of steps: First, multiple discussions of an ADE on a given day are collapsed into a single event, filtering out negative mentions accordingly. The ADR algorithm then queries the medication timeline to identify drugs that the patient was taking at the time of an adverse event and then creates an ADR event.
The study used the chi-square statistical test with Bonferroni correction to quantify the significance of ADR associations in relation to gender, ethnic background, age, and smoking and hospital status. The data taken into account was based on a monthly interval after starting the drug clozapine. R programming language version 3.2.4 was used to conduct statistical analysis. Once periods of prescribing and associated possible ADRs have been identified, the prevalence of ADEs across different subpopulations within the EHR was explored. As such, age, gender, ethnicity, smoking status, hospital admissions (inpatient/outpatient) were extracted for each patient in each Trust, where data was available. Although efforts were taken to replicate the study in all three NHS Trusts, the identification and extraction of age groups, smoking status and inpatient/outpatient status in Oxford NHS Trust data was not successful. We retrieved the date of birth, gender, ethnicity, smoking status, hospital admission and diagnosis from each trust. A patient’s age was calculated using the date the patients started Clozapine and was further divided into eight distinct categories. Gender and ethnicity were derived from the latest entry recorded in each NHS trust. The ethnicity was divided into four major groups, white, black, Asians and others. Smoking status was calculated six months before and after the date that the patient started Clozapine. Hospital admission status was measured by looking into the patient admission and discharge data. If the patient started the Clozapine during their hospital admission, we classified them as an inpatient. We established six categories of diagnoses according to using ICD-10 codes. Three of these categories came from Serious Mental Illness (SMI), Schizophrenia (ICD-10: F20-F29) excluding (ICD-10: F25), Schizoaffective (ICD-10: F25) and Bipolar (ICD-10: F31). The other three categories were any mental, behavioural and neurodevelopmental disorders (ICD-10: F01-F99) excluding SMI patients, any other diagnosis and diagnosis not available. We collected diagnoses six months before and after the first prescription of Clozapine. Where we were unable to identify records of diagnosis during a year period, we increased the time span until a diagnosis was established for a patient. We compared our results to SIDER where possible, to understand the prevalence of Clozapine-related ADRs in the real-world EHR data we have collected. We identified 2835 patients who have taken Clozapine for at least three months in three large mental health trusts. Table 1 gives the cohort characteristics of the three trusts with respect to gender, ethnic background, age groups, smoking status, hospital admission and diagnosis. The ADR algorithm was implemented across the three NHS trusts and was evaluated in SLAM and Camden & Islington to assess its performance in detecting associations between drugs and ADEs; access limitations prohibited evaluation in the Oxford trust. A set of 300 cases were randomly selected from each trust, and manual validation was carried out by two annotators in SLAM and one annotator in C&I reading through the clinical notes according to the following criteria:
The algorithm achieved a 0.89 Positive Predictive Value (PPV) in SLAM and a 0.87 PPV in C&I. The False Discovery Rate (FDR) was 0.1 in SLAM and 0.12 in C&I. In SLAM, the level of agreement between the two annotators for ADEs 93% and kappa (κ = 0.56) and for drugs 94% and kappa (κ = 0.56) was achieved. SIDER contains side effects information from RCTs and systems such as the FAERS [4]. As of January 2018, SIDER contained 23 different sources on Clozapine adverse effects such as post-marketing, FDA, labels, Medsafe and Health Canada. Although SIDER collates data from a number of sources, much of its data is from RCTs, which are typically run on small and narrow populations, with little information on how medications work in real-world settings. The study results were compared with SIDER where possible; to understand the prevalence of Clozapine-induced ADRs in the real-world EHR data. The comparison results with SIDER are summarised in Table 2 for 33 ADRs. The results show the percentages of patients by ADR in each trust (SLAM, Camden & Islington and Oxford) and SIDER. The results are stratified into monthly intervals from the initiation of Clozapine treatment, three months prospective and three months retrospective. The columns (Three Months Early, Two Months Early, One Month Early, One Month Later, Two Months Later, and Three Months Later) show the percentages in each monthly interval. The last two columns (SIDER Low End and SIDER High End) show the SIDER reporting from different clinical trials and FDA studies. The complete results stratified into monthly intervals are provided in the (S1 Table) for gender, ethnicity, age groups, smoking status and hospital admissions. The statistical analysis was first carried out separately for each of the trust. To prevent influence of cofounder such as age, gender, ethnicity, smoking status and hospital admission were performed individually. The results are available in (S2 Table). The p-value of the test is adjusted through Bonferroni correction. Gender and ethnicity showed no significant association with any ADRs. Age group showed significant association with agitation, fatigue, feeling sick, sedation and tachycardia. In hospital admission, of the 33 ADRs, 15 were found to have a significant association (abdominal pain, agitation, confusion, dizziness, diarrhoea, fatigue, headache, hypersalivation, hypotension, hypertension, insomnia, sedation, tachycardia, tremor and vomiting). Smoking status showed that of the 33 ADRs, 8 were found to have a significant association (abdominal pain, agitation, confusion, dizziness, diarrhoea, fatigue, sedation and tachycardia). The datasets from the three Trusts were later combined, and the chi-square statistical test was performed to estimate the average effect of ADRs in each monthly interval. The combined analysis showed a significant frequency distribution after Bonferroni p-value adjustment in the categorical variables (gender, ethnicity, age group, hospital admissions and smoking status) and a number of the ADRs as follows:
The study presents a medication continuity timeline (start and stop dates) for patients under Clozapine treatment to obtained detailed insight of Clozapine-induced ADRs using data from three large UK-based mental health Trusts comprising over 50 million documents and over half a million patients. The study uses the ADEPt NLP pipeline [56] to extract ADEs from free-text psychiatric EHRs, as well as a set of algorithms for creating a medication continuity timeline. The timeline was used to investigate associations between medications and ADEs, characterising ADR susceptibility with respect to patient demographics, hospital admission and smoking status. Furthermore, the medication algorithm was used to assert the start date and measure the length of drug therapies. We used Clozapine as a use case, but the current work can be replicated on any psychotropic drug. The algorithm found 2835 patients in the three trusts that started and continued Clozapine treatment for at least three months. The work presented in this paper has been run inclusively on all available ADEs in the ADEPt pipeline. We selected 33 ADRs to explore further as these were known side effects associated with Clozapine. We compared our results with SIDER, an online side effect resource. SIDER only reports on 25 out of the 33 ADRs. The 8 ADRs not reported in SIDER are fatigue, feeling sick, headache, hyperprolactinemia, weight loss, shaking, rash and stomach pain. These 8 ADRs show interesting results across the three trusts, as shown in Table 2. The prevalence of ADRs was assessed over gender, ethnic background, age groups, smoking status and hospital admission. We stratified the data using demographics information, mainly gender, ethnic background, age groups, hospital admission and smoking status. Sedation, fatigue, agitation, hypersalivation, tachycardia, constipation, dizziness and weight gain are the most common [68–72] and most highly-recorded ADRs in all three NHS trusts. When comparing hospital admission data (inpatient vs outpatient), our results show that the inpatient group have a higher recording of any ADRs when the patient starts Clozapine therapy compared to the outpatient group. This is due to the inpatient group being more frequently monitored and recorded by clinicians. A similar pattern is observed when comparing smoking status (smoker vs non-smoker). Rare ADRs such as agranulocytosis, myocarditis, SJS, cardiomyopathy and pericarditis are reported in the analysis. However, rare ADRs require much attention before they can be declared as positive. Currently, we are manually validating our results on agranulocytosis, myocarditis and neutropenia in parallel studies by going through the clinical notes to improve our assertions. We also acknowledge that the results still may be specific to the UK in some way. SLAM and C&I trusts are based in London, the population are from very mixed backgrounds, and ethnicities and Oxford trust population are more homogenous. Therefore we believe the results are specific to the UK but may not cover all UK specific concerns. This study was first conducted and evaluated using SLAM’s psychiatric clinical notes. In SLAM, evaluation of each step was carried out manually by at least two annotators, and an inter-annotator agreement was achieved where possible. Due to limited resources, several challenges were faced when implementing the work in C&I and Oxford Trusts. The ADEPt pipeline and ADR timeline were not manually validated in Oxford Trust and the results presented in the C&I Trust are based on a single annotator. Although the proposed work has shown good results, lower precision and recall were found in rare ADEs as they are frequently recorded as warnings, potential and suspected occurrences.
Ongoing work focuses on extending the analyses to longer periods of time, other drugs such as Lurasidone, drug-drug interaction of Clozapine, adding more patient-related features such as Body Mass Index (BMI), education, mobility, employment status, welfare benefits, homelessness, blood results and alcohol use. We are also planning to apply for this work on other drugs. Current work is underway on the antipsychotic Lurasidone and drug-drug interaction of Clozapine. ADR extraction from the free-text clinical documents can help clinicians and researchers to predict risks and interventions of drug administration. Often under-utilised due to the fatal nature of ADR, Clozapine is the most effective drug for treatment-resistant schizophrenia. A number of limited studies report Clozapine-induced ADRs. One of its kind in terms of cohort size and the variety of ADRs, this study characterises and provides insight into Clozapine-induced ADRs on a large population of patients across three large mental health hospitals. As well as providing novel findings, the proposed method demonstrates the utility of wider ADR extraction beyond Clozapine, and the study can be replicated using any psychotropic drug. In the future, this work will be expanded to define extended periods of on-treatment episodes, other drugs and more patient-related features in the statistical analysis.
Disclaimer: The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.
|