Relationship between the sentiment of nursing notes and one-year mortality of elderly sepsis patients
Li Sheng, Qin Zheng

TL;DR
This study found that the sentiment in nursing notes is linked to the one-year survival of elderly sepsis patients.
Contribution
It introduces sentiment analysis of nursing notes as a novel predictor for sepsis patient mortality.
Findings
Lower sentiment polarity and higher subjectivity in nursing notes correlate with higher one-year mortality.
Sentiment polarity has a linear relationship with mortality, while subjectivity shows a nonlinear pattern.
Sentiment features are independently associated with one-year mortality in elderly sepsis patients.
Abstract
This study aimed to explore the correlation between the sentiment of nursing notes and the one-year mortality of sepsis patients. The box plot was used to compare the differences in sentiment polarity/sentiment subjectivity between different groups. Multivariate logistic regression was used to explore the correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients. Ridge regression, XGBoost regression, and random forest were used to explore the importance of sentiment polarity and subjectivity in the one-year mortality of elderly sepsis patients. Restricted cubic spline (RCS) was used to explore whether there was a linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients. Kaplan-Meier (KM) curve was used to explore the relationship between the sentiment polarity (or…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig 1
Fig 2
Fig 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychology of Moral and Emotional Judgment · Psychosomatic Disorders and Their Treatments · Optimism, Hope, and Well-being
1 Introduction
Sepsis is a serious life-threatening disease, which is characterized by the imbalance of immune response caused by infection and life-threatening multiple organ dysfunction. At present, about 48.9 million people in the world suffer from sepsis, and 11.1 million people die of sepsis [1]. Sepsis cases mainly occur in low-income or middle-income countries. The prevalence rates of severe sepsis in surgical Intensive Care Unit (ICU) and general ICU in China were 8.7% and 37.3% respectively [2]. The prevalence of sepsis in the ICU in India was 33.2% [3]. The overall prevalence of sepsis in ICU is 24.5% in low-middle-income areas in Asia [4]. The overall prognosis of sepsis patients is still poor despite the improvement of medical technology. A systematic review indicated that the mortality rate of sepsis patients in China was 29.0% [5] The in-hospital mortality rate of sepsis in the American population is 20.52% [6]. Therefore, it is necessary and urgent to actively explore the factors related to the prognosis of sepsis.
Sentiment analysis is a process of analyzing, processing, inducing, and reasoning subjective texts with sentiment color, which is mainly based on text data and is the main content of natural language processing. The purpose of sentiment analysis is to understand the author’s critical attitude (support or opposition to likes or dislikes, etc.) or sentiment state (happiness, anger, sadness, fear, etc.) towards an entity (including products, services, people, organizations, events, and topics) in the text. In recent years, sentiment analysis has been gradually extended to the medical field. It is found that keyword extraction and sentiment analysis based on discharge reports have certain predictive performance for 30-day unplanned readmission [7]. Sentiment in nursing notes was correlated with 30-day mortality and survival rate of ICU patients [8].
At present, there are few studies on sepsis and the sentiment of nursing notes. The existing studies mainly focus on the short-term prognosis of sepsis patients, but no research has focused on the long-term prognosis. Therefore, this study explored the correlation between the sentiment of nursing notes and the one-year mortality of sepsis patients.
2 Methods
2.1 Data source and indicators
The patient data came from the Medical Information Mart for Intensive Care (MIMIC)-IV database, which was an open database containing the data of all ICU patients in internal medicine and surgery at Beth Israel Deaconess Medical Center of Harvard Medical School [9].
2.2 Inclusion and exclusion criteria
Inclusion criteria: diagnosed with sepsis, older than 60 years of age.
Exclusion criteria: (1) Missing time of death after discharge from hospital, hospital death. (2) Miss clinical records. (3) Note writing fuzzy cannot be identified; (4) Have been identified as wrong notes by the doctor; (5) Notes less than 12 hours before death.
2.3 Sentiment analysis
Python and Text Blob were used to extract sentiment polarity and sentiment subjectivity from nursing records [8]. Given a string of texts, the Pattern module of TextBlob was marked, retrieved, and calculated. Specifically, the module includes a dictionary of English adverbs and adjectives, which can be mapped to the polarity and subjectivity of sentiment scores [10].The TextBlob returns two scores: one indicating the sentiment polarity and the other indicating the sentiment subjectivity. The sentiment polarity score ranges from -1–1, with higher scores indicating a more positive sentiment. The sentiment subjectivity score ranges from 0 to 1, with higher scores indicating a more subjective sentiment. This study calculated the sentiment score by creating a TextBlob object initialized using a nursing record string and extracting the ‘ sentiment ‘ attribute from the object. Each individual nursing note of the patient was calculated to obtain an emotional polarity score and an emotional subjectivity score. Then the average score of each patient’s polarity and subjectivity was calculated as the patient’s two nursing emotional characteristic indicators.
2.4 Indicators
In this study, the primary outcome was one-year mortality.
Indicators with more than 30 missing data were excluded. Indicators included age, glucose, sentiment polarity, sentiment subjectivity, systolic blood pressure (SBP), diastolic blood pressure (DBP), platelets, anion gap, chloride, creatinine, red cell distribution width (RDW), white cell count (WBC), red blood cell count (RBC), Charlson Comorbidity Index (CCI), Glasgow coma scale (GCS), Sequential Organ Failure Assessment (SOFA), Simplified Acute Physiology Score II (SAPS II), Oxford acute severity of illness score (OASIS), systemic inflammatory response syndrome (SIRS), LOS (Length of stay), Length of stay in ICU, gender (male, female), tobacco (yes, no), alcohol (yes, no), surgery (yes, no), diabetes (yes, no), hypertension (yes, no), renal disease (yes, no), coronary artery disease (yes, no), mechanical ventilation (yes, no), sepsis shock (yes, no), acute kidney injury (AKI) stage (0,1,2,3).
One-year mortality: the patient dies within one year after discharge from the hospital due to illness or other reasons.
2.5 Statistical analysis
R language was used for data analysis. The median (P_25_-P_75_) was used to describe the quantitative data of non-normal distribution, and the Mann-Whitney U test was used to compare between groups. Qualitative data were described by composition comparison, and the chi-square test was used to compare between groups. The box plot was used to compare the differences in sentiment polarity/sentiment subjectivity between different groups. Multivariate logistic regression was used to explore the independent correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients after controlling for confounding variables, with a significant difference between the two groups. Ridge regression, XGBoost regression, and random forest were used to explore the importance of sentiment polarity and subjectivity in the one-year mortality of elderly sepsis patients. Restricted cubic spline (RCS) was used to explore whether there was a linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients. Kaplan-Meier (KM) curve was used to explore the relationship between the sentiment polarity (or sentiment subjectivity) and the 1-year death of the patient. P < 0.05 is considered to be statistically significant.
3 Results
3.1 Patient baseline information
A total of 5305 patients with sepsis were enrolled in this study, of which 3632 patients died within one year. As can be seen from Table 1, there were differences in age, sentiment polarity, sentiment subjectivity, RDW, WBC, RBC, anion gap, chloride, CCI, GCS, SAPS II, OASIS, LOS, Length of stay in ICU, tobacco, alcohol, coronary artery disease, diabetes, hypertension, separate shock and AKI stage.
Table 1: Patient baseline data.
As shown in Fig 1A-1B, compared with the control group, the one-year mortality group year had lower sentiment polarity and higher sentiment subjectivity. However, there was no difference in sentiment polarity between men and women (P = 0.871, Fig 1C). The same result was found in sentiment subjectivity (P = 0.141, Fig 1D).
The box plot of sentiment polarity/sentiment subjectivity between different groups, (A) sentiment polarity between control and death group, (B) sentiment subjectivity between control and death group, (C) sentiment polarity between male and female group, (D) sentiment subjectivity between male and female group.
3.2 Correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients
We used multivariate logistic regression to further explore whether sentiment polarity and sentiment subjectivity were independently related to the one-year mortality of elderly sepsis patients. The results of multivariate logistic regression (Table 2) showed that sentiment polarity and sentiment subjectivity were independently related to one-year mortality of elderly sepsis patients after adjusting for confounding variables, in which sentiment polarity was a protective factor and sentiment subjectivity was a risk factor.
Table 2: Correlation between sentiment polarity/sentiment subjectivity and one-year mortality of elderly sepsis patients.
Three machine learning models explored the independent factors of one-year mortality of elderly sepsis patients, and the results (Table 3) suggested that sentiment polarity and sentiment subjectivity were major factors.
Table 3: The importance of sentiment polarity and subjectivity in the one-year mortality of elderly sepsis patients.
We further explored whether there was a linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients. Fig 2A-2B showed that there was a linear relationship between sentiment polarity and the one-year mortality of elderly sepsis patients. At the same time, there was a nonlinear relationship between sentiment subjectivity and the one-year mortality of elderly sepsis patients. KM results showed that high subjectivity was beneficial to the 1-year survival of patients (Fig 3A, HR = 0.70 95% CI [0.66,0.75] p < 0.05), and low polarity was advantageous to the 1-year survival of patients (Fig 3B, HR = 0.89,95% CI [0.84, 0.96])(The high and low groups were divided according to the median.).
The linear relationship between sentiment polarity, sentiment subjectivity and the one-year mortality of elderly sepsis patients, (A) sentiment polarity, (B) sentiment subjectivity.
The KM of sentiment polarity and sentiment subjectivity, (A) sentiment subjectivity, (B) sentiment polarity.
4 Discussion
4.1 Nursing sentiment and prognosis
This study was based on the MIMIC-IV database to explore the correlation between the sentiment of nursing notes and the one-year mortality of sepsis patients. We found that sentiment polarity and sentiment subjectivity were independently related to one-year mortality of elderly sepsis patients.
Emotional scores in nursing notes reflect the health care provider’s attitude toward the patient. This emotion or attitude generally arises when the health care provider communicates with the patient. Therefore, the emotional attitude of the health care provider not only reflects the physician’s assessment of the patient’s current physical status, but is also based to some extent on feedback about the efficacy of the patient’s treatment. This feedback largely influences the patient’s adherence to subsequent treatment [11,12]. Specifically, the positive mood of the healthcare provider towards the patient indicates that the patient’s treatment is working well. After receiving such feedback, the patient’s trust in the treatment is increased, which makes them more willing to follow the medical advice and cooperate with the treatment. A more positive attitude and compliance with treatment lead to better patient outcomes. This results in a positive cycle that accelerates patient healing. For example, a cohort study of diabetic patients showed that a highly empathetic physician’s positive attitude toward the patient significantly increased the patient’s adherence to treatment, which in turn increased the rate of glycohemoglobin control [13]. In addition, healthcare professionals’ attitudes affect patients’ psychological state and physiological responses. Positive emotions reduce patients’ anxiety and depression levels. A randomized controlled trial showed that nurses trained in communication built better trust with patients, which led to lower postoperative anxiety and pain [14]. Chronic stress due to illness activates the hypothalamic-pituitary-adrenal (HPA) axis, leading to elevated cortisol and suppressed immune function [15]. Physicians’ emotions toward patients may affect the patients’ HPA axis, influencing their levels of inflammatory markers and thus their prognosis [16,17].
In this study, we found that sentiment polarity and sentiment subjectivity were independently related to one-year mortality of elderly sepsis patients. Sentiment polarity was negatively related to one-year mortality of elderly sepsis patients and sentiment subjectivity was positively related to one-year mortality of elderly sepsis patients. A study based on the MIMIC-III database found that sentiment polarity and sentiment subjectivity were independently associated with the 28-day hospitalization mortality of sepsis patients, and both sentiment polarity and sentiment subjectivity were negatively related to the 28-day hospitalization mortality of sepsis patients [18]. A similar relationship was also found in another study that explored the emotion of nursing notes and the 30-day death of sepsis patients [19]. Our results were inconsistent with those of other people’s research, which may be mainly related to the inconsistency of the selected research objects and research nodes. Specifically, the dependent variables in Gao et al. ‘s [18] and Zhou et al. ‘s [19] studies were 28-day and 30-day in-hospital deaths, respectively, whereas the dependent variable in our study was 1-year mortality. Different criteria for exclusion, which in turn lead to heterogeneity in the study population. Their studies included 1851 and 1844 patients, respectively, whereas our study included 5305 patients. The sentiment of medical staff’s notes not only affected the prognosis of sepsis patients but also affected the prognosis of other diseases. McCoy and others conducted sentiment analysis on the discharge records of psychiatric inpatients and patients in general medical units from 2011 to 2014, and found that more positive sentiments were related to patients’ lower risk of readmission [20]. In another study of severe acute renal failure, it was found that compared with the survival group, the sentiment subjectivity score of the death group was higher, the sentiment polarity score was lower, and the prediction model with sentiment score had better prediction performance [21]. Another study found similar results [8]. Studies on the correlation between sentiment polarity and prognosis have found a negative correlation between emotional polarity and prognosis, indicating that the more positive the sentiment of medical staff, the better the prognosis of patients. This also suggested that medical staff need to maintain a positive attitude in the daily diagnosis and treatment process.
4.2 Other factors and prognosis in sepsis
In this study, we also found some influencing factors of one-year mortality of elderly sepsis patients, such as age, CCI, and RDW. Many studies have found that age is an influential factor in the death of sepsis patients. Compared with young people, elderly patients have weakened physical resistance and impaired physiological barriers. CCI is the most commonly used comorbidity assessment tool at present. The higher the score, the more serious the number and degree of underlying diseases are. Elderly patients often have more basic diseases (comorbidities), and many common comorbidities increase the risk of infection and death [22]. RDW is a parameter reflecting the volume heterogeneity of peripheral red blood cells, which is closely related to the prognosis of many diseases. At the present stage, most studies focus on the influence of patients themselves on the prognosis, and few studies pay attention to the influence of medical staff. This study suggests that we should also pay attention to the influence of medical staff’s behavior or sentiment on the prognosis of diseases in subsequent studies.
4.3 Strengths and limitations
This study explored the relationship between the emotional score of nursing notes and the death of elderly patients with sepsis within one year. We found its potential in predicting the death of patients within one year, which suggested the application value of nursing notes in clinical practice. However, some limitations also existed in this study. Firstly, due to the limited data in the database, this study lacked some information about biomarkers of sepsis patients, such as Uncoupling Protein 2. Secondly, this study only revealed the association and cannot determine the causal relationship between them. Thirdly, TextBlob has a weak ability to analyze complex sentences, and may have errors in the emotional direction of complex sentences in nursing notes and form bias for subjectivity and polarity scores, which affects the accuracy of the conclusion. Fourthly, the subjective nature of nursing notes may be influenced by the personality of the health care provider. Fifthly, the study was not validated with external data due to the lack of suitable external data at present. In the future, we will conduct data collection in our hospital to validate the findings. Sixthly, this study was a single-center retrospective study, which inevitably had some bias. For example, MIMIC comprises deidentified health-related data from patients who were admitted to the critical care units of the Beth Israel Deaconess Medical Center. In the United States, this center is a top 3 hospital with a suitable staff ratio and a high level of nurse experience. Therefore, the conclusion of this study may only be suitable for such large hospitals. For other small hospitals, the conclusion may not be suitable. In addition, patients from a single center are biased and heterogeneous with patients from other hospitals (such as disease severity, race, social status, family economics, education level, etc.). These results may result in the conclusions obtained in this study not being able to be verified in patients from other hospitals.
4.4 Future research directions
Future research should focus on exploring the causal relationship between sentiment scores in nursing notes and patient prognosis via well-designed clinical trials, detailing the mechanisms between the two. Then, develop a multimodal prediction model that integrates sentiment analysis of nursing texts with traditional clinical scoring systems, and enhance the generalization of the model through multicenter validation; at the same time, it is necessary to build a real-time automated monitoring tool, incorporate dynamic sentiment indicators into the clinical decision support system to achieve early warning of high-risk patients, and deeply analyze the intrinsic relationship between sentiment characteristics (such as negative/positive word combination patterns) and pathophysiological mechanisms. In addition, the impact of cross-language and cultural differences on the validity of sentiment analysis needs to be addressed urgently, and adaptive algorithms (such as Chinese subjective dictionary construction technology) need to be developed to improve the universality of application in different medical contexts.
5 Conclusions
The sentiment of nursing notes was correlated with the one-year mortality of elderly sepsis patients.
Supporting information
S1 FileRaw data.(XLSX)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Rudd KE, Johnson SC, Agesa KM, Shackelford KA, Tsoi D, Kievlan DR, et al. Global, regional, and national sepsis incidence and mortality, 1990-2017: analysis for the Global Burden of Disease Study. Lancet. 2020;395(10219):200–11.31954465 10.1016/S 0140-6736(19)32989-7PMC 6970225 · doi ↗ · pubmed ↗
- 2Jiang W. Current status of sepsis epidemiology in China. J Grad Med. 2019;31(1):5–8.
- 3Hammond NE, Kumar A, Kaur P, Tirupakuzhi Vijayaraghavan BK, Ghosh A, Grattan S, et al. Estimates of sepsis prevalence and outcomes in adult patients in the ICU in India: a cross-sectional study. Chest. 2022;161(6):1543–54. doi: 10.1016/j.chest.2021.12.673 35092747 · doi ↗ · pubmed ↗
- 4Li A, Ling L, Qin H, Arabi YM, Myatra SN, Egi M, et al. Epidemiology, management, and outcomes of sepsis in IC Us among countries of differing National Wealth across Asia. Am J Respir Crit Care Med. 2022;206(9):1107–16. doi: 10.1164/rccm.202112-2743 OC 35763381 · doi ↗ · pubmed ↗
- 5Liu Y-C, Yao Y, Yu M-M, Gao Y-L, Qi A-L, Jiang T-Y, et al. Frequency and mortality of sepsis and septic shock in China: a systematic review and meta-analysis. BMC Infect Dis. 2022;22(1):564. doi: 10.1186/s 12879-022-07543-8 35729526 PMC 9210671 · doi ↗ · pubmed ↗
- 6Xu F, Zhang L, Huang T, Yang R, Han D, Zheng S, et al. Influence of ambulatory blood pressure-related indicators within 24 h on in-hospital death in sepsis patients. Int J Med Sci. 2022;19(3):460–71. doi: 10.7150/ijms.67967 35370467 PMC 8964320 · doi ↗ · pubmed ↗
- 7Zubillaga A, Laccourreye P, Kerexeta J, Larburu N, Alonso E, Gomez DJ. Hospital readmission prediction via keyword extraction and sentiment analysis on clinical notes. Stud Health Technol Inform. 2022;295:339–42.35773878 10.3233/SHTI 220732 · doi ↗ · pubmed ↗
- 8Waudby-Smith IER, Tran N, Dubin JA, Lee J. Sentiment in nursing notes as an indicator of out-of-hospital mortality in intensive care patients. P Lo S One. 2018;13(6):e 0198687. doi: 10.1371/journal.pone.0198687 29879201 PMC 5991661 · doi ↗ · pubmed ↗
