A Machine Learning Approach for Predicting Urine Output After Fluid Administration
Introduction
Sepsis is a global health problem with high mortality and morbidity. Currently intravenous fluid administration is recommended as the first-line resuscitative therapy for septic shock patients. However, excessive fluid administration could lead to complications like pulmonary edema, renal failure and impaired bowel function, and has been shown to be associated with increased mortality [1]. link to published paper in Computer Methods and Programs in Biomedicine
Fluid Non-responders and Urine Output
About 50% of patients with sepsis are actually not responsive to intravenous fluid resuscitation [2]. These “fluid non-responders” are at risk of being harmed by excessive fluid administration.
- Urine output (UO) is an important clinical parameter when assessing fluid balance, determining treatment strategy and predicting outcome.
Our Goal
Apply a machine learning method to predict whether a septic patient would have decreased UO or oliguria after fluid administration.
Data
Our data is from The Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) III database (v1.4) [11]. Sepsis was defined as a suspected infection (prescription of antibiotics and sampling of bodily fluids for microbiological culture) combined with evidence of organ dysfunction, defined by a Sequential Organ Failure Assessment (SOFA) score greater or equal to 2.
- 19,275 patients were included, totally 232,929 events.
- 135,735 events (58.27%) were labeled as having decreased UO
- 89,699 events (38.51%) were labeled as oliguria.
Preprocess
Data were included from up to 24 hours preceding the diagnosis of sepsis and until 48 hours following the onset of sepsis. For each patient, clinical data are aggregated into windows of 4 hours as an event. From each event, we extracted patient features as follow:
- demographics
- vital signs
- laboratory results
- amount of fluid administration
- amount of urine output
Dynamic Change of a Patient’s Condition
The dynamic change of a patient’s condition is an important clinical feature event [3], to bring this concept into practice, we extracted features acquired during the corresponding time window (Fc), as well as the features of precedent event (Fp).
Feature Sets
Outcome Measurements
We considered two outcome measurements regarding patients’ fluid status.
- Decreased amount of UO compared to that in the previous time window.
- Oliguria: defined as the amount of UO is less than 0.5 ml/kg per hour.
Model Architecture and Interpretability
A tree based algorithm is superior in interpretability to other machine learning algorithms, it provides information of which features are more decisive, this in turn can assist doctors in better understanding of patients’ condition and deciding a more accurate treatment plan. Hense, our model of choice is based on gradient tree boosting algorithm - eXtreme Gradient Boosting (XGBoost)[4].
- Advantages of XGBoost
- Copes better with sparsity of data.
- Surpasses traditional tree-based models by introducing regularization to avoid overfitting
- Utilizing gradient boosting to ensemble multiple tree models for better performance and also mitigates biases
Model Training and Features Selection
We trained XGBoost model using different features sets.
- Fc and Fp
- Fc alone
The importance of each features were ranked. Top ranked features were selected into different subsets ranging from top 15, 10, to 5 features. Individual models were trained on these subsets respectively.
Result
Regarding the attributes for training the models, adding the features of precedent event (Fp) slightly improved the performance of models. The more attributes used, the more accurate the models could be. Yet models using only the top 5 ranked features could still reach an optimal performance with AUC of 0.84 and 0.86 respectively.
Feature Rankings
Conclusion
- It is suggested that UO may be used to guide fluid resuscitation.
- We proposed an machine learning method in predicting septic patients’ responses to fluid resuscitation therapy, which yielded moderately accurate results with AUC up to 0.86.
- We exploited the concept of continuity of disease course by adding the features of precedent event.
- Models trained on the combined dataset demonstrated better result, which further concludes the idea that clinical conditions are better measured with dynamic indicators.
- Model using less features still could yield similar performance, demonstrating the clinical applicability of these models in clinical settings with limited data.
Limitations
- Retrospective clinical database: external validation with other clinical dataset would be needed to insure the clinical validity of this model.
- The clinical impact of this model on patients’ outcome require further clinical evaluation.
- We used 4 hours as our event window, which might be too narrow for some septic patients to generate clinically evident urine output increment.
Reference
- Sirvent J-M et al. Fluid balance in sepsis and septic shock as a determining factor of mortality. The American journal of emergency medicine. 2015
- Bentzer P et al. Will this hemodynamically unstable patient respond to a bolus of intravenous fluids? Jama. 2016
- Monnet X et al. Prediction of fluid responsiveness: an update. Annals of intensive care. 2016;
- Chen T, Guestrin C, editors. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining; 2016: ACM.