This is an overview page with metadata for this scientific work. The full article is available from the publisher.
Deep Inverse Reinforcement Learning for Sepsis Treatment
41
Citations
3
Authors
2019
Year
Abstract
Sepsis is a leading cause of mortality in hospitals, but its optimal treatment strategy remains unclear. Recent years have seen several successful applications of Reinforcement Learning (RL) approaches to sepsis treatment, achieving strategies far more efficient than those of clinicians. Such applications require an explicit reward function, encoding medical domain knowledge, to be specified beforehand to indicate the goal of learning. However, because sepsis itself is still poorly understood, there is considerable inconsistency in how reward functions for sepsis treatment are formulated. In this poster, we address the reward learning problem in RL for sepsis treatment, which has been largely neglected by previous studies. We propose a deep inverse RL with Mini-Tree (DIRL-MT) model that infers the best reward functions from a set of presumably optimal treatment trajectories drawn from retrospective real medical data. In the model, the MT component learns the factors that most strongly influence mortality during sepsis treatment, while the DIRL component infers the complete reward function in terms of weights on those factors. Our work shows that PaO2 and PT can play a vital role and deserve more attention in the design of more efficient treatment strategies for sepsis.
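The abstract describes the reward as weights placed on the factors the MT component selects. The sketch below illustrates only that idea, under the assumption that the reward is a plain weighted sum of normalized feature values. The function name `linear_reward`, the feature list, the weights, and the patient values are all hypothetical and are not taken from the paper; the DIRL component itself is not reproduced here.

```python
# Hypothetical feature list: the abstract names PaO2 and PT as important,
# but the full set of selected factors is not given on this page.
FEATURES = ["PaO2", "PT"]


def linear_reward(state, weights):
    """Return a weighted sum of the selected features of one patient state.

    Assumes `state` maps feature names to normalized values and `weights`
    maps the same names to learned coefficients (an assumed linear form).
    """
    return sum(weights[name] * state[name] for name in FEATURES)


# Illustrative numbers only, not results from the paper.
weights = {"PaO2": 0.6, "PT": -0.4}
state = {"PaO2": 0.5, "PT": 0.25}
reward = linear_reward(state, weights)
```

In an inverse RL setting, the weights would be the quantity inferred from the treatment trajectories rather than set by hand as they are here.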
Similar works
The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3)
2016 · 27,523 citations
pROC: an open-source package for R and S+ to analyze and compare ROC curves
2011 · 13,841 citations
APACHE II
1985 · 13,636 citations
Definitions for Sepsis and Organ Failure and Guidelines for the Use of Innovative Therapies in Sepsis
1992 · 13,190 citations
The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure
1996 · 11,537 citations