A Reinforcement Learning Framework for Personalized Anticoagulation Dosing in Critical Care: Integrating Batch-Constrained Policy Optimization and Off-Policy Evaluation

Lim, Yooseok; Park, In-Beom; Lee, Sujee

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A Reinforcement Learning Framework for Personalized Anticoagulation Dosing in Critical Care: Integrating Batch-Constrained Policy Optimization and Off-Policy Evaluationopen access

Authors: Lim, Yooseok; Park, In-Beom; Lee, Sujee

Issue Date: 2025

Publisher: IEEE

Keywords: Batch-Constrained Policy; Medical Information Mart for Intensive Care; Off-Policy Evaluation; Personalized Heparin Dosing Policy; Reinforcement Learning

Citation: IEEE Access, v.13, pp 203145 - 203157

Pages: 13

Indexed: SCIE
SCOPUS

Journal Title: IEEE Access

Volume: 13

Start Page: 203145

End Page: 203157

URI: https://scholarworks.dongguk.edu/handle/sw.dongguk/62244

DOI: 10.1109/ACCESS.2025.3638417

ISSN: 2169-3536
2169-3536

Abstract: Precise medication dosing in the intensive care unit (ICU) is vital for patient survival. Heparin, a widely used anticoagulant, requires careful administration due to patient-specific variability, and inappropriate dosing can cause severe complications such as stroke or hemorrhage. This study introduces a reinforcement learning (RL)-based decision-support framework for heparin dosing, integrating offline RL algorithms with rigorous evaluation. We employ Batch-Constrained Q-Learning (BCQ) to learn an optimal dosing policy from retrospective data, addressing distributional shift inherent in offline settings. The dosing policies are trained on the MIMIC-III database and evaluated on the MIMIC-IV database, and vice versa. Policy effectiveness is assessed through multiple off-policy evaluation (OPE) methods, demonstrating higher expected returns than clinician-derived strategies. Interpretability is enhanced through t-SNE visualization, showing that Q-values are well aligned with therapeutic aPTT targets. To our knowledge, this is the first study to combine BCQ, multi-metric OPE, and interpretability analysis for anticoagulation management across two large-scale ICU cohorts. By advancing both methodological rigor and clinical relevance, this work provides a foundation for reliable RL-based decision-support systems in critical care. © 2013 IEEE.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > Department of Industrial and Systems Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Park, In Beom photo

Park, In Beom: College of Engineering (Department of Industrial and Systems Engineering)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

30, Pildong-ro 1-gil, Jung-gu, Seoul, 04620, Republic of Korea+82-2-2260-3114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE