On selecting a probabilistic classifier for appointment no-show prediction

作者：

Highlights：

• We study the integration of patient no-show prediction in appointment scheduling.

• Clinics should not use AUC to measure the performance of patient no-show prediction.

• They should use Brier's score or the Log Loss instead.

摘要

Appointment no-shows are disruptive to healthcare clinics, and may increase patient waiting time and clinic overtime, resulting in increased clinic costs. Appointment scheduling models typically mitigate the negative effects of no-shows through appointment overbooking. Recent work has proposed a predictive overbooking framework, where a probabilisitic classifier predicts the no-show probability of individual appointment requests, and a scheduling algorithm uses those predictions to optimally schedule appointments. Because predicting no-shows is typically an imbalanced classification problem, the preferred classifier is often chosen based upon the area under the receiver operator characteristic curve (AUC), which is a commonly used metric for many other imbalanced classification problems. Contrary to intuition, in this paper we show that employing the AUC to select a classifier results in significantly lower schedule efficiency than using other metrics such as Log Loss or Brier Score. Our computational experiments, validated on large real-world appointment data, suggest that by using Log Loss or Brier Score instead of AUC, practitioners can improve the schedule quality by 3–7%.

论文关键词：Healthcare,Appointment no-show,Appointment scheduling,Classification,Classification performance

论文评审过程：Received 9 June 2020, Revised 3 November 2020, Accepted 5 December 2020, Available online 9 December 2020, Version of Record 26 January 2021.

论文官网地址：https://doi.org/10.1016/j.dss.2020.113472