Loading…

Binary search of the optimal cut-point value in ROC analysis using the F 1 score

This paper introduces a binary search algorithm for determining the optimal probability cut-point value ( C ) of binary classifiers. Cut-points are operating points on the receiver operating characteristic curve that divide positive and negative predictions. Compared to the traditional exhaustive se...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of physics. Conference series 2023-10, Vol.2609 (1), p.12002
Main Authors:	Tan, Swee Chuan, Zhu, Siying
Format:	Article
Language:	English
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	This paper introduces a binary search algorithm for determining the optimal probability cut-point value ( C ) of binary classifiers. Cut-points are operating points on the receiver operating characteristic curve that divide positive and negative predictions. Compared to the traditional exhaustive search for optimal C value, the proposed method offers execution time efficiency ( O (log 2 ( k ))) and a small cut-point error of 1/2 n after k steps of binary search. Traditionally, the optimal C value is determined by stepping through all possible C values. This search is uninformed because there is no indication of the search direction. To address this issue, we derive the expectation of the F-Measure (aka F 1 score); and use it to guide the search process. Specifically, by comparing the F-Measure at the current cut-point with the F-Measure at expected cut-point, we can use the information to adjust C dynamically towards the optimal cut-point, resulting in optimal model performance. Our results on two classifiers trained from disease classification datasets suggest that the algorithm is robust and efficient, as compared to the traditional methods.
ISSN:	1742-6588 1742-6596
DOI:	10.1088/1742-6596/2609/1/012002