Loading…

Potential use of text classification tools as signatures of suicidal behavior: A proof-of-concept study using Virginia Woolf's personal writings

The present study analyzes the feasibility of text classification to predict individual suicidal behavior. Entries from Virginia Woolf's diaries and letters were used to assess whether a text classification algorithm could identify written patterns associated with suicide. This is a text classi...

Full description

Saved in:
Bibliographic Details
Published in:PloS one 2018-10, Vol.13 (10), p.e0204820-e0204820
Main Authors: de Ávila Berni, Gabriela, Rabelo-da-Ponte, Francisco Diego, Librenza-Garcia, Diego, Boeira, Manuela V, Kauer-Sant'Anna, Márcia, Passos, Ives Cavalcante, Kapczinski, Flávio
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The present study analyzes the feasibility of text classification to predict individual suicidal behavior. Entries from Virginia Woolf's diaries and letters were used to assess whether a text classification algorithm could identify written patterns associated with suicide. This is a text classification study. We compared 46 text entries from the two months before Virginia Woolf's suicide with 54 texts randomly selected from Virginia Woolf's work during other periods of her life. Letters and diaries were included, while books, novels, short stories, and article fragments were excluded. The data was analyzed using a Naïve-Bayes machine-learning algorithm. The model showed a balanced accuracy of 80.45%, sensitivity of 69%, and specificity of 91%. The Kappa statistic was 0.6, which means a good agreement, and the p-value of the model was 0.003. The area under the ROC curve (AUC) was 0.80. In other words, the model exhibited good performance when used for classifying Virginia Woolf's diaries and letters. The present study showed the feasibility of a machine-learning model coupled with text to identify individual written patterns associated with suicidal behavior. Our text signature was able to identify the period of two months preceding suicide with a high accuracy. This technique may be applied to subjects with psychiatric disorders by means of data captured from social media, e-mail, among others. The algorithm may then predict a specific outcome and enable early intervention by clinicians.
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0204820