Loading…

Argument mining as rapid screening tool of COVID-19 literature quality: Preliminary evidence

The COVID-19 pandemic prompted the scientific community to share timely evidence, also in the form of pre-printed papers, not peer reviewed yet. To develop an artificial intelligence system for the analysis of the scientific literature by leveraging on recent developments in the field of Argument Mi...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in public health 2022-07, Vol.10, p.945181
Main Authors: Brambilla, Gianfranco, Rosi, Antonella, Antici, Francesco, Galassi, Andrea, Giansanti, Daniele, Magurano, Fabio, Ruggeri, Federico, Torroni, Paolo, Cisbani, Evaristo, Lippi, Marco
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The COVID-19 pandemic prompted the scientific community to share timely evidence, also in the form of pre-printed papers, not peer reviewed yet. To develop an artificial intelligence system for the analysis of the scientific literature by leveraging on recent developments in the field of Argument Mining. Scientific quality criteria were borrowed from two selected Cochrane systematic reviews. Four independent reviewers gave a blind evaluation on a 1-5 scale to 40 papers for each review. These scores were matched with the automatic analysis performed by an AM system named MARGOT, which detected claims and supporting evidence for the cited papers. Outcomes were evaluated with inter-rater indices (Cohen's Kappa, Krippendorff's Alpha, s statistics). MARGOT performs differently on the two selected Cochrane reviews: the inter-rater indices show a fair-to-moderate agreement of the most relevant MARGOT metrics both with Cochrane and the skilled interval scores, with larger values for one of the two reviews. The noted discrepancy could rely on a limitation of the MARGOT system that can be improved; yet, the level of agreement between human reviewers also suggests a different complexity between the two reviews in debating controversial arguments. These preliminary results encourage to expand and deepen the investigation to other topics and a larger number of highly specialized reviewers, to reduce uncertainty in the evaluation process, thus supporting the retraining of AM systems.
ISSN:2296-2565
2296-2565
DOI:10.3389/fpubh.2022.945181