Unsupervised extractive multi-document summarization method based on transfer learning from BERT multi-task fine-tuning

Bibliographic Details
Published in: Journal of Information Science, 2023-02, Vol. 49 (1), p. 164-182
Main Authors: Lamsiyah, Salima; Mahdaouy, Abdelkader El; Ouatik, Saïd El Alaoui; Espinasse, Bernard
Format: Article
Language: English
Summary: Text representation is a fundamental cornerstone that impacts the effectiveness of several text summarization methods. Transfer learning using pre-trained word embedding models has shown promising results. However, most of these representations do not take into account word order or the semantic relationships between words in a sentence, and thus they do not carry the meaning of a full sentence. To overcome this issue, the current study proposes an unsupervised method for extractive multi-document summarization based on transfer learning from a BERT sentence embedding model. Moreover, to improve sentence representation learning, we fine-tune the BERT model on supervised intermediate tasks from the GLUE benchmark datasets, using both single-task and multi-task fine-tuning methods. Experiments are performed on the standard DUC’2002–2004 datasets. The results show that our method significantly outperforms several baseline methods and achieves comparable, and sometimes better, performance than recent state-of-the-art deep learning-based methods. Furthermore, the results show that fine-tuning BERT with multi-task learning considerably improves performance.
ISSN: 0165-5515
eISSN: 1741-6485
DOI: 10.1177/0165551521990616
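
The summary describes a pipeline in which sentences are represented with BERT sentence embeddings and a summary is extracted without supervision. The following is a minimal Python sketch of that kind of unsupervised extractive step: each sentence is embedded with a pre-trained BERT model via mean pooling and ranked by cosine similarity to the centroid of the document cluster. The mean-pooling and centroid-scoring choices, the bert-base-uncased checkpoint, and the toy sentences are illustrative assumptions; the paper's GLUE fine-tuned checkpoints and exact scoring function are not reproduced here.

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.eval()

def embed(sentences):
    # Mean-pool BERT token embeddings (masking padding) into sentence vectors.
    batch = tokenizer(sentences, padding=True, truncation=True,
                      return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state          # (B, T, H)
    mask = batch["attention_mask"].unsqueeze(-1).float()   # (B, T, 1)
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)    # (B, H)

def summarize(sentences, k=3):
    # Score each sentence by cosine similarity to the centroid of all
    # sentence embeddings; return the top-k in original document order.
    vecs = torch.nn.functional.normalize(embed(sentences), dim=1)
    centroid = torch.nn.functional.normalize(vecs.mean(dim=0), dim=0)
    scores = vecs @ centroid                               # (B,)
    top = scores.topk(min(k, len(sentences))).indices.sort().values
    return [sentences[i] for i in top]

# Toy multi-document input: sentences pooled from several documents.
sentences = [
    "BERT produces contextual embeddings for every token in a sentence.",
    "Pooling token embeddings yields a fixed-size sentence representation.",
    "The venue for the workshop was announced last week.",
    "Centroid similarity is a simple unsupervised sentence scorer.",
]
print(summarize(sentences, k=2))
```

Swapping bert-base-uncased for a checkpoint fine-tuned on GLUE intermediate tasks, as the paper proposes, would only change the from_pretrained argument; the unsupervised extraction step itself is unchanged.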