Loading…
Amplifying scientific paper’s abstract by leveraging data-weighted reconstruction
•This paper explores the impact of heterogeneous bibliographic network for generating scientific paper’s amplified abstract.•The amplified abstract is generated by leveraging target scientific paper’s abstract and citation sentence’s content and structure, which is addressed through document summari...
Saved in:
Published in: | Information processing & management 2016-07, Vol.52 (4), p.698-719 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •This paper explores the impact of heterogeneous bibliographic network for generating scientific paper’s amplified abstract.•The amplified abstract is generated by leveraging target scientific paper’s abstract and citation sentence’s content and structure, which is addressed through document summarization manner.•Sentence’s weight is learned by exploiting regularization for ranking on heterogeneous bibliographic network.•Data-weighted reconstruction is proposed to assign different priority to sentences when reconstructing the original document.•Various evaluation metrics are designed to validate the effectiveness of our approach.
In this paper, we focus on the problem of automatically generating amplified scientific paper’s abstractwhich represents the most influential aspects of scientific paper. The influential aspects can be illustrated by the target scientific paper’s abstract and citation sentences discussing the target paper, which are provided in papers citing the target paper. In this paper, we extract representative sentences through data-weighted reconstruction approach(DWR) by jointly leveraging target scientific paper’s abstract and citation sentences’ content and structure. In our study, we make two-folded contributions.
Firstly, sentence’s weight was learned by exploiting regularization for ranking on heterogeneous bibliographic network. Specially, Sentences-similar-Sentencesrelationship was identified by language modeling-based approach and added to the bibliographic network. Secondly, a data-weighted reconstruction objective function is optimized to select the most representative sentences which reconstructs the original sentence set with minimum error. In this process, sentences’ weight plays a critical role. Experimental evaluation over real dataset confirms the effectiveness of our approach. |
---|---|
ISSN: | 0306-4573 1873-5371 |
DOI: | 10.1016/j.ipm.2015.12.014 |