Loading…

Recent advances in document summarization

The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous appro...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge and information systems 2017-11, Vol.53 (2), p.297-336
Main Authors: Yao, Jin-ge, Wan, Xiaojun, Xiao, Jianguo
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33
cites cdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33
container_end_page 336
container_issue 2
container_start_page 297
container_title Knowledge and information systems
container_volume 53
creator Yao, Jin-ge
Wan, Xiaojun
Xiao, Jianguo
description The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.
doi_str_mv 10.1007/s10115-017-1042-4
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1937444984</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1937444984</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</originalsourceid><addsrcrecordid>eNp1kE9LxDAQxYMouK5-AG8LnjxEZ5o0bY6y6CosCKLnkH-VLjZdk1bQT29K9-DF0wzD772ZeYRcItwgQHWbEBBLClhRBF5QfkQWUKCkDFEcH3pkVXVKzlLaQQYF4oJcv3jrw7DS7ksH69OqDSvX27GbhmnsOh3bHz20fTgnJ43-SP7iUJfk7eH-df1It8-bp_XdllqGYqCIBiwTVhe2rBGs9rXxwNBJyaTgYIyXTeWs4SC8K3Why6zTaISotfOMLcnV7LuP_efo06B2_RhDXqlQsopzLmueKZwpG_uUom_UPrb52G-FoKZE1JyIyo-qKRE1aYpZkzIb3n384_yv6Bf9mmJb</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1937444984</pqid></control><display><type>article</type><title>Recent advances in document summarization</title><source>ABI/INFORM Global</source><source>Springer Link</source><creator>Yao, Jin-ge ; Wan, Xiaojun ; Xiao, Jianguo</creator><creatorcontrib>Yao, Jin-ge ; Wan, Xiaojun ; Xiao, Jianguo</creatorcontrib><description>The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.</description><identifier>ISSN: 0219-1377</identifier><identifier>EISSN: 0219-3116</identifier><identifier>DOI: 10.1007/s10115-017-1042-4</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Computer Science ; Data Mining and Knowledge Discovery ; Database Management ; Electronic documents ; Information processing ; Information Storage and Retrieval ; Information Systems and Communication Service ; Information Systems Applications (incl.Internet) ; Information technology ; IT in Business ; Sentences ; Summaries ; Survey Paper</subject><ispartof>Knowledge and information systems, 2017-11, Vol.53 (2), p.297-336</ispartof><rights>Springer-Verlag London 2017</rights><rights>Knowledge and Information Systems is a copyright of Springer, 2017.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</citedby><cites>FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</cites><orcidid>0000-0001-6887-1994</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1937444984/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1937444984?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74895</link.rule.ids></links><search><creatorcontrib>Yao, Jin-ge</creatorcontrib><creatorcontrib>Wan, Xiaojun</creatorcontrib><creatorcontrib>Xiao, Jianguo</creatorcontrib><title>Recent advances in document summarization</title><title>Knowledge and information systems</title><addtitle>Knowl Inf Syst</addtitle><description>The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.</description><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Database Management</subject><subject>Electronic documents</subject><subject>Information processing</subject><subject>Information Storage and Retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Information technology</subject><subject>IT in Business</subject><subject>Sentences</subject><subject>Summaries</subject><subject>Survey Paper</subject><issn>0219-1377</issn><issn>0219-3116</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp1kE9LxDAQxYMouK5-AG8LnjxEZ5o0bY6y6CosCKLnkH-VLjZdk1bQT29K9-DF0wzD772ZeYRcItwgQHWbEBBLClhRBF5QfkQWUKCkDFEcH3pkVXVKzlLaQQYF4oJcv3jrw7DS7ksH69OqDSvX27GbhmnsOh3bHz20fTgnJ43-SP7iUJfk7eH-df1It8-bp_XdllqGYqCIBiwTVhe2rBGs9rXxwNBJyaTgYIyXTeWs4SC8K3Why6zTaISotfOMLcnV7LuP_efo06B2_RhDXqlQsopzLmueKZwpG_uUom_UPrb52G-FoKZE1JyIyo-qKRE1aYpZkzIb3n384_yv6Bf9mmJb</recordid><startdate>20171101</startdate><enddate>20171101</enddate><creator>Yao, Jin-ge</creator><creator>Wan, Xiaojun</creator><creator>Xiao, Jianguo</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0001-6887-1994</orcidid></search><sort><creationdate>20171101</creationdate><title>Recent advances in document summarization</title><author>Yao, Jin-ge ; Wan, Xiaojun ; Xiao, Jianguo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Database Management</topic><topic>Electronic documents</topic><topic>Information processing</topic><topic>Information Storage and Retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Information technology</topic><topic>IT in Business</topic><topic>Sentences</topic><topic>Summaries</topic><topic>Survey Paper</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yao, Jin-ge</creatorcontrib><creatorcontrib>Wan, Xiaojun</creatorcontrib><creatorcontrib>Xiao, Jianguo</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Knowledge and information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yao, Jin-ge</au><au>Wan, Xiaojun</au><au>Xiao, Jianguo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recent advances in document summarization</atitle><jtitle>Knowledge and information systems</jtitle><stitle>Knowl Inf Syst</stitle><date>2017-11-01</date><risdate>2017</risdate><volume>53</volume><issue>2</issue><spage>297</spage><epage>336</epage><pages>297-336</pages><issn>0219-1377</issn><eissn>0219-3116</eissn><abstract>The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s10115-017-1042-4</doi><tpages>40</tpages><orcidid>https://orcid.org/0000-0001-6887-1994</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0219-1377
ispartof Knowledge and information systems, 2017-11, Vol.53 (2), p.297-336
issn 0219-1377
0219-3116
language eng
recordid cdi_proquest_journals_1937444984
source ABI/INFORM Global; Springer Link
subjects Computer Science
Data Mining and Knowledge Discovery
Database Management
Electronic documents
Information processing
Information Storage and Retrieval
Information Systems and Communication Service
Information Systems Applications (incl.Internet)
Information technology
IT in Business
Sentences
Summaries
Survey Paper
title Recent advances in document summarization
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T15%3A06%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recent%20advances%20in%20document%20summarization&rft.jtitle=Knowledge%20and%20information%20systems&rft.au=Yao,%20Jin-ge&rft.date=2017-11-01&rft.volume=53&rft.issue=2&rft.spage=297&rft.epage=336&rft.pages=297-336&rft.issn=0219-1377&rft.eissn=0219-3116&rft_id=info:doi/10.1007/s10115-017-1042-4&rft_dat=%3Cproquest_cross%3E1937444984%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1937444984&rft_id=info:pmid/&rfr_iscdi=true