Loading…
Recent advances in document summarization
The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous appro...
Saved in:
Published in: | Knowledge and information systems 2017-11, Vol.53 (2), p.297-336 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33 |
---|---|
cites | cdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33 |
container_end_page | 336 |
container_issue | 2 |
container_start_page | 297 |
container_title | Knowledge and information systems |
container_volume | 53 |
creator | Yao, Jin-ge Wan, Xiaojun Xiao, Jianguo |
description | The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions. |
doi_str_mv | 10.1007/s10115-017-1042-4 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1937444984</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1937444984</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</originalsourceid><addsrcrecordid>eNp1kE9LxDAQxYMouK5-AG8LnjxEZ5o0bY6y6CosCKLnkH-VLjZdk1bQT29K9-DF0wzD772ZeYRcItwgQHWbEBBLClhRBF5QfkQWUKCkDFEcH3pkVXVKzlLaQQYF4oJcv3jrw7DS7ksH69OqDSvX27GbhmnsOh3bHz20fTgnJ43-SP7iUJfk7eH-df1It8-bp_XdllqGYqCIBiwTVhe2rBGs9rXxwNBJyaTgYIyXTeWs4SC8K3Why6zTaISotfOMLcnV7LuP_efo06B2_RhDXqlQsopzLmueKZwpG_uUom_UPrb52G-FoKZE1JyIyo-qKRE1aYpZkzIb3n384_yv6Bf9mmJb</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1937444984</pqid></control><display><type>article</type><title>Recent advances in document summarization</title><source>ABI/INFORM Global</source><source>Springer Link</source><creator>Yao, Jin-ge ; Wan, Xiaojun ; Xiao, Jianguo</creator><creatorcontrib>Yao, Jin-ge ; Wan, Xiaojun ; Xiao, Jianguo</creatorcontrib><description>The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.</description><identifier>ISSN: 0219-1377</identifier><identifier>EISSN: 0219-3116</identifier><identifier>DOI: 10.1007/s10115-017-1042-4</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Computer Science ; Data Mining and Knowledge Discovery ; Database Management ; Electronic documents ; Information processing ; Information Storage and Retrieval ; Information Systems and Communication Service ; Information Systems Applications (incl.Internet) ; Information technology ; IT in Business ; Sentences ; Summaries ; Survey Paper</subject><ispartof>Knowledge and information systems, 2017-11, Vol.53 (2), p.297-336</ispartof><rights>Springer-Verlag London 2017</rights><rights>Knowledge and Information Systems is a copyright of Springer, 2017.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</citedby><cites>FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</cites><orcidid>0000-0001-6887-1994</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/1937444984/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/1937444984?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74895</link.rule.ids></links><search><creatorcontrib>Yao, Jin-ge</creatorcontrib><creatorcontrib>Wan, Xiaojun</creatorcontrib><creatorcontrib>Xiao, Jianguo</creatorcontrib><title>Recent advances in document summarization</title><title>Knowledge and information systems</title><addtitle>Knowl Inf Syst</addtitle><description>The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.</description><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Database Management</subject><subject>Electronic documents</subject><subject>Information processing</subject><subject>Information Storage and Retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Information technology</subject><subject>IT in Business</subject><subject>Sentences</subject><subject>Summaries</subject><subject>Survey Paper</subject><issn>0219-1377</issn><issn>0219-3116</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp1kE9LxDAQxYMouK5-AG8LnjxEZ5o0bY6y6CosCKLnkH-VLjZdk1bQT29K9-DF0wzD772ZeYRcItwgQHWbEBBLClhRBF5QfkQWUKCkDFEcH3pkVXVKzlLaQQYF4oJcv3jrw7DS7ksH69OqDSvX27GbhmnsOh3bHz20fTgnJ43-SP7iUJfk7eH-df1It8-bp_XdllqGYqCIBiwTVhe2rBGs9rXxwNBJyaTgYIyXTeWs4SC8K3Why6zTaISotfOMLcnV7LuP_efo06B2_RhDXqlQsopzLmueKZwpG_uUom_UPrb52G-FoKZE1JyIyo-qKRE1aYpZkzIb3n384_yv6Bf9mmJb</recordid><startdate>20171101</startdate><enddate>20171101</enddate><creator>Yao, Jin-ge</creator><creator>Wan, Xiaojun</creator><creator>Xiao, Jianguo</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0001-6887-1994</orcidid></search><sort><creationdate>20171101</creationdate><title>Recent advances in document summarization</title><author>Yao, Jin-ge ; Wan, Xiaojun ; Xiao, Jianguo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Database Management</topic><topic>Electronic documents</topic><topic>Information processing</topic><topic>Information Storage and Retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Information technology</topic><topic>IT in Business</topic><topic>Sentences</topic><topic>Summaries</topic><topic>Survey Paper</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yao, Jin-ge</creatorcontrib><creatorcontrib>Wan, Xiaojun</creatorcontrib><creatorcontrib>Xiao, Jianguo</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Knowledge and information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yao, Jin-ge</au><au>Wan, Xiaojun</au><au>Xiao, Jianguo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recent advances in document summarization</atitle><jtitle>Knowledge and information systems</jtitle><stitle>Knowl Inf Syst</stitle><date>2017-11-01</date><risdate>2017</risdate><volume>53</volume><issue>2</issue><spage>297</spage><epage>336</epage><pages>297-336</pages><issn>0219-1377</issn><eissn>0219-3116</eissn><abstract>The task of automatic document summarization aims at generating short summaries for originally long documents. A good summary should cover the most important information of the original document or a cluster of documents, while being coherent, non-redundant and grammatically readable. Numerous approaches for automatic summarization have been developed to date. In this paper we give a self-contained, broad overview of recent progress made for document summarization within the last 5 years. Specifically, we emphasize on significant contributions made in recent years that represent the state-of-the-art of document summarization, including progress on modern sentence extraction approaches that improve concept coverage, information diversity and content coherence, as well as attempts from summarization frameworks that integrate sentence compression, and more abstractive systems that are able to produce completely new sentences. In addition, we review progress made for document summarization in domains, genres and applications that are different from traditional settings. We also point out some of the latest trends and highlight a few possible future directions.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s10115-017-1042-4</doi><tpages>40</tpages><orcidid>https://orcid.org/0000-0001-6887-1994</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0219-1377 |
ispartof | Knowledge and information systems, 2017-11, Vol.53 (2), p.297-336 |
issn | 0219-1377 0219-3116 |
language | eng |
recordid | cdi_proquest_journals_1937444984 |
source | ABI/INFORM Global; Springer Link |
subjects | Computer Science Data Mining and Knowledge Discovery Database Management Electronic documents Information processing Information Storage and Retrieval Information Systems and Communication Service Information Systems Applications (incl.Internet) Information technology IT in Business Sentences Summaries Survey Paper |
title | Recent advances in document summarization |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T15%3A06%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recent%20advances%20in%20document%20summarization&rft.jtitle=Knowledge%20and%20information%20systems&rft.au=Yao,%20Jin-ge&rft.date=2017-11-01&rft.volume=53&rft.issue=2&rft.spage=297&rft.epage=336&rft.pages=297-336&rft.issn=0219-1377&rft.eissn=0219-3116&rft_id=info:doi/10.1007/s10115-017-1042-4&rft_dat=%3Cproquest_cross%3E1937444984%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-11b0c36ca2c5810cae8be031d9939640bbe9f7dcb406ed5a2a5316a1b668ade33%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1937444984&rft_id=info:pmid/&rfr_iscdi=true |