Loading…

Sequence Generative Adversarial Network for Chinese Social Media Text Summarization

Although the sequence-to-sequence models have achieved state-of-the-art performance in many summarization datasets, there are still some problems in the processing of Chinese social media text, such as short sentences, lack of coherence and accuracy. These issues are caused by two factors: the princ...

Full description

Saved in:
Bibliographic Details
Main Authors: Yang, Wenchuan, Hua, Rui, Zhao, Qiuhan
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 4625
container_issue
container_start_page 4620
container_title
container_volume
creator Yang, Wenchuan
Hua, Rui
Zhao, Qiuhan
description Although the sequence-to-sequence models have achieved state-of-the-art performance in many summarization datasets, there are still some problems in the processing of Chinese social media text, such as short sentences, lack of coherence and accuracy. These issues are caused by two factors: the principle of the RNN-based sequence-to-sequence model is maximum likelihood estimation, which will lead to gradient vanishing or exploding when generating long summaries; the text in the Chinese social media is long and noisy, for which it is very difficult to generate high-quality summaries. To solve these issues, we apply a sequence generative adversarial network framework. The framework includes generator and discriminator, in which generator is used to generate summaries and discriminator is used to evaluate generated summaries. The softargmax layer is used as a connection layer to guarantee the co-training of generator and discriminator. Experiments are carried out on Large Scale Chinese Social Media Text Summarization Dataset. The length of the sentence, ROUGE score and artificial score of summary's quality are used to evaluate the generated summaries. The result shows that the sentences in the summaries generated by our model are longer and have higher accuracy.
doi_str_mv 10.1109/CAC48633.2019.8996751
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_8996751</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8996751</ieee_id><sourcerecordid>8996751</sourcerecordid><originalsourceid>FETCH-LOGICAL-i203t-32331e7f9e893caadeaf9bbc29bfeb2fdd9374b456a6f2dcc690b84b56512a423</originalsourceid><addsrcrecordid>eNotkE1OwzAUhA0SEqX0BAjJF0iw_RzHXlYRtEgFFinryo6fhaFNwEnLz-kJoqtZzHyj0RByzVnOOTM31bySWgHkgnGTa2NUWfATcsFLoblkRsIpmQildcYM6HMy6_tXxpgALgvJJqSu8WOPbYN0gS0mO8QD0rk_YOptinZLH3H47NIbDV2i1UtssUdad82f9YA-WrrGr4HW-91uzP-MfNdekrNgtz3Ojjolz3e362qZrZ4W99V8lUXBYMhAAHAsg0FtoLHWow3GuUYYF9CJ4L2BUjpZKKuC8E2jDHNaukIVXFgpYEqu_nsjIm7eUxwnfG-OH8AvsiBSQw</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Sequence Generative Adversarial Network for Chinese Social Media Text Summarization</title><source>IEEE Xplore All Conference Series</source><creator>Yang, Wenchuan ; Hua, Rui ; Zhao, Qiuhan</creator><creatorcontrib>Yang, Wenchuan ; Hua, Rui ; Zhao, Qiuhan</creatorcontrib><description>Although the sequence-to-sequence models have achieved state-of-the-art performance in many summarization datasets, there are still some problems in the processing of Chinese social media text, such as short sentences, lack of coherence and accuracy. These issues are caused by two factors: the principle of the RNN-based sequence-to-sequence model is maximum likelihood estimation, which will lead to gradient vanishing or exploding when generating long summaries; the text in the Chinese social media is long and noisy, for which it is very difficult to generate high-quality summaries. To solve these issues, we apply a sequence generative adversarial network framework. The framework includes generator and discriminator, in which generator is used to generate summaries and discriminator is used to evaluate generated summaries. The softargmax layer is used as a connection layer to guarantee the co-training of generator and discriminator. Experiments are carried out on Large Scale Chinese Social Media Text Summarization Dataset. The length of the sentence, ROUGE score and artificial score of summary's quality are used to evaluate the generated summaries. The result shows that the sentences in the summaries generated by our model are longer and have higher accuracy.</description><identifier>EISSN: 2688-0938</identifier><identifier>EISBN: 1728140943</identifier><identifier>EISBN: 9781728140940</identifier><identifier>DOI: 10.1109/CAC48633.2019.8996751</identifier><language>eng</language><publisher>IEEE</publisher><subject>Sequence Generative Adversarial Network ; Softargmax ; Text Summarization</subject><ispartof>2019 Chinese Automation Congress (CAC), 2019, p.4620-4625</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8996751$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,23930,23931,25140,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8996751$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Yang, Wenchuan</creatorcontrib><creatorcontrib>Hua, Rui</creatorcontrib><creatorcontrib>Zhao, Qiuhan</creatorcontrib><title>Sequence Generative Adversarial Network for Chinese Social Media Text Summarization</title><title>2019 Chinese Automation Congress (CAC)</title><addtitle>CAC</addtitle><description>Although the sequence-to-sequence models have achieved state-of-the-art performance in many summarization datasets, there are still some problems in the processing of Chinese social media text, such as short sentences, lack of coherence and accuracy. These issues are caused by two factors: the principle of the RNN-based sequence-to-sequence model is maximum likelihood estimation, which will lead to gradient vanishing or exploding when generating long summaries; the text in the Chinese social media is long and noisy, for which it is very difficult to generate high-quality summaries. To solve these issues, we apply a sequence generative adversarial network framework. The framework includes generator and discriminator, in which generator is used to generate summaries and discriminator is used to evaluate generated summaries. The softargmax layer is used as a connection layer to guarantee the co-training of generator and discriminator. Experiments are carried out on Large Scale Chinese Social Media Text Summarization Dataset. The length of the sentence, ROUGE score and artificial score of summary's quality are used to evaluate the generated summaries. The result shows that the sentences in the summaries generated by our model are longer and have higher accuracy.</description><subject>Sequence Generative Adversarial Network</subject><subject>Softargmax</subject><subject>Text Summarization</subject><issn>2688-0938</issn><isbn>1728140943</isbn><isbn>9781728140940</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2019</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotkE1OwzAUhA0SEqX0BAjJF0iw_RzHXlYRtEgFFinryo6fhaFNwEnLz-kJoqtZzHyj0RByzVnOOTM31bySWgHkgnGTa2NUWfATcsFLoblkRsIpmQildcYM6HMy6_tXxpgALgvJJqSu8WOPbYN0gS0mO8QD0rk_YOptinZLH3H47NIbDV2i1UtssUdad82f9YA-WrrGr4HW-91uzP-MfNdekrNgtz3Ojjolz3e362qZrZ4W99V8lUXBYMhAAHAsg0FtoLHWow3GuUYYF9CJ4L2BUjpZKKuC8E2jDHNaukIVXFgpYEqu_nsjIm7eUxwnfG-OH8AvsiBSQw</recordid><startdate>201911</startdate><enddate>201911</enddate><creator>Yang, Wenchuan</creator><creator>Hua, Rui</creator><creator>Zhao, Qiuhan</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201911</creationdate><title>Sequence Generative Adversarial Network for Chinese Social Media Text Summarization</title><author>Yang, Wenchuan ; Hua, Rui ; Zhao, Qiuhan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i203t-32331e7f9e893caadeaf9bbc29bfeb2fdd9374b456a6f2dcc690b84b56512a423</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Sequence Generative Adversarial Network</topic><topic>Softargmax</topic><topic>Text Summarization</topic><toplevel>online_resources</toplevel><creatorcontrib>Yang, Wenchuan</creatorcontrib><creatorcontrib>Hua, Rui</creatorcontrib><creatorcontrib>Zhao, Qiuhan</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Explore</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yang, Wenchuan</au><au>Hua, Rui</au><au>Zhao, Qiuhan</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Sequence Generative Adversarial Network for Chinese Social Media Text Summarization</atitle><btitle>2019 Chinese Automation Congress (CAC)</btitle><stitle>CAC</stitle><date>2019-11</date><risdate>2019</risdate><spage>4620</spage><epage>4625</epage><pages>4620-4625</pages><eissn>2688-0938</eissn><eisbn>1728140943</eisbn><eisbn>9781728140940</eisbn><abstract>Although the sequence-to-sequence models have achieved state-of-the-art performance in many summarization datasets, there are still some problems in the processing of Chinese social media text, such as short sentences, lack of coherence and accuracy. These issues are caused by two factors: the principle of the RNN-based sequence-to-sequence model is maximum likelihood estimation, which will lead to gradient vanishing or exploding when generating long summaries; the text in the Chinese social media is long and noisy, for which it is very difficult to generate high-quality summaries. To solve these issues, we apply a sequence generative adversarial network framework. The framework includes generator and discriminator, in which generator is used to generate summaries and discriminator is used to evaluate generated summaries. The softargmax layer is used as a connection layer to guarantee the co-training of generator and discriminator. Experiments are carried out on Large Scale Chinese Social Media Text Summarization Dataset. The length of the sentence, ROUGE score and artificial score of summary's quality are used to evaluate the generated summaries. The result shows that the sentences in the summaries generated by our model are longer and have higher accuracy.</abstract><pub>IEEE</pub><doi>10.1109/CAC48633.2019.8996751</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2688-0938
ispartof 2019 Chinese Automation Congress (CAC), 2019, p.4620-4625
issn 2688-0938
language eng
recordid cdi_ieee_primary_8996751
source IEEE Xplore All Conference Series
subjects Sequence Generative Adversarial Network
Softargmax
Text Summarization
title Sequence Generative Adversarial Network for Chinese Social Media Text Summarization
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T08%3A28%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Sequence%20Generative%20Adversarial%20Network%20for%20Chinese%20Social%20Media%20Text%20Summarization&rft.btitle=2019%20Chinese%20Automation%20Congress%20(CAC)&rft.au=Yang,%20Wenchuan&rft.date=2019-11&rft.spage=4620&rft.epage=4625&rft.pages=4620-4625&rft.eissn=2688-0938&rft_id=info:doi/10.1109/CAC48633.2019.8996751&rft.eisbn=1728140943&rft.eisbn_list=9781728140940&rft_dat=%3Cieee_CHZPO%3E8996751%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i203t-32331e7f9e893caadeaf9bbc29bfeb2fdd9374b456a6f2dcc690b84b56512a423%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=8996751&rfr_iscdi=true