Loading…

Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation

The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and gene...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2020-06
Main Authors: Chen, Xingyuan, Cai, Ping, Peng, Jin, Wang, Hongjun, Dai, Xinyu, Chen, Jiajun
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Chen, Xingyuan
Cai, Ping
Peng, Jin
Wang, Hongjun
Dai, Xinyu
Chen, Jiajun
description The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2387522935</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2387522935</sourcerecordid><originalsourceid>FETCH-proquest_journals_23875229353</originalsourceid><addsrcrecordid>eNqNi90KgjAYQEcQJOU7fNC1YN9a2qX9WF1n1zL0qya61Tajx8-gB-jqwOGcEQuQ80WULhEnLHSuieMYVwkKwQN2zupa6RtkkKvWk4WNdFSD0VDcCXbKVVZ1SktvLHgDp-5hzYvgoiuja-WV0bKFgt4eDqTJyq-ZsfFVto7CH6dsnu-L7TEa3mdPzpeN6e0wuhJ5mgjENRf8v-oDvo4_hA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2387522935</pqid></control><display><type>article</type><title>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</title><source>Publicly Available Content (ProQuest)</source><creator>Chen, Xingyuan ; Cai, Ping ; Peng, Jin ; Wang, Hongjun ; Dai, Xinyu ; Chen, Jiajun</creator><creatorcontrib>Chen, Xingyuan ; Cai, Ping ; Peng, Jin ; Wang, Hongjun ; Dai, Xinyu ; Chen, Jiajun</creatorcontrib><description>The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Discriminators ; Maximum likelihood estimation</subject><ispartof>arXiv.org, 2020-06</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2387522935?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Chen, Xingyuan</creatorcontrib><creatorcontrib>Cai, Ping</creatorcontrib><creatorcontrib>Peng, Jin</creatorcontrib><creatorcontrib>Wang, Hongjun</creatorcontrib><creatorcontrib>Dai, Xinyu</creatorcontrib><creatorcontrib>Chen, Jiajun</creatorcontrib><title>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</title><title>arXiv.org</title><description>The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.</description><subject>Discriminators</subject><subject>Maximum likelihood estimation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNi90KgjAYQEcQJOU7fNC1YN9a2qX9WF1n1zL0qya61Tajx8-gB-jqwOGcEQuQ80WULhEnLHSuieMYVwkKwQN2zupa6RtkkKvWk4WNdFSD0VDcCXbKVVZ1SktvLHgDp-5hzYvgoiuja-WV0bKFgt4eDqTJyq-ZsfFVto7CH6dsnu-L7TEa3mdPzpeN6e0wuhJ5mgjENRf8v-oDvo4_hA</recordid><startdate>20200622</startdate><enddate>20200622</enddate><creator>Chen, Xingyuan</creator><creator>Cai, Ping</creator><creator>Peng, Jin</creator><creator>Wang, Hongjun</creator><creator>Dai, Xinyu</creator><creator>Chen, Jiajun</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PIMPY</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200622</creationdate><title>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</title><author>Chen, Xingyuan ; Cai, Ping ; Peng, Jin ; Wang, Hongjun ; Dai, Xinyu ; Chen, Jiajun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_23875229353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Discriminators</topic><topic>Maximum likelihood estimation</topic><toplevel>online_resources</toplevel><creatorcontrib>Chen, Xingyuan</creatorcontrib><creatorcontrib>Cai, Ping</creatorcontrib><creatorcontrib>Peng, Jin</creatorcontrib><creatorcontrib>Wang, Hongjun</creatorcontrib><creatorcontrib>Dai, Xinyu</creatorcontrib><creatorcontrib>Chen, Jiajun</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied &amp; Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Xingyuan</au><au>Cai, Ping</au><au>Peng, Jin</au><au>Wang, Hongjun</au><au>Dai, Xinyu</au><au>Chen, Jiajun</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</atitle><jtitle>arXiv.org</jtitle><date>2020-06-22</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2020-06
issn 2331-8422
language eng
recordid cdi_proquest_journals_2387522935
source Publicly Available Content (ProQuest)
subjects Discriminators
Maximum likelihood estimation
title Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-19T00%3A31%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Adding%20A%20Filter%20Based%20on%20The%20Discriminator%20to%20Improve%20Unconditional%20Text%20Generation&rft.jtitle=arXiv.org&rft.au=Chen,%20Xingyuan&rft.date=2020-06-22&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2387522935%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_23875229353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2387522935&rft_id=info:pmid/&rfr_iscdi=true