Loading…
Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation
The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and gene...
Saved in:
Published in: | arXiv.org 2020-06 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Chen, Xingyuan Cai, Ping Peng, Jin Wang, Hongjun Dai, Xinyu Chen, Jiajun |
description | The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2387522935</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2387522935</sourcerecordid><originalsourceid>FETCH-proquest_journals_23875229353</originalsourceid><addsrcrecordid>eNqNi90KgjAYQEcQJOU7fNC1YN9a2qX9WF1n1zL0qya61Tajx8-gB-jqwOGcEQuQ80WULhEnLHSuieMYVwkKwQN2zupa6RtkkKvWk4WNdFSD0VDcCXbKVVZ1SktvLHgDp-5hzYvgoiuja-WV0bKFgt4eDqTJyq-ZsfFVto7CH6dsnu-L7TEa3mdPzpeN6e0wuhJ5mgjENRf8v-oDvo4_hA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2387522935</pqid></control><display><type>article</type><title>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</title><source>Publicly Available Content (ProQuest)</source><creator>Chen, Xingyuan ; Cai, Ping ; Peng, Jin ; Wang, Hongjun ; Dai, Xinyu ; Chen, Jiajun</creator><creatorcontrib>Chen, Xingyuan ; Cai, Ping ; Peng, Jin ; Wang, Hongjun ; Dai, Xinyu ; Chen, Jiajun</creatorcontrib><description>The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Discriminators ; Maximum likelihood estimation</subject><ispartof>arXiv.org, 2020-06</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2387522935?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Chen, Xingyuan</creatorcontrib><creatorcontrib>Cai, Ping</creatorcontrib><creatorcontrib>Peng, Jin</creatorcontrib><creatorcontrib>Wang, Hongjun</creatorcontrib><creatorcontrib>Dai, Xinyu</creatorcontrib><creatorcontrib>Chen, Jiajun</creatorcontrib><title>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</title><title>arXiv.org</title><description>The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.</description><subject>Discriminators</subject><subject>Maximum likelihood estimation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNi90KgjAYQEcQJOU7fNC1YN9a2qX9WF1n1zL0qya61Tajx8-gB-jqwOGcEQuQ80WULhEnLHSuieMYVwkKwQN2zupa6RtkkKvWk4WNdFSD0VDcCXbKVVZ1SktvLHgDp-5hzYvgoiuja-WV0bKFgt4eDqTJyq-ZsfFVto7CH6dsnu-L7TEa3mdPzpeN6e0wuhJ5mgjENRf8v-oDvo4_hA</recordid><startdate>20200622</startdate><enddate>20200622</enddate><creator>Chen, Xingyuan</creator><creator>Cai, Ping</creator><creator>Peng, Jin</creator><creator>Wang, Hongjun</creator><creator>Dai, Xinyu</creator><creator>Chen, Jiajun</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PIMPY</scope><scope>PKEHL</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200622</creationdate><title>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</title><author>Chen, Xingyuan ; Cai, Ping ; Peng, Jin ; Wang, Hongjun ; Dai, Xinyu ; Chen, Jiajun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_23875229353</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Discriminators</topic><topic>Maximum likelihood estimation</topic><toplevel>online_resources</toplevel><creatorcontrib>Chen, Xingyuan</creatorcontrib><creatorcontrib>Cai, Ping</creatorcontrib><creatorcontrib>Peng, Jin</creatorcontrib><creatorcontrib>Wang, Hongjun</creatorcontrib><creatorcontrib>Dai, Xinyu</creatorcontrib><creatorcontrib>Chen, Jiajun</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied & Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Xingyuan</au><au>Cai, Ping</au><au>Peng, Jin</au><au>Wang, Hongjun</au><au>Dai, Xinyu</au><au>Chen, Jiajun</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation</atitle><jtitle>arXiv.org</jtitle><date>2020-06-22</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator's parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2020-06 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2387522935 |
source | Publicly Available Content (ProQuest) |
subjects | Discriminators Maximum likelihood estimation |
title | Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-19T00%3A31%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Adding%20A%20Filter%20Based%20on%20The%20Discriminator%20to%20Improve%20Unconditional%20Text%20Generation&rft.jtitle=arXiv.org&rft.au=Chen,%20Xingyuan&rft.date=2020-06-22&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2387522935%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_23875229353%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2387522935&rft_id=info:pmid/&rfr_iscdi=true |