Loading…

Automatic Detection of Chinese Generated Essayss Based on Pre-trained BERT

The text generator based on the pre-trained language model has powerful generation ability., and essays generator based on pre-trained language model produces essays of high quality. Essays generators may be misused., for example by making some changes on the basis of the generated document., a essa...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Xingyuan, Jin, Peng, Jing, Siyuan, Xie, Chunming
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The text generator based on the pre-trained language model has powerful generation ability., and essays generator based on pre-trained language model produces essays of high quality. Essays generators may be misused., for example by making some changes on the basis of the generated document., a essays of acceptable quality can be obtained. Corresponding efficient detection methods need to be developed. First., we built a essays text generator based on GPT-2 with training data., and then developed a generative essays detector with the pretrained language model BERT using the generated data and real data. Experiments show that the detector is 88% accurate for random sampling and 92% accurate for top-k sampling. Therefore., the detector based on the pretrained language model performs well and is an effective essays generation detector.
ISSN:2693-2865
DOI:10.1109/ITAIC54216.2022.9836571