Post-training with Data Augmentation for Improving T5-Based Question Generator

Gyu Min Park, Seong Eun Hong, Choong Seon Hong, Seong Bae Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Post-training is known to be effective for boosting the performance of a pre-trained language model. However, in the task of question generation, question generators post-trained with a well-designed training objective show poor performance without sufficient training examples. To handle this problem, this paper proposes a novel post-training for question generation which adopts a data augmentation technique to increase the number of training examples as well as post-training objectives. As post-training objectives, this paper introduces a new training objective, wh-words deletion, in addition to the well-known question infilling. Moreover, this paper employs back-translation techniques to increase the number of instances for post-training. To prove the effectiveness of the proposed method, this paper applies the post-training strategies to T5, a large-scale pre-trained language model, on SQuAD-QG. The experimental results demonstrate that the proposed post-training is helpful for enhancing the performance of answer-aware question generation.

Original languageEnglish
Title of host publicationAdvances in Computer Science and Ubiquitous Computing - Proceedings of CUTE-CSA 2022
EditorsJi Su Park, Laurence T. Yang, Yi Pan, Yi Pan, Jong Hyuk Park
PublisherSpringer Science and Business Media Deutschland GmbH
Pages703-709
Number of pages7
ISBN (Print)9789819912513
DOIs
Publication statusPublished - 2023
Event14th International Conference on Computer Science and its Applications, CSA 2022 and the 16th KIPS International Conference on Ubiquitous Information Technologies and Applications, CUTE 2022 - Vientiane, Lao People's Democratic Republic
Duration: 19 Dec 202221 Dec 2022

Publication series

NameLecture Notes in Electrical Engineering
Volume1028 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference14th International Conference on Computer Science and its Applications, CSA 2022 and the 16th KIPS International Conference on Ubiquitous Information Technologies and Applications, CUTE 2022
Country/TerritoryLao People's Democratic Republic
CityVientiane
Period19/12/2221/12/22

Bibliographical note

Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Keywords

  • Deep learning
  • Natural language processing
  • Post-training
  • Question generation

Fingerprint

Dive into the research topics of 'Post-training with Data Augmentation for Improving T5-Based Question Generator'. Together they form a unique fingerprint.

Cite this