Thesis Title (English): Variable-Length Abstractive Summarization using Two-stage Transformer-based Method
Advisor (English): Chung-Hsien Wu
Keywords (English): automatic summarization system; abstractive summarization; extractive summarization; text segmentation; variable-length summarization; Transformer; BERT; LSTM
With the rapid growth of available information, efficiently processing and using text-based resources has become an increasingly important challenge. An automatic summarization system can address this problem. Most summarization systems fall into two types: extractive and abstractive. Extractive methods form the summary by extracting segments of text from the document, whereas abstractive methods process the document and then generate new summary text. The former allows the user to specify the length of the summary, while the latter produces a more fluent and human-like summary.
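To make the length control of extractive methods concrete, the following is a minimal sketch of a frequency-based extractive summarizer. It is not the method used in this thesis: the word-frequency scoring, the function names `split_sentences` and `extractive_summary`, and the English punctuation-based sentence splitter are all assumptions chosen for illustration.

```python
import re
from collections import Counter


def split_sentences(text: str) -> list[str]:
    # Naive splitter: break on whitespace that follows ., ! or ?.
    # Illustrative only; it does not handle Chinese punctuation.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extractive_summary(text: str, num_sentences: int) -> str:
    """Return the num_sentences highest-scoring sentences in document order."""
    sentences = split_sentences(text)
    freq = Counter(re.findall(r"\w+", text.lower()))

    def score(sentence: str) -> float:
        tokens = re.findall(r"\w+", sentence.lower())
        # Average word frequency, so long sentences are not favored by length alone.
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    # Rank sentence indices by score, keep the top ones, then restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:num_sentences])
    return " ".join(sentences[i] for i in chosen)
```

The `num_sentences` argument is what gives the user direct control over summary length. The output is always made of sentences copied from the input, which is why such summaries can read less fluently than generated text.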
The main contribution of this thesis is a two-stage method for training a variable-length abstractive summarization model. It improves on previous models, which cannot achieve fluency and variable length at the same time. The model consists of a text segmentation module and three generation modules. The proposed text segmentation module, which uses BERT and a bidirectional LSTM, outperforms existing methods. The generation modules combine extractive and abstractive methods to produce near state-of-the-art headline summaries.
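One plausible reading of this architecture, offered only as an illustration, is that the segmentation module marks segment boundaries, each segment is summarized separately, and the number of segments determines the length of the output. The abstract does not specify how the three generation modules interact, so the sketch below assumes the boundary labels are already given and uses a hypothetical `summarize_segment` callable in place of the generation modules. The function names are likewise assumptions.

```python
from typing import Callable, Sequence


def split_at_boundaries(
    sentences: Sequence[str], is_boundary: Sequence[bool]
) -> list[list[str]]:
    """Group sentences into segments; is_boundary[i] marks the last sentence of a segment."""
    if len(sentences) != len(is_boundary):
        raise ValueError("need exactly one boundary label per sentence")
    segments: list[list[str]] = []
    current: list[str] = []
    for sentence, boundary in zip(sentences, is_boundary):
        current.append(sentence)
        if boundary:
            segments.append(current)
            current = []
    if current:
        # Trailing sentences without a closing boundary form the last segment.
        segments.append(current)
    return segments


def variable_length_summary(
    sentences: Sequence[str],
    is_boundary: Sequence[bool],
    summarize_segment: Callable[[list[str]], str],
) -> list[str]:
    # One summary per segment: more segments give a longer overall summary.
    # summarize_segment is a hypothetical stand-in for a generation module.
    return [summarize_segment(seg) for seg in split_at_boundaries(sentences, is_boundary)]
```

Under this assumption, a coarser segmentation gives a shorter summary and a finer one gives a longer summary, while each piece is still produced by a generative model.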
A new large-scale Chinese text segmentation dataset, ChWiki_181k, is introduced, and a BERT-based text segmentation model is proposed as the baseline model on it. LCSTS is adopted to train the summarization models, and the variable-length abstractive summarization system is trained with the two-stage method. The proposed system achieved up to 70% accuracy in human subjective evaluation, and the experimental results show that the proposed model can generate proper variable-length summaries.
