Improving Variational Autoencoder for Text Modelling with Timestep-Wise Regularisation

Ruizhe Li; Xiao Li; Guanyi Chen; Chenghua Lin

Improving Variational Autoencoder for Text Modelling with Timestep-Wise Regularisation

Ruizhe Li, Xiao Li, Guanyi Chen, Chenghua Lin

Research output: Contribution to conference › Oral Presentation/ Invited Talk

6 Downloads (Pure)

Abstract

The Variational Autoencoder (VAE) is a popular and powerful model applied to text modelling to generate diverse sentences. However, an issue known as posterior collapse (or KL loss vanishing) happens when the VAE is used in text modelling, where the approximate posterior collapses to the prior, and the model will totally ignore the latent variables and be degraded to a plain language model during text generation. Such an issue is particularly prevalent when RNN-based VAE models are employed for text modelling. In this paper, we propose a simple, generic architecture called Timestep-Wise Regularisation VAE (TWR-VAE), which can effectively avoid posterior collapse and can be applied to any RNN-based VAE models. The effectiveness and versatility of our model are demonstrated in different tasks, including language modelling and dialogue response generation.

Original language	English
Publication status	Published - 2 Nov 2020

Bibliographical note

Accepted by COLING 2020, final camera ready version

Keywords

cs.CL
cs.LG

Access to Document

2011.01136v2
This work is licensed under a Creative Commons Attribution 4.0 International Licence. Licence details: http://creativecommons. org/licenses/by/4.0/
Submitted manuscript, 475 KBLicence: CC BY

Cite this

@conference{326e3e01fe464f749fef4ebcceab1388,

title = "Improving Variational Autoencoder for Text Modelling with Timestep-Wise Regularisation",

abstract = "The Variational Autoencoder (VAE) is a popular and powerful model applied to text modelling to generate diverse sentences. However, an issue known as posterior collapse (or KL loss vanishing) happens when the VAE is used in text modelling, where the approximate posterior collapses to the prior, and the model will totally ignore the latent variables and be degraded to a plain language model during text generation. Such an issue is particularly prevalent when RNN-based VAE models are employed for text modelling. In this paper, we propose a simple, generic architecture called Timestep-Wise Regularisation VAE (TWR-VAE), which can effectively avoid posterior collapse and can be applied to any RNN-based VAE models. The effectiveness and versatility of our model are demonstrated in different tasks, including language modelling and dialogue response generation. ",

keywords = "cs.CL, cs.LG",

author = "Ruizhe Li and Xiao Li and Guanyi Chen and Chenghua Lin",

note = "Accepted by COLING 2020, final camera ready version",

year = "2020",

month = nov,

day = "2",

language = "English",

}

TY - CONF

T1 - Improving Variational Autoencoder for Text Modelling with Timestep-Wise Regularisation

AU - Li, Ruizhe

AU - Li, Xiao

AU - Chen, Guanyi

AU - Lin, Chenghua

N1 - Accepted by COLING 2020, final camera ready version

PY - 2020/11/2

Y1 - 2020/11/2

N2 - The Variational Autoencoder (VAE) is a popular and powerful model applied to text modelling to generate diverse sentences. However, an issue known as posterior collapse (or KL loss vanishing) happens when the VAE is used in text modelling, where the approximate posterior collapses to the prior, and the model will totally ignore the latent variables and be degraded to a plain language model during text generation. Such an issue is particularly prevalent when RNN-based VAE models are employed for text modelling. In this paper, we propose a simple, generic architecture called Timestep-Wise Regularisation VAE (TWR-VAE), which can effectively avoid posterior collapse and can be applied to any RNN-based VAE models. The effectiveness and versatility of our model are demonstrated in different tasks, including language modelling and dialogue response generation.

AB - The Variational Autoencoder (VAE) is a popular and powerful model applied to text modelling to generate diverse sentences. However, an issue known as posterior collapse (or KL loss vanishing) happens when the VAE is used in text modelling, where the approximate posterior collapses to the prior, and the model will totally ignore the latent variables and be degraded to a plain language model during text generation. Such an issue is particularly prevalent when RNN-based VAE models are employed for text modelling. In this paper, we propose a simple, generic architecture called Timestep-Wise Regularisation VAE (TWR-VAE), which can effectively avoid posterior collapse and can be applied to any RNN-based VAE models. The effectiveness and versatility of our model are demonstrated in different tasks, including language modelling and dialogue response generation.

KW - cs.CL

KW - cs.LG

M3 - Oral Presentation/ Invited Talk

ER -