GeDi: A Powerful New Method for Controlling Language Models

We use smaller language models as generative classifiers to guide generation from larger language models. We show that this method can make generations friendlier, reduce bias and toxicity, and achieve zero-shot controllable generation of unseen topics.

22 Sep 2020 •