Research Question

Introduction

Define Meta-learning for LMs:

image.png

Define zero-shot, one-shot, few-shot:

image.png

Overall results:

A heuristic sense of the overall results can be seen in Figure 1.3, which aggregates the various tasks (though it should not be seen as a rigorous or meaningful benchmark in itself).

image.png

Approach

Our basic pre-training approach, including model, data, and training, is similar to the process described in GPT-2, with relatively straightforward scaling up of the model size, dataset size and diversity, and length of training.

Model and Architectures

image.png