tldr: using large expensive models to auto-label data to train small cheap models.
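As a rough sketch of that pipeline (the labeling function, task, and data below are hypothetical illustrations, not taken from the paper): a large model pseudo-labels raw text once, offline, and a small cheap classifier is then trained on those pseudo-labels and used at inference time instead.

```python
# Sketch of auto-labeling with a large model to train a small model.
# `label_with_large_model` is a hypothetical stand-in for a GPT-3-style API call;
# the "small cheap model" here is an ordinary TF-IDF + logistic-regression classifier.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

def label_with_large_model(text: str) -> str:
    # In the real setting, prompt the large model ("Is this review positive or negative?")
    # and parse its completion; here a keyword heuristic stands in so the sketch runs.
    return "positive" if "good" in text.lower() else "negative"

# Unlabeled corpus: cheap to collect, expensive to label by hand.
unlabeled_texts = [
    "The movie was good and the pacing was tight.",
    "Terrible plot, I walked out halfway through.",
    "Good acting, good script, good everything.",
    "A dull, lifeless two hours.",
]

# Step 1: the large, expensive model auto-labels the data (paid for once, offline).
pseudo_labels = [label_with_large_model(t) for t in unlabeled_texts]

# Step 2: train the small, cheap model on the pseudo-labeled data.
small_model = make_pipeline(TfidfVectorizer(), LogisticRegression())
small_model.fit(unlabeled_texts, pseudo_labels)

# Step 3: deploy the small model; the large model is no longer needed at inference time.
print(small_model.predict(["The soundtrack was good."]))
```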
(I find the 'mechanical turk' framing here to be much more confusing & misleading than clever or helpful; it also makes it harder to compare this to the considerable number of other papers on using language models to generate new datasets & do self-distillation.)
No. Transfer usually means reusing the same NN model (eg. GPT-3 checkpoints being retrained on Github and then called 'Codex'), or possibly some sort of distillation/sparsifying approach. This is about auto-generating training data, which may not even be meant to be used by a neural net at all.