
Instruction fine-tuning definitely increases the win rate in human-judged comparisons, but that just means it's better at generating a style humans like. Humans aren't always right.
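
For anyone unfamiliar with the metric: the "win rate" is just the fraction of pairwise comparisons where human raters preferred the fine-tuned model's output over the base model's. A minimal sketch in Python (the judgment data here is made up for illustration):

  judgments = [
      {"a": "finetuned", "b": "base", "preferred": "a"},
      {"a": "finetuned", "b": "base", "preferred": "b"},
      {"a": "finetuned", "b": "base", "preferred": "a"},
  ]

  # Count how often the rater's preferred side was the fine-tuned model.
  wins = sum(1 for j in judgments if j[j["preferred"]] == "finetuned")
  win_rate = wins / len(judgments)
  print(f"fine-tuned win rate: {win_rate:.0%}")  # -> 67%

Nothing in that number says the preferred outputs were more correct, only that raters liked them better.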

https://arxiv.org/pdf/2203.02155.pdf (page 56 of 68, Table 14): it looks like the only things the fine-tunes beat the base models at are HellaSwag and the benchmarks that use human evaluations. Otherwise the base GPT models are winning.

And just before that, on page 55, they discuss how the fine-tunes do cause performance regressions.




I completely agree that humans are not always right, but neither is the next token in a text sequence always "right", i.e. the thing that a raw LM is best at predicting.
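
To be concrete about what "best at" means here: the raw-LM objective is just next-token cross-entropy. A standard PyTorch sketch, with toy tensors standing in for a real model's logits:

  import torch
  import torch.nn.functional as F

  vocab_size = 100
  tokens = torch.randint(0, vocab_size, (1, 16))  # one sequence of 16 token ids

  # Stand-in for what a real LM would produce from tokens[:, :-1].
  logits = torch.randn(1, 15, vocab_size, requires_grad=True)

  # Each position is trained to predict the *next* token.
  loss = F.cross_entropy(
      logits.reshape(-1, vocab_size),  # predictions for positions 0..14
      tokens[:, 1:].reshape(-1),       # targets: the same tokens shifted by one
  )
  loss.backward()

A model that minimizes this is optimized to match the training distribution, not to be "right" in any task-level sense.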

I just object to the blanket statement that fine-tuning will make a model dumber. It mostly means the model is not as good at the original training task, but that really doesn't mean it is "dumber" by any reasonable definition.

The question then is what fine-tuning does to tasks that are neither the original training task nor the fine-tuning task. That will depend on the fine-tuning task. Instruction fine-tuning seems to improve performance on tasks that involve human interaction, so I have a hard time seeing it as the model becoming dumber. Other fine-tuning objectives, such as removing toxicity, may have a higher cost on unrelated tasks, so there one could say they made the model dumber.
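
One way to make "cost on unrelated tasks" measurable is to compare perplexity on held-out text from a task neither stage trained on, before and after fine-tuning. A sketch with Hugging Face transformers; "gpt2" is a real checkpoint, while the fine-tuned path is a hypothetical placeholder:

  import torch
  from transformers import AutoModelForCausalLM, AutoTokenizer

  def perplexity(model_name: str, text: str) -> float:
      tok = AutoTokenizer.from_pretrained(model_name)
      model = AutoModelForCausalLM.from_pretrained(model_name)
      model.eval()
      enc = tok(text, return_tensors="pt")
      with torch.no_grad():
          # Passing labels=input_ids makes the model return its own
          # next-token cross-entropy loss.
          out = model(**enc, labels=enc["input_ids"])
      return torch.exp(out.loss).item()

  held_out = "Some text from a task the fine-tune never saw."
  print(perplexity("gpt2", held_out))                  # base model
  # print(perplexity("./my-finetuned-gpt2", held_out)) # hypothetical checkpoint

If the fine-tuned model's perplexity is much worse on text like this, that's the kind of regression the paper calls an "alignment tax".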



