> Hard to generalize from before/after performance. While this is true, there ar...

		NitpickLawyer 7 days ago \| parent \| context \| favorite \| on: Canaries in the Coal Mine? Recent Employment Effec... > Hard to generalize from before/after performance. While this is true, there are ways to test (open models) on tasks created after the model was released. We see good numbers there as well, so something is generalising there.