Why would the model change over time when asking the same question? Just it's generation dataset for generating similar images? Or is this just tracking GPT's explicit model improvements over time?
The model being used is `gpt-4` but each result records the model that was used to generate the output. With incremental updates, the model will change automatically. This is to support incremental updates without needing anything from me. :)
The models seem to have been changing in the background, though as another commenter pointed out.... having a variance-calibrayion baseline for humans would be great too. :'))))