Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The models didn't exist when their training data was collected.

But... that's not really an excuse any more. Model vendors should understand now that the most natural thing in the world is for people to ask models directly about their own abilities and architecture.

I think models should have a final layer of fine-tuning or even system prompting to help them answer these kinds of questions in a useful way.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: