Hacker News new | past | comments | ask | show | jobs | submit login

Stack more GPT-3s! Have GPT-ception via stacks of multi-headed GPT blocks. I'm sure softmax attention can be modeled as a few-shot text generation problem.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: