Hacker News new | past | comments | ask | show | jobs | submit login
Faster Autoregressive Transformers with Linear Attention (arxiv.org)
6 points by fofoz 35 days ago | hide | past | favorite



Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact

Search: