Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
QuadmasterXLII
10 months ago
|
parent
|
context
|
favorite
| on:
“Emergent” abilities in LLMs actually develop grad...
The tokens are the structure over which the attention mechanism is permutation equivariant. This structure permeates the forward pass, its important at every layer and will be until we find something better than attention.
Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: