Hacker News
posharma on April 15, 2023 | on: What are transformer models and how do they work?
I found the blogs written by Jay Alammar to be much more informative and complete. It appears that companies are rehashing and compressing the same content to advertise their products.
quantisan on April 15, 2023
https://jalammar.github.io/visualizing-neural-machine-transl... and https://jalammar.github.io/illustrated-transformer/ for anyone looking
macromackie on April 15, 2023
I believe Jay actually works at Cohere now (although I’m a little surprised that the post doesn’t state that).