Hacker News new | past | comments | ask | show | jobs | submit login

In one word: locality. Images have it narratives do not.



In another one word: language. Music has some elements of language that images don't have.


Interestingly though the failures of image generation are often of the missing narrative kind: eg. windowless bedrooms, eyes/ears at wrong places on animals. It feels like that deep learning is good at local but fails at global coherence.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: