Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Imagen seems better at capturing details/nuance from the prompt, but subjectively the DALLE-2 images feel more “real” to me. Not sure why. Something about the lighting?


That feels about right. Imagen has a better text processing model, so it can tease apart the prompt, but DALLE has a rocking image part.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: