
Off-topic, sort of, but does anyone know if folks are working on combining vision and natural language in one model? I think that could yield some interesting results.




And here is a short guide and a link to a Google Colab notebook that anyone can use to create their own AI-powered art with VQGAN+CLIP: https://sourceful.us/doc/935/introduction-to-vqganclip


Yeah, there has definitely been work done in that space: it's called multi-modal modeling.

Not sure if this is the latest work, but here are some results from Google's AI Blog:

https://ai.googleblog.com/2017/06/multimodel-multi-task-mach...


What would be really cool is neural networks with routing, like circuit switching or packet switching. No idea how you would train such a beast, though.

Like imagine the vision part making a phone call to the natural language part to ask it for help with something.
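One trainable answer to "how would you train such a beast" is soft routing, as in mixture-of-experts layers: instead of a hard phone call, a learned gate mixes the outputs of several sub-networks, which keeps everything differentiable. Here's a minimal numpy sketch of that idea; all names and shapes (the two "experts" standing in for a vision part and a language part, `d_in`, `d_out`) are illustrative assumptions, not code from the thread.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

class TwoExpertRouter:
    """Toy soft-routing layer: a gate learns how much of each input
    to send to each of two sub-networks ("experts")."""

    def __init__(self, d_in, d_out):
        # Two experts, stand-ins for a vision part and a language part.
        self.w_vision = rng.normal(0, 0.1, (d_in, d_out))
        self.w_language = rng.normal(0, 0.1, (d_in, d_out))
        # Gating network: produces one score per expert, per input.
        self.w_gate = rng.normal(0, 0.1, (d_in, 2))

    def forward(self, x):
        gate = softmax(x @ self.w_gate)   # (batch, 2) routing weights, sum to 1
        out_v = x @ self.w_vision         # expert 1 output
        out_l = x @ self.w_language       # expert 2 output
        # Weighted mix is differentiable, so the router trains end-to-end.
        return gate[:, :1] * out_v + gate[:, 1:] * out_l

router = TwoExpertRouter(d_in=8, d_out=4)
y = router.forward(rng.normal(size=(3, 8)))
print(y.shape)  # (3, 4)
```

Because the gate output is a softmax rather than a hard switch, ordinary backpropagation trains both the experts and the router; hard, discrete routing would need tricks like straight-through estimators or reinforcement learning.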


Sounds like The Society of Mind - https://en.m.wikipedia.org/wiki/Society_of_Mind


Capsule networks have a routing algorithm ("routing by agreement"), as far as I know.
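For anyone curious, the routing-by-agreement step can be sketched in a few lines of numpy. This is a hedged illustration of just the routing iterations (not a full capsule layer); the shapes, the iteration count, and the helper names are my own assumptions.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-9):
    # Capsule nonlinearity: short vectors shrink toward 0,
    # long vectors approach (but never reach) unit length.
    norm2 = (s ** 2).sum(axis=axis, keepdims=True)
    return (norm2 / (1 + norm2)) * s / np.sqrt(norm2 + eps)

def routing_by_agreement(u_hat, iters=3):
    # u_hat: predictions from lower capsules, shape (n_lower, n_upper, dim)
    n_lower, n_upper, _ = u_hat.shape
    b = np.zeros((n_lower, n_upper))  # routing logits
    for _ in range(iters):
        # Coupling coefficients: each lower capsule distributes its
        # vote across upper capsules (softmax over the upper axis).
        c = np.exp(b) / np.exp(b).sum(axis=1, keepdims=True)
        s = (c[:, :, None] * u_hat).sum(axis=0)   # weighted sum per upper capsule
        v = squash(s)                             # upper capsule outputs
        # Agreement update: predictions that align with the output
        # get routed more strongly next iteration.
        b = b + (u_hat * v[None]).sum(axis=-1)
    return v

rng = np.random.default_rng(0)
v = routing_by_agreement(rng.normal(size=(6, 3, 4)))
print(v.shape)  # (3, 4)
```

The key point for the thread: the routing weights here are computed by an iterative agreement procedure at inference time, rather than being learned parameters like the gate in a mixture-of-experts layer.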



