Where I think it's gonna start to get really scary, and much closer to "real AI" / AGI, is when they start augmenting and wiring together various differing forms of AI. GPT-4, no matter how impressive it might appear on the surface, is still "just" a large language model. Augment it with other types of learning models, and at some point you might just hit on the right combination for it to start some form of actual "reasoning" or "creativity".
As long as they're all still "special" single-purpose systems (an LLM processes and responds to language input, for example, while CV / computer vision models specialize in operating on visual or image inputs, etc.), that's all they'll ever be, no matter how good they get at pretending they're more.
This is happening. These projects are examples of feedback loops. While the models aren't acting directly, they are receiving feedback and iteratively improving. Constraining and optimizing using classical AI techniques is an obvious next step. I agree this is when the magic happens.
An LLM is a form of input, like a keyboard. If they put it in front of something like Siri and used it as an input layer instead of as the processor, then you could make Siri actually functional.
It's very good at understanding text, but by itself it can't think. Turning text into commands the assistant already knows is doable.
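To make the idea concrete, here's a minimal sketch of that "LLM as input layer" split: the model's only job is translating free text into a structured command, while a plain dispatcher does the actual work. Everything here (`parse_intent`, `COMMANDS`, `handle`) is hypothetical, and the LLM call is faked with keyword matching so the sketch runs standalone.

```python
def parse_intent(text: str) -> dict:
    """Stand-in for an LLM call that returns a structured command.
    A real system would prompt the model to emit JSON like
    {"command": "set_timer", "minutes": 5}."""
    lowered = text.lower()
    if "timer" in lowered:
        # crude keyword stand-in for the model's language understanding
        minutes = next((int(w) for w in lowered.split() if w.isdigit()), 1)
        return {"command": "set_timer", "minutes": minutes}
    if "weather" in lowered:
        return {"command": "get_weather"}
    return {"command": "unknown"}

# The assistant side: a fixed table of commands it actually knows how to run.
COMMANDS = {
    "set_timer": lambda args: f"Timer set for {args['minutes']} minutes",
    "get_weather": lambda args: "Fetching weather...",
}

def handle(text: str) -> str:
    intent = parse_intent(text)
    action = COMMANDS.get(intent["command"])
    return action(intent) if action else "Sorry, I can't do that."

print(handle("set a timer for 5 minutes"))  # Timer set for 5 minutes
print(handle("play some jazz"))             # Sorry, I can't do that.
```

The point is that the dispatcher stays a dumb, auditable lookup table; swapping the keyword stub for a real LLM only improves the language understanding, not what the assistant is allowed to do.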