Thanks, but no. It's going to be robotic arm with gripper and camera. The rest can be either Raspberry Zero (for cam, control, with net or blutooth, something big for high level). Another option, not exclusive, NVidia Jetson Nano instead of Zero. It could be Raspberry Pi 3, but I don't want to do video processing on it. All this I have, just need to put them together. Adding AGX Orin will be a big thing. That's actually the goal. With video processing and LLM all in one mobile robot. As it's hobby R/D it will be configured and reconfigured many times. That's why I don't want to do low level by hand every time.
Another interesting option is Raspberry Pico * N + Tiny PC. For control and thinking. They can be connected via wifi or blutooth.
3D reconstruction, from stereo or mono camera, have both. Object detection, text reading. Ideally it should recognize the speaker, simple gestures. Take audio, feed speech to LLM, get the output. Being able to detect and move out of the way of walking humans. Most of it has been done, like 3d structure and localization from motion. There are reference implementations. I've done it before too.
It's sort of open ended project. Having LLM with vision on mobile robot with arm.. has a lot of applications. AGX Orin 64GB is capable of running serious models.
Another interesting option is Raspberry Pico * N + Tiny PC. For control and thinking. They can be connected via wifi or blutooth.