Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

There's one shot that stood out to me, right at the end of the main video, where the robot puts a round belt on a pulley: https://youtu.be/4MvGnmmP3c0?si=f9dOIbgq58EUz-PW&t=163 . Of course there are probably many examples of this exact action in its training data, but it felt very intuitive in a way the shirt-folding and object-sorting tasks in these demos usually don't.

(Also there seems to be some kind of video auto-play/pause/scroll thing going on with the page? Whatever it is, it's broken.)




It felt extra fake - the cherry picked people lacking rudimentary mechanical skills, using the ~$50K set of Franka Emika arms vs their default 'budget' ALOHA 2 grippers, the sheer luck that helped the robots put the belt on instead of removing it from the pulley.

The trick was in that the belt was too tight for an average human to put on with brute force, and disabling the tensioner or using tricks would require better than average mechanical skills their specially chosen 'random humans' lacked.


Yeah, they went WAY over the top when they told the human to "make it look hard." A significant distraction from how impressive the robot actually is.


All while the robot video was at 3x speed to even keep up with the human


I slowed it down to 1/4 speed to check -- the autonomous video is sped up 3x, but the human video seems to be 1x. I say that because generally no one moves that slowly for a physical task, not just in the "problem solving" aspect, but also in the "getting a belt to the gears" aspect. So, it appears that the robot did a better job than the human, but I believe the human only spent 1/3 of the time in the clip. After stretching the belt, it was probably put on easily, and likely the human still completed the task in 2/3 of the time of the robot.

Reference video (saw your clip is robot-only, but the robot vs human video is more telling):

https://youtu.be/x-exzZ-CIUw?feature=shared&t=65


Earlier in the video, where it was going to fold a "fox", I was expecting a fox, but a fox face. I know I should have high expectations at this point, but was disappointed from the result given the prompt.


That stood out for me as well. But only because the humans seemed to be inept.


Oh no they trained too much on all the shopping channel videos, i knew that would be our downfall someday




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: