Ask HN: unstructured data and unsupervised learning

p1esk · on July 10, 2020

If you like the way NLP progresses with all the recent large models, the next natural step is to apply the same idea to video - predict the next frame. Obviously, it's not the same, as we don't have a "vocabulary" of frames, so some novel approach is needed. I haven't actually looked into video prediction/generation literature, so I don't know what's happening in that field.

jonathanbesomi · on July 10, 2020

Interesting; thank you! Related to videos: recently Open AI released GPT-3, a transformer models trained this time on images ...

p1esk · on July 10, 2020

I just did a quick search, here are a few relevant papers: https://arxiv.org/abs/2006.10704 https://arxiv.org/abs/1903.00271 http://www.cs.columbia.edu/~vondrick/transformer.pdf