What’s the difference between predicting the next video frame vs predicting reality?
Currently videos look a lot like this. Until the models grasp physics and causality, and can model natural human behavior, etc., they will keep looking a lot like this. This understanding is a necessary condition for modeling what happens next in a given video. Put anothe…