Discussion about this post

User's avatar
Avik De's avatar

Nice article Ani, and thanks in particular for the careful evaluation of the strengths and weaknesses of JEPA. In terms of world models, I think that there is much there other than either of these that are also potentially waiting in the wings. Video models can’t see forces or occluded objects, and we have representational concepts like geometry and dynamics that could also help. I’m interested to see the future of this topic over the next year - it’s certainly been heating up!

No posts

Ready for more?