We’re releasing the Video Joint Embedding Predictive Architecture (V-JEPA) model, a crucial step in advancing machine intelligence with a more grounded understanding of the world.
With V-JEPA, we mask out a large portion of a video so the model is shown only a small amount of context. We then ask the predictor to fill in the blanks of what's missing, not in terms of the actual pixels, but as a more abstract description in the model's representation space.
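To make the idea concrete, here is a minimal sketch of masked prediction in representation space. The module names, tensor shapes, and masking ratio are illustrative assumptions, not the actual V-JEPA implementation; the point is only that the loss is computed on latents of the masked positions rather than on pixels.

```python
# Minimal sketch of latent-space masked prediction (illustrative, not V-JEPA's code).
import torch
import torch.nn as nn

class TinyEncoder(nn.Module):
    """Stand-in for the video encoder: maps patch tokens to latent vectors."""
    def __init__(self, dim=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, dim), nn.GELU(), nn.Linear(dim, dim))

    def forward(self, tokens):            # tokens: (batch, num_patches, dim)
        return self.net(tokens)

class TinyPredictor(nn.Module):
    """Stand-in for the predictor: fills in latents for the masked patches."""
    def __init__(self, dim=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, dim), nn.GELU(), nn.Linear(dim, dim))

    def forward(self, context_latents):
        return self.net(context_latents)

batch, num_patches, dim = 2, 16, 64
video_tokens = torch.randn(batch, num_patches, dim)    # pretend patchified video

# Hide most patches; only the small visible context goes to the context encoder.
mask = torch.rand(batch, num_patches) < 0.75            # True = masked out

context_encoder = TinyEncoder(dim)
target_encoder = TinyEncoder(dim)                        # typically an EMA copy; frozen here
predictor = TinyPredictor(dim)

with torch.no_grad():                                    # targets are latents, not pixels
    target_latents = target_encoder(video_tokens)

visible_tokens = video_tokens.masked_fill(mask.unsqueeze(-1), 0.0)
context_latents = context_encoder(visible_tokens)
predicted_latents = predictor(context_latents)

# The loss compares predicted and target latents only at the masked positions.
loss = nn.functional.l1_loss(predicted_latents[mask], target_latents[mask])
loss.backward()
print(f"latent prediction loss: {loss.item():.4f}")
```

Because the target is an abstract representation rather than raw pixels, the model is not penalized for failing to reproduce unpredictable low-level detail, which is the stated motivation for predicting in representation space.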
Hmm, it looks like it aims to do for videos what chatbot LLMs do for text, or what content-aware fill does for images. A useful tool, to be sure, but the link to AGI seems a bit tenuous to me.