World Modelling

OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation featured image

OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation

OSKAR is a self-supervised multimodal foundation model that learns in the latent space by predicting masked multimodal features.

mohamed-abdelfattah