Multimodal Learning

OSMO: Open-vocabulary Self-eMOtion Tracking

OSMO introduces egocentric self-emotion tracking, contributing a large-scale dataset, a multi-task benchmark, and OSIRIS, a multimodal model that produces coherent emotion timelines.

mohamed-abdelfattah
OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation

OSKAR is a self-supervised multimodal foundation model that learns in latent space by predicting masked multimodal features.

mohamed-abdelfattah