LLMs

OSMO: Open-vocabulary Self-eMOtion Tracking featured image

Egocentric Vision

OSMO: Open-vocabulary Self-eMOtion Tracking

OSIRIS is an egocentric multimodal LLM for continuous, open-vocabulary human-state tracking from smart glasses.

Project Page Paper Poster Site

mohamed-abdelfattah

• Dec 8, 2025 • 1 min read

OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation featured image

Multimodal Learning

OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation

OSKAR is a self-supervised multimodal foundation model that learns in the latent space by predicting masked multimodal features.

Project Page Code Paper Poster Site

mohamed-abdelfattah

• Dec 1, 2025 • 1 min read