OSMO: Open-vocabulary Self-eMOtion Tracking
OSIRIS is an egocentric multimodal LLM for continuous, open-vocabulary human-state tracking from smart glasses.
mohamed-abdelfattah
OSIRIS is an egocentric multimodal LLM for continuous, open-vocabulary human-state tracking from smart glasses.
OSKAR is a self-supervised multimodal foundation model that learns in the latent space by predicting masked multimodal features.