OSMO: Open-vocabulary Self-eMOtion Tracking
OSMO introduces egocentric self-emotion tracking with a large-scale dataset, a multi-task benchmark, and OSIRIS, a multimodal model for coherent emotion timelines.
mohamed-abdelfattah
OSMO introduces egocentric self-emotion tracking with a large-scale dataset, a multi-task benchmark, and OSIRIS, a multimodal model for coherent emotion timelines.
OSKAR is a self-supervised multimodal foundation model that learns in the latent space by predicting masked multimodal features.