OSMO: Open-vocabulary Self-eMOtion Tracking
OSIRIS is an egocentric multimodal LLM for continuous, open-vocabulary human-state tracking from smart glasses.
mohamed-abdelfattah
OSIRIS is an egocentric multimodal LLM for continuous, open-vocabulary human-state tracking from smart glasses.
We take a step towards computer-aided waste detection and present the first in-the-wild industrial-grade waste detection and segmentation dataset, ZeroWaste.
This paper introduces ArtELingo, a new benchmark and dataset, designed to encourage work on diversity across languages and cultures.
A computer-vision pipeline using conventional 2D sleep-lab cameras to automatically detect iRBD from REM movement dynamics with up to 91.9% accuracy.