Projects | Mohamed Abdelfattah | AI Research Scientist

Selected Projects

I enjoy making things. Here are a selection of projects that I have worked on over the years.

Egocentric Vision

OSMO: Open-vocabulary Self-eMOtion Tracking

OSIRIS is an egocentric multimodal LLM for continuous, open-vocabulary human-state tracking from smart glasses.

Project Page Paper Poster Site

Multimodal Learning

OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation

OSKAR is a self-supervised multimodal foundation model that learns in the latent space by predicting masked multimodal features.

Project Page Code Paper Poster Site

Action Recognition

MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning

MaskCLR improves the robustness of transformer-based action recognition methods against noisy and incomplete skeletons.

Project Page Paper Poster Site

Self-Supervised Learning

S-JEPA: Joint Embedding Predictive Architecture for Self-Supervised Skeletal Action Recognition

S-JEPA is an instantiation of JEPA for self-supervised skeletal action recognition.

Project Page Paper Poster Site

Datasets

ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes

We take a step towards computer-aided waste detection and present the first in-the-wild industrial-grade waste detection and segmentation dataset, ZeroWaste.

Project Page Code Dataset Paper

See all

No results found

OSMO: Open-vocabulary Self-eMOtion Tracking

OSKAR: Omnimodal Self-supervised Knowledge Abstraction and Representation

MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning

S-JEPA: Joint Embedding Predictive Architecture for Self-Supervised Skeletal Action Recognition

ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes