Augmenting Multimodal Deep Learning with Attention Mechanisms to Recognize “Sludge” Videos From Short-Form Content

This study introduces a multimodal architecture that utilizes video embeddings and image transcripts to recognize "sludge" content from short-form videos.

Group Members

Marc M. Olata

Alpha Romer N. Coma

Kristoffer Ian T. Sioson

Job Isaac M. Ong

Mentor

Dr. Beau Gray M. Habal

Topics

Computer Vision

AVP

Gallery