
Augmenting Multimodal Deep Learning with Attention Mechanisms to Recognize “Sludge” Videos From Short-Form Content
This study introduces a multimodal architecture that utilizes video embeddings and image transcripts to recognize "sludge" content from short-form videos.
Group Members
Mentor
Topics
AVP
Gallery






