Graph Neural Network

End-to-End Active Speaker Detection featured image

End-to-End Active Speaker Detection

Recent advances in the Active Speaker Detection (ASD) problem build upon a two-stage process -- feature extraction and spatio-temporal context aggregation. In this paper, we …

Juan leon alcazar
SegTAD: Precise Temporal Action Detection via Semantic Segmentation featured image

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it …

avatar
Chen Zhao