Mattia soldan

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

Temporal action detection (TAD) is a fundamental video understanding task that aims to identify human actions and localize their temporal boundaries in videos. Although this field …

Shuming liu

• Mar 2, 2025 • 1 min read

Deep Learning

Towards Automated Movie Trailer Generation

Movie trailers are an essential tool for promoting films and attracting audiences. However the process of creating trailers can be time-consuming and expensive. To streamline this …

Dawit mureja argaw

• Jun 4, 2024 • 1 min read

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

The recent and increasing interest in video-language research has driven the development of large-scale datasets that enable data-intensive machine learning techniques. In …

Mattia soldan

• Mar 2, 2022 • 1 min read

No results found

Mattia soldan

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

Towards Automated Movie Trailer Generation

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions