1

R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning

Class-Incremental Learning(CIL) struggles with catastrophic forgetting when learning new knowledge, and Data-Free CIL (DFCIL) is even more challenging without access to the …

Qiankun gao

• Jul 1, 2022 • 1 min read

Deep Learning

End-to-End Active Speaker Detection

Recent advances in the Active Speaker Detection (ASD) problem build upon a two-stage process -- feature extraction and spatio-temporal context aggregation. In this paper, we …

Juan leon alcazar

• Jul 1, 2022 • 1 min read

Deep Learning

When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search

The key challenge in neural architecture search (NAS) is designing how to explore wisely in the huge search space. We propose a new NAS method called TNAS (NAS with trees), which …

Guocheng qian

• Jun 1, 2022 • 1 min read

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

The recent and increasing interest in video-language research has driven the development of large-scale datasets that enable data-intensive machine learning techniques. In …

Mattia soldan

• Mar 2, 2022 • 1 min read

Ego4D: Around the World in 3,000 Hours of Egocentric Video

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (house-hold, …

Chen Zhao

• Feb 2, 2022 • 1 min read

Deep Learning

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it …

Chen Zhao

• Jan 1, 2022 • 1 min read

Video Self‑Stitching Graph Network for Temporal Action Localization

Short actions are critical and challenging in the task of action localization. We target this problem and propose a video self-stitching graph network (VSGN), which enhances …

Chen Zhao

• Mar 30, 2021 • 1 min read

Deep Learning

ThumbNet: One Thumbnail Image Contains All You Need for Recognition

Tackle the problem of network compression and acceleration in a novel perspective: enabling inference on thumbnail images without compromising accuracy. Propose supervised image …

Chen Zhao

• Jul 29, 2020 • 1 min read

G‑TAD: Sub‑Graph Localization for Temporal Action Detection

Temporal action detection is a fundamental yet challenging task in video understanding. Video context is a critical cue to effectively detect actions, but current works mainly …

Mengmeng xu

• Feb 27, 2020 • 1 min read

Chen Zhao

• Aug 1, 2018 • 1 min read

No results found

1

R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning

End-to-End Active Speaker Detection

When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Ego4D: Around the World in 3,000 Hours of Egocentric Video

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Video Self‑Stitching Graph Network for Temporal Action Localization

ThumbNet: One Thumbnail Image Contains All You Need for Recognition

G‑TAD: Sub‑Graph Localization for Temporal Action Detection

BoostNet: A Structured Deep Recursive Network to Boost Image Deblocking