Chen Zhao
Chen Zhao
Home
News
Publications
Awards
Contact
Light
Dark
Automatic
Deep learning
Towards Automated Movie Trailer Generation
Movie trailers are an essential tool for promoting films and attracting audiences. However the process of creating trailers can be …
Dawit Mureja Argaw
,
Mattia Soldan
,
Alejandro Pardo
,
Chen Zhao
,
Fabian Caba Heilbron
,
Joon Son Chung
,
Bernard Ghanem
PDF
Cite
Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
Large pretrained models are increasingly crucial in modern computer vision tasks. These models are typically used in downstream tasks …
Chen Zhao
,
Shuming Liu
,
Karttikeya Mangalam
,
Guocheng Qian
,
Fatimah Zohra
,
Abdulmohsen Alghannam
,
Jitendra Malik
,
Bernard Ghanem
PDF
Cite
Code
Video
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Recently, temporal action detection (TAD) has seen significant performance improvement with end-to-end training. However, due to the …
Shuming Liu
,
Chen-Lin Zhang
,
Chen Zhao
,
Bernard Ghanem
PDF
Cite
Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content. Given …
Chen Zhao
,
Shuming Liu
,
Karttikeya Mangalam
,
Bernard Ghanem
PDF
Cite
Code
Video
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
With the recent advances in video and 3D understanding, novel 4D spatio-temporal methods fusing both concepts have emerged. Towards …
Jinjie Mai
,
Abdullah Hamdi
,
Silvio Giancola
,
Chen Zhao
,
Bernard Ghanem
PDF
Cite
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Recently, conditional diffusion models have gained popularity in numerous applications due to their exceptional generation ability. …
Jiwen Yu
,
Yinhuai Wang
,
Chen Zhao
,
Bernard Ghanem
,
Jian Zhang
PDF
Cite
A Unified Continual Learning Framework with General Parameter-Efficient Tuning
The ‘pre-training → downstream adaptation’ presents both new opportunities and challenges for Continual Learning (CL). …
Qiankun Gao
,
Chen Zhao
,
Yifan Sun
,
Teng Xi
,
Gang Zhang
,
Bernard Ghanem
,
Jian Zhang
PDF
Cite
Large-capacity and Flexible Video Steganography via Invertible Neural Network
Video steganography is the art of unobtrusively concealing secret data in a cover video and then recovering the secret data through a …
Chong Mou
,
Youmin Xu
,
Jiechong Song
,
Chen Zhao
,
Bernard Ghanem
,
Jian Zhang
PDF
Cite
Code
ETAD: Training Action Detection End to End on a Laptop
Untrimmed video understanding such as temporal action detection (TAD) often suffers from the pain of huge demand for computing …
Shuming Liu
,
Mengmeng Xu
,
Chen Zhao
,
Xu Zhao
,
Bernard Ghanem
PDF
Cite
Just a Glimpse: Rethinking Temporal Information for Video Continual Learning
Class-incremental learning is one of the most important settings for the study of Continual Learning, as it closely resembles …
Lama Alssum
,
Juan Leo ́n Alca ́zar
,
Merey Ramazanova
,
Chen Zhao
,
Bernard Ghanem
Cite
»
Cite
×