Dr<sup>2</sup>Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

Large pretrained models are increasingly crucial in modern computer vision tasks. These models are typically used in downstream tasks by end-to-end finetuning, which is highly …

Chen Zhao

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric …

Chen Zhao

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

Recently, temporal action detection (TAD) has seen significant performance improvement with end-to-end training. However, due to the memory bottleneck, only models with limited …

Shuming Liu
Trending

🎉 Easily create your own simple yet highly customizable blog

Take full control of your personal brand and privacy by migrating away from the big tech platforms!

Chen Zhao

🧠 Sharpen your thinking with a second brain

Create a personal knowledge base and share your knowledge with your peers.

Chen Zhao

📈 Communicate your results effectively with the best data visualizations

Use popular tools such as HuggingFace, Plotly, Mermaid, and data frames.

Chen Zhao

👩🏼‍🏫 Teach academic courses

Embed videos, podcasts, code, LaTeX math, and even test students!

Chen Zhao

✅ Manage your projects

Easily manage your projects: create ideation mind maps, Gantt charts, to-do lists, and more!

Chen Zhao

Re<sup>2</sup>TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

Temporal action localization (TAL) requires long-form reasoning to predict actions of various durations and complex content. Given limited GPU memory, training TAL end to end …

Chen Zhao

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

Recently, conditional diffusion models have gained popularity in numerous applications due to their exceptional generation ability. However, many existing methods are …

Jiwen Yu