Dr. Srijan Das presented "AAN: Attributes-Aware Network for Temporal Action Detection" at BMVC 2023 in Aberdeen
This paper explains how to utilize large-scale pre-trained vision language models (CLIP) for long-term action detection in videos.
Link to the paper: https://arxiv.org/abs/2309.00696