Convolutional Network for Online Video Understanding

May 16, 2021

Mentors: Shivangi Singh, Tanvi Nerkar
Project Members: Dev Barbhaya, Ankit Yadav, Lochan Gupta, Utkarsh Agrawal, Vinamra Shrivastava


Using the UCF101 dataset, we implement action classification and video captioning for videos that can each consist of a few hundred frames. We review previous approaches and implement a convolutional network for online video understanding. The network architecture takes long-term content into account while still enabling fast per-video processing.
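
To illustrate how a network can cover long-term content in a video of a few hundred frames while keeping per-video processing fast, here is a minimal sketch of sparse segment-based frame sampling: the video is split into equal segments and one frame index is taken from each. This is an assumption about a common approach and not necessarily the sampling scheme used in this project; the function name `sample_segment_indices` and its parameters are hypothetical.

```python
import random


def sample_segment_indices(num_frames, num_segments, rng=None):
    """Pick one frame index from each of `num_segments` equal-length segments.

    Hypothetical helper for illustration only.
    If `rng` is None, the middle frame of each segment is chosen
    (deterministic); otherwise a random frame inside each segment is chosen.
    """
    if num_frames <= 0 or num_segments <= 0:
        raise ValueError("num_frames and num_segments must be positive")

    indices = []
    for s in range(num_segments):
        # Integer boundaries of segment s, as a half-open range [start, end).
        start = (s * num_frames) // num_segments
        end = ((s + 1) * num_frames) // num_segments
        if end <= start:
            # The video has fewer frames than segments, so reuse a frame.
            end = start + 1
        if rng is None:
            idx = (start + end - 1) // 2
        else:
            idx = rng.randrange(start, end)
        indices.append(idx)
    return indices


if __name__ == "__main__":
    # A 300-frame video reduced to 16 frames spread across its full length.
    print(sample_segment_indices(300, 16))
    # Random variant, e.g. for training-time augmentation.
    print(sample_segment_indices(300, 16, rng=random.Random(0)))
```

Because the number of sampled frames is fixed, the cost of processing a video does not grow with its length, while the samples still span the whole clip.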

Documentation: Link