Mentors: Shivangi Singh, Tanvi Nerkar
Project Members: Dev Barbhaya, Ankit Yadav, Lochan Gupta, Utkarsh Agrawal, Vinamra Shrivastava
Abstract:
Using the UCF101 dataset, we implement high-quality action classification and video captioning for videos that can each consist of a few hundred frames. We review previous approaches and implement a convolutional network for online video understanding. The network architecture takes long-term content into account while still enabling fast per-video processing.
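The abstract does not say how frames are selected. One common way to cover long-term content while keeping per-video cost fixed is to split the video into equal-length segments and take one frame from each. The sketch below illustrates that idea only; the function name `sample_frame_indices`, the default of 16 segments, and the train/test behaviour are assumptions made for illustration, not the project's confirmed method.

```python
import random


def sample_frame_indices(num_frames, num_segments=16, train=False, seed=None):
    """Return one frame index per equal-length segment of the video.

    Hypothetical helper: the segment count and sampling rule are assumptions.
    With train=True a random frame is drawn inside each segment; otherwise
    the middle frame of each segment is used.
    """
    if num_frames <= 0 or num_segments <= 0:
        raise ValueError("num_frames and num_segments must be positive")

    rng = random.Random(seed)
    indices = []
    for s in range(num_segments):
        # Segment s covers frames [start, end); integer math avoids drift.
        start = (s * num_frames) // num_segments
        end = ((s + 1) * num_frames) // num_segments
        # Very short videos can yield empty segments; force at least one frame.
        end = max(end, start + 1)
        if train:
            idx = rng.randrange(start, end)
        else:
            idx = (start + end - 1) // 2
        indices.append(min(idx, num_frames - 1))
    return indices


if __name__ == "__main__":
    # A 300-frame clip reduced to 16 frames spread over its full length.
    print(sample_frame_indices(300))
```

With this scheme the number of frames passed to the network stays constant regardless of video length, so a clip of a few hundred frames costs about the same to process as a short one, while the selected frames still span the whole clip.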
Documentation: Link
Poster: