Dark Mode

Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

dkumar-22/Activity-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

7 Commits

Repository files navigation

Human-Activity-Recognition

A LSTM + CNN or LRCN model which recognises the action performed in a video. It is trained on the UFC-50 dataset.

Link To the UFC-50 Dataset: https://www.crcv.ucf.edu/data/UCF50.php
Google Colaboratory Link: https://colab.research.google.com/drive/1mftAh_YV_GhO0L8iA0M_G83agbbcUsKq?usp=sharing

Implementing the LRCN Approach

An approach known as Long-term Recurrent Convolutional Network (LRCN) was implemented, which combines CNN and LSTM layers in a single model. The Convolutional layers are used for spatial feature extraction from the frames, and the extracted spatial features are fed to LSTM layer(s) at each time-steps for temporal sequence modeling. This way the network learns spatiotemporal features directly in an end-to-end training, resulting in a robust model.

TimeDistributed wrapper layer is also used, which allows applying the same layer to every frame of the video independently. So it makes a layer (around which it is wrapped) capable of taking input of shape (no_of_frames, width, height, num_of_channels) if originally the layer's input shape was (width, height, num_of_channels) which is very beneficial as it allows to input the whole video into the model in a single shot.

About

A LSTM + CNN or LRCN model which recognises the action performed in a video.

Topics

Resources

Readme

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors