Vision Transformer-based Model for Human Action Recognition in Still Images

Authors

  • Divya Rani R Department of Computer Science, Kuvempu University, Shivamogga, Karnataka, India
  • Prabhakar C J Department of Computer Science, Kuvempu University, Shivamogga, Karnataka, India

Keywords:

Human action recognition, HAR, Vision transformer, ViT, Transformer encoder

Abstract

Human action recognition is a critical task in computer vision, enabling systems to understand and interpret human actions from images. Action recognition in still images presents a unique challenge, as traditional methods often rely on temporal information that is absent in static images. This study explores the advantages of Vision Transformers (ViTs) for recognizing human actions in still images, exploiting their ability to capture complex patterns and relationships in visual data. We propose a robust method that employs spatial attention to highlight features associated with various human poses and contexts, and that addresses challenges such as occlusion and varying poses through a self-attention mechanism that focuses on key pose and contextual cues. We conduct extensive experiments on two benchmark datasets, Stanford40 and PASCAL VOC 2012 Action; the proposed model achieves an accuracy of 97.4% on Stanford40 and 94.8% on PASCAL VOC 2012 Action. Experimental results demonstrate that the proposed method achieves state-of-the-art (SOTA) performance on both still-image datasets. The high accuracy suggests that the ViT model can generalize well across different action categories, even when the dataset includes variations in human posture, background, and scene complexity.
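
As a rough illustration of the self-attention idea the abstract refers to, the following NumPy sketch splits an image into patches, projects them to tokens, and applies one head of scaled dot-product self-attention. It is not the authors' model: the image size, patch size, embedding width, random weights, and the helper names patchify and self_attention are all assumptions chosen for illustration, and the class token, positional embeddings, multiple heads, and classification head of a full ViT are omitted.

```python
import numpy as np


def patchify(image, patch_size):
    """Split an (H, W, C) image into flattened, non-overlapping square patches."""
    h, w, c = image.shape
    assert h % patch_size == 0 and w % patch_size == 0, "image must tile evenly"
    gh, gw = h // patch_size, w // patch_size
    # (gh, p, gw, p, c) -> (gh, gw, p, p, c) -> (gh * gw, p * p * c)
    patches = image.reshape(gh, patch_size, gw, patch_size, c)
    patches = patches.transpose(0, 2, 1, 3, 4)
    return patches.reshape(gh * gw, patch_size * patch_size * c)


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over a token sequence."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ v, weights


# Hypothetical sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))          # stand-in for a still image
tokens = patchify(image, patch_size=8)   # 16 patches, 192 values each

d_model = 64
w_embed = rng.normal(scale=0.02, size=(tokens.shape[1], d_model))
embedded = tokens @ w_embed              # linear patch embedding

w_q, w_k, w_v = (rng.normal(scale=0.1, size=(d_model, d_model)) for _ in range(3))
out, attn = self_attention(embedded, w_q, w_k, w_v)
```

Each row of attn gives how strongly one patch attends to every other patch, which is the mechanism that lets a trained model weight pose-related and contextual regions of the image more heavily than the rest.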

Published

2024-09-12

How to Cite

Divya Rani R, & Prabhakar C J. (2024). Vision Transformer-based Model for Human Action Recognition in Still Images. Journal of Computational Analysis and Applications (JoCAAA), 33(08), 522–531. Retrieved from http://eudoxuspress.com/index.php/pub/article/view/1366

Section

Articles
