IJFANS International Journal of Food and Nutritional Sciences

ISSN PRINT 2319-1775 Online 2320-7876

Fusion of RGB and Skeletal Data Using Gated Features for Human Action Recognition

Main Article Content

Ch.Raghava Prasad
» doi: 10.48047/IJFANS/11/S6/001

Abstract

This paper offers a blended network of RGB, depth, and skeleton inputs fed into CNNs in both directions. In order to learn the combined temporal features of the action, CNNs are used to characterize the RGB and depth data, while LSTMs are used to encode the skeletal data in both directions. At last, the L2 distance metric is used to choose the probability distribution generated from the three inputs. Coupling the model with a mixed CNN BILSTM network and computing an L2 distance measure in place of score fusion improved performance to 94.73%. Finally, the proposed models were compared to both cutting-edge deep learning methods and classic machine learning models.

Article Details