Qi Guo, Shujun Zhang*, Hui Li
CMES-Computer Modeling in Engineering & Sciences, Vol.134, No.3, pp. 1653-1670, 2023, DOI:10.32604/cmes.2022.021784
- 20 September 2022
Abstract Continuous sign language recognition (CSLR) is challenging due to the complexity of video background, hand gesture variability, and temporal modeling difficulties. This work proposes a CSLR method based on a spatial-temporal graph attention network to focus on essential features of video series. The method considers local details of sign language movements by taking the information on joints and bones as inputs and constructing a spatial-temporal graph to reflect inter-frame relevance and physical connections between nodes. The graph-based multi-head attention mechanism is utilized with adjacent matrix calculation for better local-feature exploration, and short-term motion correlation modeling More >
Graphic Abstract