Fig. 2 | Visual Computing for Industry, Biomedicine, and Art

From: Fused behavior recognition model based on attention mechanism

Fig. 2

Architecture of the ResNet34-3DRes18 network. N frames are obtained from the video by the sparse sampling strategy, and each frame is processed by the ResNet34 network to produce a feature map. The per-frame feature maps are stacked to obtain a temporal feature map, named Temporal FM. The Temporal FM is then processed by the 3DRes18 network to produce the final action recognition result.
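
The data flow in the caption can be sketched in plain Python. This is only an illustrative sketch, not the authors' implementation: the caption does not say how the sparse sampling picks frames, so the code assumes a segment-based scheme that takes the middle frame of each of N equal segments. The function names `sparse_sample_indices` and `temporal_fm_shape`, and the (C, T, H, W) layout for the stacked feature maps, are hypothetical choices made for this example.

```python
def sparse_sample_indices(num_frames: int, n_segments: int) -> list[int]:
    """Return one frame index per segment (assumed: the middle of each segment).

    The video is split into n_segments equal-length segments and the frame
    nearest the centre of each segment is selected.
    """
    if n_segments <= 0:
        raise ValueError("n_segments must be positive")
    if num_frames < n_segments:
        raise ValueError("video has fewer frames than requested segments")
    seg_len = num_frames / n_segments
    return [int(seg_len * i + seg_len / 2) for i in range(n_segments)]


def temporal_fm_shape(n_frames: int, channels: int, height: int, width: int) -> tuple[int, int, int, int]:
    """Shape of the stacked Temporal FM, assuming a (C, T, H, W) layout.

    Each of the n_frames 2D feature maps has shape (channels, height, width);
    stacking them along a new time axis gives the input for a 3D network.
    """
    return (channels, n_frames, height, width)


if __name__ == "__main__":
    # Hypothetical numbers: an 80-frame clip sampled down to N = 8 frames.
    indices = sparse_sample_indices(num_frames=80, n_segments=8)
    print("sampled frame indices:", indices)
    # Hypothetical per-frame feature map size of 512 x 7 x 7.
    print("Temporal FM shape:", temporal_fm_shape(len(indices), 512, 7, 7))
```

In a real pipeline the selected frames would pass through the 2D network before stacking, and the stacked tensor would be fed to the 3D network; the channel and spatial sizes above are placeholders, not values reported in the article.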
