• English
    • Tiếng Việt
  • English 
    • English
    • Tiếng Việt
  • Login
View Item 
  •   VinSpace Home
  • The College of Engineering and Computer Science
  • Minh Do, PhD.
  • View Item
  •   VinSpace Home
  • The College of Engineering and Computer Science
  • Minh Do, PhD.
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Efficient human vision inspired action recognition using adaptive spatiotemporal sampling

Thumbnail
View/Open
Minh Do.pdf (3.715Mb)
Date
2022-07-14
Author
Mac, C. Khoi Nguyen
Do, N. Minh
Vo, P. Minh
Metadata
Show full item record
Abstract
Adaptive sampling that exploits the spatiotemporal redundancy in videos is critical for always-on action recognition on wearable devices with limited computing and battery resources. The commonly used fixed sampling strategy is not context-aware and may under-sample the visual content, and thus adversely impacts both computation efficiency and accuracy. Inspired by the concepts of foveal vision and pre-attentive processing from the human visual perception mechanism, we introduce a novel adaptive spatiotemporal sampling scheme for efficient action recognition. Our system pre-scans the global scene context at low-resolution and decides to skip or request high-resolution features at salient regions for further processing. We validate the system on EPIC-KITCHENS and UCF-101 datasets for action recognition, and show that our proposed approach can greatly speed up inference with a tolerable loss of accuracy compared with those from state-of-the-art baselines. Source code is available at https://github.com/knmac/adaptive_spatiotemporal.
URI
https://vinspace.edu.vn/handle/VIN/577
Collections
  • Minh Do, PhD. [7]

Contact Us | Send Feedback
 

 

Browse

All of VinSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister

Contact Us | Send Feedback