个性化文献订阅>期刊> OPTICAL ENGINEERING
 

Hybrid generative-discriminative human action recognition by combining spatiotemporal words with supervised topic models

  作者 Sun, H; Wang, C; Wang, BL  
  选自 期刊  OPTICAL ENGINEERING;  卷期  2011年50-2;  页码  27203-27203  
  关联知识点  
 

[摘要]We present a hybrid generative-discriminative learning method for human action recognition from video sequences. Our model combines a bag-of-words component with supervised latent topic models. A video sequence is represented as a collection of spatiotemporal words by extracting space-time interest points and describing these points using both shape and motion cues. The supervised latent Dirichlet allocation (sLDA) topic model, which employs discriminative learning using labeled data under a generative framework, is introduced to discover the latent topic structure that is most relevant to action categorization. The proposed algorithm retains most of the desirable properties of generative learning while increasing the classification performance though a discriminative setting. It has also been extended to exploit both labeled data and unlabeled data to learn human actions under a unified framework. We test our algorithm on three challenging data sets: the KTH human motion data set, the Weizmann human action data set, and a ballet data set. Our results are either comparable to or significantly better than previously published results on these data sets and reflect the promise of hybrid generative-discriminative learning approaches. (C) 2011 Society of Photo-Optical Instrumentation Engineers (SPIE). [DOI: 10.1117/1.3537969]

 
      被申请数(0)  
 

[全文传递流程]

一般上传文献全文的时限在1个工作日内