A CLM-Based Method of Indoor Affordance Areas Classification for Service Robots
WU Peiliang1,2,3, LI Ya'nan1, YANG Fang1, KONG Lingfu1,3, HOU Zengguang2
1. School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China;
2. State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China;
3. The Key Laboratory for Computer Virtual Technology and System Integration of Hebei Province, Qinhuangdao 066004, China
Abstract：A representation and modeling method of indoor affordance areas based on CLM (codebookless model) is proposed to avoid using codebook. Firstly, multi-scale SURF (speeded-up robust feature) descriptors are extracted on grey-scale image. Then, the image is divided into some regular regions using the spatial pyramid method. By introducing Gaussian manifolds into vector space, each region is denoted as a single Gaussian model, and the mixed Gaussian model is combined to represent the whole image. Finally, the Gaussian model and the modified SVM (support vector machine) classifier are utilized to classify the indoor affordance areas. The experimental results on Scene 15 datasets show that the proposed method improves the classification accuracy by about 20% compared with the traditional codebook construction methods, is more robust to direction changes and uneven illumination, and effectively enhances the ability of service robots to cognize indoor affordance areas.
 Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2006: 2169-2178.
 Azim T. Fisher kernels match deep models[J]. Electronics Letters, 2017, 53(6): 397-399.
 Bo C J, Lu H C, Wang D. Weighted generalized nearest neighbor for hyperspectral image classification[J]. IEEE Access, 2017, 5: 1496-1509.
 Song Y, Li Q, Huang H, et al. Low dimensional representation of Fisher vectors for microscopy image classification[J]. IEEE Transactions on Medical Imaging, 2017, 36(8):1636-1649.
 Zhang C J, Xiao X, Pang J B, et al. Beyond visual word ambiguity: Weighted local feature encoding with governing region[J]. Journal of Visual Communication and Image Representation, 2014, 25(6): 1387-1398.
 Yang Y B, Zhu Q H, Mao X J, et al. Visual feature coding for image classification integrating dictionary structure[J]. Pattern Recognition, 2015, 48(10): 3067-3075.
 Zhou W G, Yang M, Li H Q, et al. Towards codebook-free: Scalable cascaded hashing for mobile image search[J]. IEEE Transactions on Multimedia, 2014, 16(3): 601-611.
 Grauman K, Darrell T. The pyramid match kernel: Discriminative classification with sets of image features[C]//IEEE International Conference on Computer Vision. Piscataway, USA: IEEE, 2005: 1458-1465.
 Li F F, Perona P. A Bayesian hierarchical model for learning natural scene categories[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2005: 524-531.
 Bo L F, Sminchisescu C. Efficient match kernel between sets of features for visual recognition[C]//Advances in Neural Information Processing Systems 22. Red Hook, USA: Curran Associates Inc., 2009: 135-143.
 Boiman O, Shechtman E, Irani M. In defense of nearestneighbor based image classification[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2008: 1992-1999.
 Peng Z L, Li Y, Cai Z Q, et al. Deep Boosting: Joint feature selection and analysis dictionary learning in hierarchy[J]. Neurocomputing, 2016, 178(S1): 36-45.
 Nakayama H, Harada T, Kuniyoshi Y. Global Gaussian approach for scene categorization using information geometry[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway, USA: IEEE, 2010: 2336-2343.
 Wang Q L, Li P H, Zhang L, et al. Towards effective codebookless model for image classification[J]. Pattern Recognition, 2016, 59(S1): 63-71.
 Na Y, Liao M M, Jung C. Super-speed up robust features image geometrical registration algorithm[J]. IET Image Processing, 2016, 10(11): 848-864.
 Li P H, Wang Q L, Zhang L. A novel earth mover's distance methodology for image matching with Gaussian mixture models[C]//IEEE International Conference on Computer Vision. Piscataway, USA: IEEE, 2013: 1689-1696.
 Lovric M, Min-Oo M, Ruh E A. Multivariate normal distributions parametrized as a Riemannian symmetric space[J]. Journal of Multivariate Analysis, 2000, 74(1): 36-48.
 Amari S. Differential geometry of statistical models[M]//Lecture Notes in Statistics, vol.28. Berlin, Germany: Springer-Verlag, 1985: 11-65.
 Arsigny V, Fillard P, Pennec X, et al. Fast and simple calculus on tensors in the Log-Euclidean framework[C]//8th International Conference on Medical Image Computing and Computer-Assisted Intervention. Berlin, Germany: Springer-Verlag, 2005: 115-122.
 Pennec X. Probabilities and statistics on Riemannian manifolds: A geometric approach[R]. Nice, France: INRIA, 2004.
 Stein C. Lectures on the theory of estimation of many parameters[J]. Journal of Mathematical Sciences, 1986, 34(1): 1373-1403.
 Carreira J, Caseiro R, Batista J. Semantic segmentation with second-order pooling[C]//12th European Conference on Computer Vision. Berlin, Germany: Springer-Verlag, 2012: 430-443.
 Carreira J, Caseiro R, Batista J, et al. Free-form region description with second-order pooling[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(6): 1177-1189.