A CLM-Based Method of Indoor Affordance Areas Classification for Service Robots

Authors: WU Peiliang; LI Ya'nan; YANG Fang; KONG Lingfu; HOU Zengguang
Keywords: service robot; SURF (speeded-up robust feature) extraction; codebookless model (CLM); affordance area classification
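Chinese journal title: JQRR
English journal title: Robot
Institutions: School of Information Science and Engineering, Yanshan University; State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences; The Key Laboratory for Computer Virtual Technology and System Integration of Hebei Province
Publication date: 2018-03-15
Publisher: 机器人 (Robot)
Year: 2018
Issue: v.40
Funding: National Natural Science Foundation of China (61305113); Natural Science Foundation of Hebei Province (F2016203358)
Language: Chinese
Pages: JQRR201802007
Page count: 7
CN: 02
ISSN: 21-1137/TP
Classification number: 62-68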

Abstract

A method for representing and modeling indoor affordance areas based on the codebookless model (CLM) is proposed, which avoids constructing a codebook. First, multi-scale SURF (speeded-up robust feature) descriptors are extracted from the grayscale image. Then, the image is divided into regular regions using the spatial pyramid method; by introducing Gaussian manifolds into the vector space, each region is represented by a single Gaussian model, and these models are combined into a Gaussian mixture model that represents the whole image. Finally, the Gaussian model of the image is used together with a modified SVM (support vector machine) classifier to classify indoor affordance areas. Experimental results on the Scene 15 dataset show that, compared with traditional codebook construction methods, the proposed method improves classification accuracy by about 20%, is robust to orientation changes and uneven illumination, and effectively enhances the ability of service robots to recognize indoor affordance areas.
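
The pipeline summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: random vectors stand in for SURF descriptors, each pyramid region is summarized by one Gaussian, and the Gaussian is mapped to a vector by embedding it as a symmetric positive definite matrix and taking a matrix logarithm (one common log-Euclidean choice, assumed here). The function names, the `levels` grid sizes, and the regularizer `eps` are hypothetical.

```python
import numpy as np

def gaussian_embedding(desc, eps=1e-3):
    """Map a set of descriptors of shape (n, d) to a vector via a single Gaussian."""
    d = desc.shape[1]
    if len(desc) == 0:
        # Empty region: fall back to a zero-mean Gaussian with tiny covariance.
        mu = np.zeros(d)
        cov = np.zeros((d, d))
    else:
        mu = desc.mean(axis=0)
        centered = desc - mu
        cov = centered.T @ centered / len(desc)
    cov = cov + eps * np.eye(d)  # regularize so the covariance is positive definite

    # Embed N(mu, cov) as a (d+1) x (d+1) symmetric positive definite matrix.
    P = np.empty((d + 1, d + 1))
    P[:d, :d] = cov + np.outer(mu, mu)
    P[:d, d] = mu
    P[d, :d] = mu
    P[d, d] = 1.0

    # Matrix logarithm via eigendecomposition (log-Euclidean mapping).
    w, V = np.linalg.eigh(P)
    logP = (V * np.log(np.maximum(w, 1e-12))) @ V.T

    # Keep the upper triangle; scale off-diagonal entries by sqrt(2)
    # so the vector norm equals the Frobenius norm of logP.
    iu = np.triu_indices(d + 1)
    scale = np.where(iu[0] == iu[1], 1.0, np.sqrt(2.0))
    return logP[iu] * scale

def pyramid_representation(xy, desc, levels=(1, 2)):
    """xy: (n, 2) positions normalized to [0, 1); desc: (n, d) local descriptors."""
    parts = []
    for g in levels:
        # Grid cell index of every descriptor at this pyramid level.
        cell = np.minimum((xy * g).astype(int), g - 1)
        for i in range(g):
            for j in range(g):
                mask = (cell[:, 0] == i) & (cell[:, 1] == j)
                parts.append(gaussian_embedding(desc[mask]))
    # Concatenate the per-region Gaussians into one image-level vector.
    return np.concatenate(parts)

rng = np.random.default_rng(0)
xy = rng.random((200, 2))      # synthetic keypoint positions
desc = rng.random((200, 8))    # synthetic stand-in for SURF descriptors
vec = pyramid_representation(xy, desc)
```

The resulting vector has a fixed length regardless of how many descriptors the image yields, so it could be passed to a standard SVM in place of a codebook histogram; the modified SVM mentioned in the abstract is not reproduced here.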

References

[1]Lazebnik S,Schmid C,Ponce J.Beyond bags of features:Spatial pyramid matching for recognizing natural scene categories[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition.Piscataway,USA:IEEE,2006:2169-2178.
[2]Azim T.Fisher kernels match deep models[J].Electronics Letters,2017,53(6):397-399.
[3]Bo C J,Lu H C,Wang D.Weighted generalized nearest neighbor for hyperspectral image classification[J].IEEE Access,2017,5:1496-1509.
[4]Song Y,Li Q,Huang H,et al.Low dimensional representation of Fisher vectors for microscopy image classification[J].IEEE Transactions on Medical Imaging,2017,36(8):1636-1649.
[5]Zhang C J,Xiao X,Pang J B,et al.Beyond visual word ambiguity:Weighted local feature encoding with governing region[J].Journal of Visual Communication and Image Representation,2014,25(6):1387-1398.
[6]Yang Y B,Zhu Q H,Mao X J,et al.Visual feature coding for image classification integrating dictionary structure[J].Pattern Recognition,2015,48(10):3067-3075.
[7]Zhou W G,Yang M,Li H Q,et al.Towards codebook-free:Scalable cascaded hashing for mobile image search[J].IEEE Transactions on Multimedia,2014,16(3):601-611.
[8]Grauman K,Darrell T.The pyramid match kernel:Discriminative classification with sets of image features[C]//IEEE International Conference on Computer Vision.Piscataway,USA:IEEE,2005:1458-1465.
[9]Li F F,Perona P.A Bayesian hierarchical model for learning natural scene categories[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition.Piscataway,USA:IEEE,2005:524-531.
[10]Bo L F,Sminchisescu C.Efficient match kernel between sets of features for visual recognition[C]//Advances in Neural Information Processing Systems 22.Red Hook,USA:Curran Associates Inc.,2009:135-143.
[11]Boiman O,Shechtman E,Irani M.In defense of nearest-neighbor based image classification[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition.Piscataway,USA:IEEE,2008:1992-1999.
[12]Peng Z L,Li Y,Cai Z Q,et al.Deep Boosting:Joint feature selection and analysis dictionary learning in hierarchy[J].Neurocomputing,2016,178(S1):36-45.
[13]Nakayama H,Harada T,Kuniyoshi Y.Global Gaussian approach for scene categorization using information geometry[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition.Piscataway,USA:IEEE,2010:2336-2343.
[14]Wang Q L,Li P H,Zhang L,et al.Towards effective codebookless model for image classification[J].Pattern Recognition,2016,59(S1):63-71.
[15]Na Y,Liao M M,Jung C.Super-speed up robust features image geometrical registration algorithm[J].IET Image Processing,2016,10(11):848-864.
[16]Li P H,Wang Q L,Zhang L.A novel earth mover’s distance methodology for image matching with Gaussian mixture models[C]//IEEE International Conference on Computer Vision.Piscataway,USA:IEEE,2013:1689-1696.
[17]Lovric M,Min-Oo M,Ruh E A.Multivariate normal distributions parametrized as a Riemannian symmetric space[J].Journal of Multivariate Analysis,2000,74(1):36-48.
[18]Amari S.Differential geometry of statistical models[M]//Lecture Notes in Statistics,vol.28.Berlin,Germany:Springer-Verlag,1985:11-65.
[19]Arsigny V,Fillard P,Pennec X,et al.Fast and simple calculus on tensors in the Log-Euclidean framework[C]//8th International Conference on Medical Image Computing and Computer-Assisted Intervention.Berlin,Germany:Springer-Verlag,2005:115-122.
[20]Pennec X.Probabilities and statistics on Riemannian manifolds:A geometric approach[R].Nice,France:INRIA,2004.
[21]Stein C.Lectures on the theory of estimation of many parameters[J].Journal of Mathematical Sciences,1986,34(1):1373-1403.
[22]Carreira J,Caseiro R,Batista J.Semantic segmentation with second-order pooling[C]//12th European Conference on Computer Vision.Berlin,Germany:Springer-Verlag,2012:430-443.
[23]Carreira J,Caseiro R,Batista J,et al.Free-form region description with second-order pooling[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(6):1177-1189.
