Image Classification Based on Non-negative Locality-constrained Linear Coding
-
摘要: 基于特征提取的图像分类算法的核心问题是如何对特征进行有效编码. 局部约束线性编码(Locality-constrained linear coding, LLC) 因其良好的特征重构性与局部平滑稀疏性, 已取得了很好的分类性能. 然而, LLC编码的分类性能对编码过程中的近邻数k的大小比较敏感, 随着k的增大, 编码中的某些负值元素与正值元素的差值绝对值也可能增大, 这使得LLC越来越不稳定. 本文通过在LLC优化模型的目标方程中引入非负约束, 提出了一种新型编码方式, 称为非负局部约束线性编码(Non-negative locality-constrained linear coding, NNLLC). 该模型一般采取迭代优化算法进行求解, 但其计算复杂度较大. 因此, 本文提出两种近似非负编码算法, 其编码速度与LLC一样快速. 实验结果表明, 在多个广泛使用的图像数据集上, 相比于LLC, NNLLC编码方式不仅在分类精确率上提高了近1%~4%, 而且对k的选取具有更强的鲁棒性.Abstract: The most important issue of image classification algorithm based on feature extraction is how to efficiently encode features. Locality-constrained linear coding (LLC) has achieved the state-of-the-art performance on several benchmarks, due to its underlying properties of better construction and local smooth sparsity. However, the performance of LLC on image classification is sensitive to the number of neighbors, i.e., the value of k. With the increase of k, the absolute difference of some negative and positive elements may likely become larger and larger. This will make LLC more unstable. In this paper, a new coding scheme called non-negative locality-constrained linear coding (NNLLC) is proposed. It adds an extra non-negative constraint to the objective function of LLC. Generally, this new model can be solved by iterative optimization methods, however, such solutions are quite impractical due to high computational cost. Therefore, two fast approximation algorithms are proposed; more importantly, they and LLC have a similar computational complexity. To compare with LLC, the experiment results on several widely used image datasets demonstrate that NNLLC not only can improve the classification accuracy by nearly 1%~4%, but also is more robust on the selection of k.
-
[1] Sivic J, Zisserman A. Video google: a text retrieval approach to object matching in videos. In: Proceedings of the 9th IEEE International Conference. Nice, France: IEEE, 2003. 1470-1477 [2] Csurka G, Dance C R, Fan L X, Willamowski J, Bray C. Visual categorization with bags of keypoints. In: Proceedings of the 2004 ECCV International Workshop on Statistical Learning in Computer Vision. Grenoble, France: ECCV, 2004. 1-22 [3] Zhao Zhong-Qiu, Ji Hai-Feng, Gao Jun, Hu Dong-Hui, Wu Xin-Dong. Sparse coding based multi-scale spatial latent semantic analysis for image classification. Chinese Journal of Computers, 2014, 37(6): 1251-1260(赵仲秋, 季海峰, 高隽, 胡东辉, 吴信东. 基于稀疏编码多尺度空间潜在语义分析的图像分类. 计算机学报, 2014, 37(6): 1251-1260) [4] Li Qian-Qian. Image Classification Research of Improved Non-negative Sparse Coding [Master dissertation], Nanjing University of Science and Technology, China, 2014.(李钱钱. 基于改进的非负稀疏编码图像分类研究[硕士学位论文], 南京理工大学, 中国, 2014.) [5] Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2006. 2169-2178 [6] Yang J C, Yu K, Gong Y H, Huang T. Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL: IEEE, 2009. 1794-1801 [7] Wang J J, Yang J C, Yu K, Lv F J, Huang T S, Gong Y H. Locality-constrained linear coding for image classification. In: Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition. San Francisco, USA: IEEE, 2010. 3360-3367 [8] Yang J C, Wang J P, Huang T. Learning the sparse representation for classification. In: Proceedings of the 2011 IEEE International Conference on Multimedia and Expo (ICME). Barcelona, Spanish: IEEE, 2011. 1-6 [9] Yu K, Zhang T, Gong Y H. Nonlinear learning using local coordinate coding. In: Proceedings of the 2009 Advances in Neural Information Processing Systems. Vancouver, Canada: NIPS, 2009. 2223-2231 [10] Hoyer P O. Non-negative sparse coding. In: Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing. Falmouth, USA: IEEE, 2002. 557-565 [11] Zhang C J, Liu J, Tian Q, Xu C S, Lu H Q, Ma S D. Image classification by non-negative sparse coding, low-rank and sparse decomposition. In: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition. Colorado Springs, USA: IEEE, 2011. 1673-1680 [12] Lin T H, Kung H T. Stable and efficient representation learning with non-negativity constraints. In: Proceedings of the 31st International Conference on Machine Learning. Beijing, China: JMLR W&CP, 2014. 1323-1331 [13] Roweis S T, Saul L K. Nonlinear dimensionality reduction by locally linear embedding. Science, 2000, 290(5500): 2323-2326 [14] Goldfarb D, Idnani A. A numerically stable dual method for solving strictly convex quadratic programs. Mathematical Programming, 1983, 27(1): 1-33 [15] Fan R E, Chang K W, Hsieh C J, Wang X R, Lin C J. Liblinear: a library for large linear classification. The Journal of Machine Learning Research, 2008, 9: 1871-1874 [16] Li F F, Fergus R, Perona P. Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 2007, 106(1): 59-70 [17] Zhang H, Berg A C, Maire M, Malik J. SVM-KNN: discriminative nearest neighbor classification for visual category recognition. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2006. 2126-2136 [18] Boiman O, Shechtman E, Irani M. In defense of nearest-neighbor based image classification. In: Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. Anchorage, Alaska: IEEE, 2008. 1-8 [19] Balasubramanian K, Yu K, Lebanon G. Smooth sparse coding via marginal regression for learning sparse representations. In: Proceedings of the 30th International Conference on Machine Learning. Edinburgh, Scotland: ICML, 2012. 1326-1334 [20] van Gemert J C, Geusebroek J M, Veenman C J, Smeulders A W M. Kernel codebooks for scene categorization. In: Proceedings of the 10th European Conference on Computer Vision. Marseille, France: ECCV, 2008. 696-709
点击查看大图
计量
- 文章访问数: 2222
- HTML全文浏览量: 96
- PDF下载量: 1777
- 被引次数: 0