Combining Local and Global Information for Hair Shape Modeling
-
摘要: 头发在人体表观中具有重要作用,然而,因为缺少有效的形状模型,头发分割仍然是一个非常具有挑战性的问题. 本文提出了一种基于部件的模型,它对头发形状以及环境变化更加鲁棒. 该模型将局部与全局信息相结合以描述头发的形状.局部模型通过一系列算法构建,包括全局形状词表生成,词表分类器学习以及参数优化;而全局模型刻画不同的发型,采用支持向量机(Support vector machine,SVM)来学习,它为所有潜在的发型配置部件并确定势函数. 在消费者图片上的实验证明了本文算法在头发形状多变和复杂环境等条件下的准确性与有效性.Abstract: Hair plays an important role in human appearance. However, hair segmentation is still a challenging problem partially due to the lack of an effective model to handle its arbitrary shape variations. In this paper, we present a part-based model, which is robust to hair shape and environment variations. The model combines local and global information to describe the hair shape. The local model is learned by a series of algorithms, including global shape word vocabulary construction, shape word classifier learning and parameter optimization, while the global model which depicts different hair styles is learned using support vector machine (SVM) to configure parts and define potentials for all underlying hair shapes. Experiments performed on a set of consumer images show our algorithm's capability and robustness to handle hair shape variations and complex environments.
-
[1] Sinha P, Poggio T. United we stand: the role of head structure in face recognition. Perception, 2002, 31(1): 131-133 [2] Kumar N, Belhumeur P N, Nayar S K. Facetracer: a search engine for large collections of images with faces. In: Proceedings of the 10th European Conference on Computer Vision. Marseille, France: Springer-Verlag, 2008. 340-353 [3] Winn J, Shotton J. The layout consistent random field for recognizing and segmenting partially occluded objects. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2006. 37-44 [4] Kumar M P, Torr P H S, Zisserman A. OBJCUT: efficient segmentation using top-down and bottom-up cues. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(3): 530-545 [5] Andriluka M, Roth S, Schiele B. Pictorial structures revisited: people detection and articulated pose estimation. In: Proceedings of the 2009 Computer Society Conference on Computer Vision and Pattern Recognition. Miami, Florida, USA: IEEE, 2009. 1014-1021 [6] Borenstein E, Ullman S. Combined top-down/bottom-up segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(12): 2019-2125 [7] Wang N, Ai H Z, Lao S H. A compositional exemplar-based model for hair segmentation. In: Proceedings of the 10th Asian Conference on Computer Vision. Queenstown, New Zealand: Springer-Verlag, 2010. 171-184 [8] Wang N, Ai H Z, Tang F. What are good parts for hair shape modeling? In: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition. Providence, RI, USA: IEEE, 2012. 662-669 [9] Yacoob Y, Davis L S. Detection and analysis of hair. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28(7): 1164-1169 [10] Rousset C, Coulon P Y. Frequential and color analysis for hair mask segmentation. In: Proceedings of the 15th IEEE International Conference on Image Processing. San Diego, California, USA: IEEE, 2008. 2276-2279 [11] Lee K C, Anguelov D, Sumengen B, Gokturk S B. Markov random field models for hair and face segmentation. In: Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition. Amsterdam, The Netherlands: IEEE, 2008. 1-6 [12] Wang D, Shan S G, Zeng W, Zhang H M, Chen X L. A novel two-tier Bayesian based method for hair segmentation. In: Proceedings of the 16th International Conference on Image Processing. Cairo, Egypt: IEEE, 2009. 2401-2404 [13] Scheffler C, Odobez J M. Joint adaptive colour modelling and skin, hair and clothes segmentation using coherent probabilistic index maps. In: Proceedings of the 22nd British Machine Vision Conference. University of Dundee: BMVA Press, 2011. 53. 1-53. 11 [14] Borenstein E, Malik J. Shape guided object segmentation. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, NY, USA: IEEE, 2006. 969-976 [15] Felzenszwalb P, McAllester D, Ramanan D. A discriminatively trained, multiscale, deformable part model. In: Proceedings of the 2008 Computer Society Conference on Computer Vision and Pattern Recognition. Anchorage, Alaska, USA: IEEE, 2008. 1-8 [16] Levin A, Weiss Y. Learning to combine bottom-up and top-down segmentation. In: Proceedings of the 9th European Conference on Computer Vision. Graz, Austria: Springer, 2006. 581-594 [17] Zhu L, Chen Y H, Lin Y, Lin C X, Yuille A. Recursive segmentation and recognition templates for 2D parsing. In: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems. Vancouver, British Columbia, Canada: Curran Associates, Inc., 2008. 1985-1992 [18] Tsochantaridis I, Hofmann T, Joachims T, Altun Y. Support vector machine learning for interdependent and structured output spaces. In: Proceedings of the 21st International Conference on Machine Learning. New York, NY, USA: ACM, 2004. 104-111 [19] Geurts P, Ernst D, Wehenkel L. Extremely randomized trees. Machine Learning, 2006, 63(1): 3-42 [20] Shotton J, Winn J, Rother C, Criminisi A. Textonboost: joint appearance, shape and context modeling for multi-class object recognition and segmentation. In: Proceedings of the 9th European Conference on Computer Vision. Graz, Austria: Springer-Verlag, 2006. 1-15 [21] Shotton J, Johnson M, Cipolla R. Semantic texton forests for image categorization and segmentation. In: Proceedings of the 2008 Computer Society Conference on Computer Vision and Pattern Recognition. Anchorage, Alaska, USA: IEEE, 2008. 1-8 [22] Kolmogorov V. Convergent tree-reweighted message passing for energy minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006, 28(10): 1568-1583 [23] Chang C C, Lin C J. LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): Article No.27 1-27 [24] Huang G B, Ramesh M, Berg T, Learned-Miller E. Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst, USA, 2007
点击查看大图
计量
- 文章访问数: 1433
- HTML全文浏览量: 56
- PDF下载量: 724
- 被引次数: 0