Large Scale Web Image Online Annotation by Learning Label Set Relevance
-
摘要: 传统的网络图像标注方法忽视了标签集整体相关性对标注结果的影响,导致标签集整体相关性缺乏和语义冗余. 为了解决上述问题,提出了一种基于标签集相关性学习的大规模网络图像在线语义标注方法. 给出了标签集对图像相关性和标签集内部相关性的概率估计算法,将上述约束形成一个优化问题,采用贪心搜索策略获取近似最优解,找到能合理地平衡上述因素的标签集,并针对大规模图像集和概念集进行了优化. 真实环境下大规模网络图像集上的测试表明,相比于目前的代表性网络图像标注方法,该方法获得的标签集能够更好的描述图像语义,性能提升明显.Abstract: Traditional web image annotation methods neglect the relevance of the assigned label set as a whole, resulting in the label relevance deficiency and redundancy. To solve the above problems, a novel web image annotation method by learning the label set relevance is proposed, which considers both the relevance of label set to image and the label set internal correlation. Measures that can estimate the above factors are designed, and both the constraints are formulated into a joint framework. Meanwhile, an effective greedy search algorithm is proposed for an approximate optimal label set, which reaches a reasonable trade-off between the relevance of label set to image and internal correlation, and makes the framework more applicable to the data set that contains the large scale concept and images. Experiments on real world web image data sets demonstrate the general applicability of our algorithm. In comparison to the state-of-the-art methods, the proposed approach yields better performance.
-
[1] Zhang D S, Islam M M, Lu G J. A review on automatic image annotation techniques. Pattern Recognition, 2012, 45(1): 346-362 [2] [2] Wang M, Ni B B, Hua X S, Chua T S. Assistive tagging: a survey of multimedia tagging with human-computer joint exploration. ACM Computing Surveys, 2012, 44(4): 1-24 [3] [3] Wang X J, Zhang L, Li X R, Ma W Y. Annotating images by mining image search results. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(11): 1919-1932 [4] [4] Liu D, Hua X C, Yang L J. Tag ranking. In: Proceedings of the 2009 International World Wide Web Conference. New York, USA: ACM, 2009. 351-360 [5] [5] Li X R, Snoek C G M, Worring M. Learning social tag relevance by neighbor voting. IEEE Transactions on Multimedia, 2009, 11(7): 1310-1322 [6] [6] Jin Y, Khan L, Prabhakaran B. Knowledge based image annotation refinement. Journal of Signal Processing Systems, 2010, 58(3): 387-406 [7] [7] Wang H, Huang H, Chris H Q D. Image annotation using bi-relational graph of images and semantic labels. In: Proceedings of the 2001 IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2011. 793-800 [8] [8] Yang Y, Wu F, Nie F P, Shen H T, Zhuang Y, Hauptmann A G. Web and personal image annotation by mining label correlation with relaxed visual graph embedding. IEEE Transactions on Image Processing, 2012, 21(3): 1339-1351 [9] [9] Chua T S, Tang J H, Hong R C, Li H J, Luo Z P, Zheng Y T. NUS-WIDE: A real-world web image database from National University of Singapore. In: Proceedings of the 2009 ACM Conference on Image and Video Retrieval. New York, USA: ACM, 2009. 1-9 [10] Huiskes M J, Lew M S. The MIR Flickr retrieval evaluation. In: Proceedings of the 2008 ACM International Conference on Multimedia Information Retrieval. New York, USA: ACM, 2008. 39-43
点击查看大图
计量
- 文章访问数: 1797
- HTML全文浏览量: 47
- PDF下载量: 1044
- 被引次数: 0