Representative Image Selection from Image Dataset
-
摘要: 针对传统图像检索系统通过关键字搜索图像时缺乏语义主题多样性的问题,提出了一种基于互近邻一致性和近邻传播的代表性图像选取算法,为每个查询选取与其相关的不同语义主题的图像集合. 该算法利用互近邻一致性调整图像间的相似度,再进行近邻传播(AP)聚类将图像集分为若干簇,最后通过簇排序选取代表性图像簇并从中选取中心图像为代表性图像. 实验表明,本文方法的性能超过基于K-means的方法和基于Greedy K-means的方法,所选图像能直观有效地概括源图像集的内容,并且在语义上多样化.Abstract: In a traditional image retrieval system, people search images using keywords. However, the result shows a lack of diversity in the sense of semantic theme. For the problem, we propose a viable method for representative image selection. We define representative images as those with diverse contents in the semantic meaning to cover different semantic forms of a query. First, we use mutual nearest neighbor consistency to adjust the similarity between images as the input to the AP clustering. Then we select representative clusters based on cluster ranking and finally take the images of the cluster center from representative clusters as a summary of the image dataset. The results showed that the performance of our method is better than the K-means based method and the greedy K-means based method. The selected images can summarize the content of the original image dataset intuitively and effectively, and they are diverse in semantic meaning as well.
-
[1] Wang M, Yang K Y, Hua X S, Zhang H J. Towards a relevant and diverse search of social images. IEEE Transactions on Multimedia, 2010, 12(8): 829-842 [2] Zha Z J, Yang L J, Mei T, Wang M, Wang Z F, Chua T S, Hua X S. Visual query suggestion: towards capturing user intent in internet image search. ACM Transactions on Multimedia Computing, Communications, and Applications, 2010, 6(3): 1-19 [3] Tang X O, Liu K, Cui J Y, Wen F, Wang X G. IntentSearch: capturing user intention for one-click internet image search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(7): 1342-1353 [4] Fan J P, Kem D A, Gao Y L, Luo H Z, Li Z M. JustClick: personalized image recommendation via exploratory search from large-scale flickr images. IEEE Transactions on Circuits and Systems for Video Technology, 2009, 19(2): 273-288 [5] Gao Y, Tang J H, Hong R, Dai Q H, Chua T S, Jain R. W2Go: A travel guidance system by automatic landmark ranking. In: Proceedings of the international conference on Multimedia. New York, USA: ACM, 2010. 123-132 [6] Hong R, Tang J H, Tan H K, Ngo C W, Yan S C, Chua T S. Beyond search: event driven summarization for web videos. ACM Transactions on Multimedia Computing, Communications, and Applications, 2011, 7(4): 1-21 [7] Yang L J, Hanjalic A. Supervised reranking for web image search. In: Proceedings of the international conference on Multimedia. New York, USA: ACM, 2010. 183-192 [8] Hama H, Zin T T, Tin P. A hybrid ranking of link and popularity for novel search engine. International Journal of Innovative Computing, Information and Control, 2009, 5(11): 4041-4049 [9] Jaffe A, Naaman M, Tassa T, Davis M. Generating summaries for large collections of geo-referenced photographs. In: Proceedings of the 15th international conference on World Wide Web. New York, USA: ACM, 2006, 853-854 [10] Simon I, Snavely N, Seitz S M. Scene summarization for online image collections. In: Proceedings of IEEE 11th International Conference on Computer Vision. Rio de Janeiro: IEEE, 2007, 1-8 [11] Kennedy L S, Naaman M. Generating diverse and representative image search results for landmarks. In: Proceeding of the 17th international conference on World Wide Web. New York, USA: ACM, 2008, 297-306 [12] Raguram R, Lazebnik S. Computing iconic summaries of general visual concepts. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Anchorage, AK: IEEE, 2008. 1-8 [13] Berg T L, Berg A C. Finding iconic images. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Miami, FL: IEEE, 2009, 1-8 [14] Berg T L, Forsyth D A. Automatic Ranking of Iconic Images, Technical report, EECS Department, University of California, Berkeley, 2007 [15] Chen Zeng, Hou Jin, Zhang Deng-Sheng, Zhang Hua-Zhong. Image re-ranking based on extraction of semantic regions. Acta Automatica Sinica, 2012, 37(11): 1356-1359 (陈曾, 侯进, 张登胜, 张华忠. 基于语义区域提取的图像重排. 自动化学报, 2012, 37(11): 1356-1359) [16] Frey B J, Dueck D. Clustering by passing messages between data points. Science, 2007, 315(5814): 972-976 [17] Ding C, He X F. K-nearest-neighbor consistency in data clustering: incorporating local information into global optimization. In: Proceedings of the 2004 ACM symposium on Applied computing. New York, USA: ACM, 2004, 584-589 [18] Harel J, Koch C, Perona P. Graph-based visual saliency. Advances in Neural Information Processing Systems, 2007, 19: 545-552 [19] Zhang Lin-Bo, Wang Chun-Heng, Xiao Bai-Hua, Shao Yun-Xue. Image representation using bag of phrases. Acta Automatica Sinica, 2012, 38(1): 46-54 (张琳波, 王春恒, 肖柏华, 邵允学. 基于Bag-of-phrases的图像表示方法. 自动化学报, 2012, 38(1): 46-54) [20] Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91-110 [21] Lin Hai-Feng, Ma Yu-Feng, Song Tao. Research on object tracking algorithm based on SIFT. Acta Automatica Sinica, 2010, 36(8): 1204-1208 (蔺海峰, 马宇峰, 宋涛. 基于SIFT特征目标跟踪算法研究. 自动化学报, 2010, 36(8): 1204-1208)
点击查看大图
计量
- 文章访问数: 1426
- HTML全文浏览量: 55
- PDF下载量: 936
- 被引次数: 0