access icon free Query-dependent metric learning for adaptive, content-based image browsing and retrieval

Content-based image retrieval (CBIR) systems often incorporate a relevance feedback mechanism in which retrieval is adapted based on users identifying images as relevant or irrelevant. Such relevance decisions are often assumed to be category-based. However, forcing a user to decide upon category membership of an image, even when unfamiliar with a database and irrespective of context, is restrictive. An alternative is to obtain user feedback in the form of relative similarity judgments. The ability of a user to provide meaningful feedback depends on the interface that displays retrieved images and facilitates the feedback. Similarity-based 2D layouts provide context and can enable more efficient visual search. Motivated by these observations, this study describes and evaluates an interactive image browsing and retrieval approach based on relative similarity feedback obtained from 2D image layouts. It incorporates online maximal-margin learning to adapt the image similarity metric used to perform retrieval. A user starts a session by browsing a collection of images displayed in a 2D layout. He/she may choose a query image perceived to be similar to the envisioned target image. A set of images similar to the query are then returned. The user can then provide relational feedback and/or update the query image to obtain a new set of images. Algorithms for CBIR are often characterised empirically by simulating usage based on pre-defined, fixed category labels, deeming retrieved results as relevant if they share a category label with the query. In contrast, the purpose of the system in this study is to enable browsing and retrieval without predefined categories. Therefore evaluation is performed in a target-based setting by quantifying the efficiency with which target images are retrieved given initial queries.

Inspec keywords: learning (artificial intelligence); image processing; feedback; image retrieval

Other keywords: content-based image retrieval system; adaptive content-based image browsing; image retrieval; similarity-based 2D image layout; image identification; online maximal-margin learning; relevance feedback mechanism; relative similarity judgment; interactive image browsing; query-dependent metric learning; CBIR system

Subjects: Optical, image and video signal processing; Computer vision and image processing techniques; Knowledge engineering techniques

References

    1. 1)
      • 19. Faria, F.F., Veloso, A., Almeida, H.M., Valle, E., Torres, R.d.S., Goncalves, M.A., Meira, W.Jr.: ‘Learning to rank for content-based image retrieval’. Int. Conf. on Multimedia Information Retrieval, 2010.
    2. 2)
    3. 3)
    4. 4)
    5. 5)
      • 20. Huang, W., Chan, K.L., Li, H., Lim, J.H., Liu, J., Wong, T.Y.: ‘Content-based medical image retrieval with metric learning via rank correlation’. Int. Workshop on Machine Learning in Medical Imaging, 2010, pp. 1825.
    6. 6)
    7. 7)
      • 21. Wang, G., Forsyth, D., Hoiem, D.: ‘Comparative object similarity for improved recognition with few or no examples’. IEEE Conf. on Computer Vision and Pattern Recognition, 2010.
    8. 8)
      • 26. Han, J., McKenna, S.J., Wang, R.: ‘Learning query-dependent distance metrics for interactive image retrieval’. Int. Conf. on Computer Vision Systems, Liege, 2007.
    9. 9)
      • 23. Rodden, K.: ‘Evaluating Similarity-Based Visualisations as Interfaces for Image Browsing’, Ph.D. Thesis. University of Cambridge, 2001.
    10. 10)
    11. 11)
      • 18. Lee, J.E., Jin, R., Jain, A.K.: ‘Rank-based distance metric learning: An application to image retrieval’. IEEE Conf. on Computer Vision and Pattern Recognition, Anchorage, 2008.
    12. 12)
    13. 13)
      • 2. Smith, J.R., Chang, S.F.: ‘Automated binary texture feature sets for image retrieval’. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Atlanta, 1996.
    14. 14)
      • 15. Freund, A., Iyer, R., Schapire, R.E., Lozano-Perez, T.: ‘An efficient boosting algorithm for combining preferences’, J. Mach. Learn. Res., 2003, 4, pp. 933969.
    15. 15)
      • 22. Parikh, D., Grauman, K.: ‘Relative Attributes’. IEEE Int. Conf. on Computer Vision, 2011.
    16. 16)
      • 13. Joachims, T.: ‘Optimizing search engines using clickthrough data’. ACM SIGKDD Int. Conf. on Knowledge Discovery and DataMining, Alberta, Canada, 2002.
    17. 17)
    18. 18)
      • 17. Hu, Y., Li, M., Yu, N.: ‘Multiple-instance ranking: Learning to rank images for image retrieval’. IEEE Conf. on Computer Vision and Pattern Recognition, Anchorage, USA, 2008.
    19. 19)
    20. 20)
      • 4. Tong, S., Chang, E.: ‘Support vector machine active learning for image retrieval’. ACM Conf. on Multimedia, 2001.
    21. 21)
      • 16. Frome, A.: ‘Learning Distance Functions for Examplar-based Object Recognition’, Ph.D. thesis. UC Berkeley, 2007.
    22. 22)
      • 14. Schultz, M., Joachims, T.: ‘Learning a distance metric from relative comparisons’. Neural Information Processing Systems, Berlin, 2003.
    23. 23)
    24. 24)
      • 5. He, X., Ma, W.Y., Zhang, H.J.: ‘Learning an image manifold for retrieval’. ACM Int. Conf. on Multimedia, New York, 2004.
    25. 25)
    26. 26)
    27. 27)
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-ipr.2013.0514
Loading

Related content

content/journals/10.1049/iet-ipr.2013.0514
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading