Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free Automatic image annotation by a loosely joint non-negative matrix factorisation

Nowadays, the number of digital images has increased so that the management of this volume of data needs an efficient system for browsing, categorising and searching. Automatic image annotation is designed for assigning tags to images for more accurate retrieval. Non-negative matrix factorisation (NMF) is a traditional machine learning technique for decomposing a matrix into a set of basis and coefficients under the non-negative constraints. In this study, the authors propose a two-step algorithm for designing an automatic image annotation system that employs the NMF framework for its first step and a variant of K-nearest neighbourhood as its second step. In the first step, a new multimodal NMF algorithm is proposed to extract the latent factors which reflect the content of images. This is done by jointly factorising the visual and textual data feature matrices so that they have close representation, although not necessarily the same. In the second step, after mapping images to the latent factors space a few tags are predicted for the new images based on a weighted average of similar data. They evaluated the performance of the proposed method and compared it to the state-of-the-art literature. Comparison results demonstrate the effectiveness and potential of the proposed method in image annotation applications.

References

    1. 1)
    2. 2)
      • 8. Putthividhy, D., Attias, H.T., Nagarajan, S.S.: ‘Topic regression multi-modal latent Dirichlet allocation for image annotation’. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2010.
    3. 3)
      • 10. Zhang, R., Zhang, Z., Li, M., Ma, W.-Y., Zhang, H.-J.: ‘A probabilistic semantic model for image annotation and multimodal image retrieval’. Tenth IEEE Int. Conf. Computer Vision (ICCV), 2005.
    4. 4)
      • 26. Acar, E., Gurdeniz, G., Rasmussen, M.A., Rago, D., Dragsted, L.O., Bro, R.: ‘Coupled matrix factorization with sparse factors to identify potential biomarkers in metabolomics’. IEEE 12th Int. Conf. Data Mining Workshops (ICDMW), 2012.
    5. 5)
    6. 6)
      • 32. Manning, C.D., Raghavan, P., Schütze, H.: ‘Introduction to information retrieval’ (Cambridge university press Cambridge, 2008).
    7. 7)
      • 9. Wang, C., Blei, D., Li, F.-F.: ‘Simultaneous image classification and annotation’. IEEE Computer Vision and Pattern Recognition (CVPR), 2009.
    8. 8)
      • 15. Verma, Y., Jawahar, C.: ‘Exploring Svm for image annotation in presence of confusing labels’. Proc.Conf.24th British Machine Vision, 2013.
    9. 9)
    10. 10)
      • 40. Xiang, Y., Zhou, X., Chua, T.-S., Ngo, C.-W.: ‘A revisit of generative model for automatic image annotation using Markov random fields’. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2009.
    11. 11)
      • 37. Oliva, A.: ‘Gist of the scene’, Neurobiol. Attention, 2005, 696, (64), pp. 251258.
    12. 12)
    13. 13)
    14. 14)
    15. 15)
    16. 16)
      • 18. Savita, P., Patel, D., Sinhal, A.: ‘A neural network approach to improve the efficiency of image annotation’, Int. J. Eng. Res. Technol., 2013, 1, pp. 3541.
    17. 17)
    18. 18)
      • 38. Chen, M., Zheng, A., Weinberger, K.: ‘Fast image tagging’. Proc.30th Int. Conf. Machine Learning, 2013.
    19. 19)
      • 11. Lienhart, R., Romberg, S., Hörster, E.: ‘Multilayer plsa for multimodal image retrieval’. Proc. ACM Int. Conf. Image and Video Retrieval, 2009.
    20. 20)
      • 25. Akata, Z., Thurau, C., Bauckhage, C.: ‘Non-negative matrix factorization in multimodality data for segmentation and label prediction’. 16th Computer Vision Winter Workshop, 2011.
    21. 21)
      • 14. Kalayeh, M.M., Idrees, H., Shah, M.: ‘Nmf-Knn: image annotation using weighted multi-view non-negative matrix factorization’. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2014.
    22. 22)
      • 27. Yan, X., Guo, J., Liu, S., Cheng, X., Wang, Y.: ‘Learning topics in short texts by non-negative matrix factorization on term correlation matrix’. Proc. SIAM Int. Conf. Data Mining, 2013.
    23. 23)
    24. 24)
    25. 25)
      • 19. Guillaumin, M., Mensink, T., Verbeek, J., Schmid, C.: ‘Tagprop: discriminative metric learning in nearest neighbor models for image auto-annotation’. IEEE 12th Int. Conf. Computer Vision, 2009.
    26. 26)
      • 35. Tsai, C.-F.: ‘Bag-of-words representation in image annotation: a review’. ISRN Artificial Intelligence, 2012.
    27. 27)
      • 23. Caicedo, J.C., González, F.A.: ‘Multimodal fusion for image retrieval using matrix factorization’. Proc. Second ACM Int. Conf. Multimedia Retrieval, 2012.
    28. 28)
      • 39. Lu, Z., Peng, Y.: ‘Image annotation by semantic sparse recoding of visual content’. Proc. 20th ACM Int. Conf. Multimedia, ACM, 2012.
    29. 29)
      • 36. Lowe, D.G.: ‘Object recognition from local scale-invariant features’. Proc. Seventh IEEE Int. Conf. Computer Vision, 1999.
    30. 30)
      • 22. Driesen, J.: ‘Discovering words in speech using matrix factorization’. PhD thesis, KU Leuven, ESAT, 2012.
    31. 31)
      • 29. Caicedo, J.C., González, F.A.: ‘Online matrix factorization for multimodal image retrieval’. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, Springer, 2012.
    32. 32)
      • 33. Duygulu, P., Barnard, K., de Freitas, J.F., Forsyth, D.A.: ‘Object recognition as machine translation: learning a lexicon for a fixed image vocabulary’. Computer Vision, ECCV, Springer, 2002.
    33. 33)
      • 20. Verma, Y., Jawahar, C.: ‘Image annotation using metric learning in semantic neighbourhoods’. Computer Vision (ECCV), 2012.
    34. 34)
      • 31. Makadia, A., Pavlovic, V., Kumar, S.: ‘A new baseline for image annotation’. Computer Vision, ECCV, Springer, 2008.
    35. 35)
      • 34. Von Ahn, L., Dabbish, L.: ‘Labeling images with a computer game’. Proc. SIGCHI Conf. Human Factors in Computing Systems, ACM, 2004.
    36. 36)
    37. 37)
      • 21. BenAbdallah, J., Caicedo, J.C., Gonzalez, F.A., Nasraoui, O.: ‘Multimodal image annotation using non-negative matrix factorization’. IEEE/WIC/ACM Int. Conf. Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010.
    38. 38)
      • 1. ‘Instagram Statistics for 2014’, http://www.jennstrends.com/instagram-statistics-for-2014/, accessed September 2014.
    39. 39)
    40. 40)
    41. 41)
      • 24. Eweiwi, A., Cheema, M.S., Bauckhage, C.: ‘Discriminative joint non-negative matrix factorization for human action classification’, Pattern Recogn., 2013, 8142, pp. 6170.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cvi.2014.0413
Loading

Related content

content/journals/10.1049/iet-cvi.2014.0413
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address