access icon free Data driven visual tracking via representation learning and online multi-class LPBoost learning

Visual object tracking is a challenging task due to two intractable problems: visual appearance representation and online update model. Existing approaches often operate appearance model based on hand-crafted features with discriminative feature selection. The tracking learning model is formulated as a binary classification. However, some issues remain to be addressed. First, there does not exist sufficient information for online feature selection. Second, these algorithms do not make use of structure information between object and background. In this study, the authors propose an algorithm named data driven tracker with an appearance model which exploits prior visual target representation by binary PCANet. The authors’ speed up strategy by binary operation on the convolution filters is efficient for tracking task with little performance loss. They formulate the learning model as multi-class task via online LPBoost. Their data-driven tracking (DDT) algorithm performs favourably on various challenging sequences by evaluating against state-of-the-art trackers.

Inspec keywords: image classification; image filtering; learning (artificial intelligence); feature extraction; object tracking; image representation; feature selection

Other keywords: visual target representation; representation learning; data driven visual object tracking; online feature selection; online update model; binary classification; binary PCANet; tracking learning model; DDT algorithm; convolution filters; binary operation; multiclass task; visual appearance representation; structure information; online multiclass LPBoost learning

Subjects: Filtering methods in signal processing; Image recognition; Computer vision and image processing techniques; Knowledge engineering techniques

References

    1. 1)
      • 18. Zhang, K., Zhang, L., Liu, Q., et al: ‘Fast visual tracking via dense spatio-temporal context learning’. Proc. 13th European Conf. on Computer Vision (ECCV), Zurich, Switzerland, September 2014, pp. 127141.
    2. 2)
    3. 3)
      • 8. Lee, H., Grosse, R., Ranganath, R., et al: ‘Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations’. Proc. 26th Annual Int. Conf. on Machine Learning (ACM), Montreal, Canada, June 2009, pp. 609616.
    4. 4)
      • 21. Saffari, A., Godec, M., Pock, T., et al: ‘Online multi-class LPBoost’. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, June 2010, pp. 35703577.
    5. 5)
    6. 6)
      • 34. Adam, A., Rivlin, E., Shimshoni, I.: ‘Robust fragments-based tracking using the integral histogram’. Proc. IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, New York, NY, USA, June 2006, pp. 798805.
    7. 7)
    8. 8)
      • 3. Grabner, H., Grabner, M., Bischof, H.: ‘Real-time tracking via on-line boosting’. Proc. British Machine Vision Conf., Edinburgh, Scotland, September 2006, p. 6.
    9. 9)
      • 31. Bordes, A., Bottou, L., Gallinari, P., et al: ‘Solving multiclass support vector machines with LaRank’. Proc. 24th Int. Conf. on Machine Learning, Corvallis, OR, USA, June 2007, pp. 8996.
    10. 10)
      • 30. Wright, S., Nocedal, J.: ‘Numerical optimization’ (Springer, New York, NY, 1999), vol. 2.
    11. 11)
      • 10. Masnadi-Shirazi, H., Mahadevan, V., Vasconcelos, N.: ‘On the design of robust classifiers for computer vision’. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, June 2010, pp. 779786.
    12. 12)
      • 13. Babenko, B., Yang, M.-H., Belongie, S.: ‘Visual tracking with online multiple instance learning’. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, (CVPR), Miami, FL, USA, June 2009, pp. 983990.
    13. 13)
      • 24. Hare, S., Saffari, A., Torr, P.H.: ‘Efficient online structured output learning for keypoint-based object tracking’. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA, June 2012, pp. 18941901.
    14. 14)
    15. 15)
      • 20. Wang, N., Wang, J., Yeung, D.-Y.: ‘Online robust non-negative dictionary learning for visual tracking’. IEEE Int. Conf. on Computer Vision (ICCV), Sydney, Australia, September 2013, pp. 657664.
    16. 16)
      • 9. Wang, N., Yeung, D.-Y.: ‘Learning a deep compact image representation for visual tracking’, Presented at Neural Information Processing Systems, Lake Tahoe, CA, December 2013, pp. 809817.
    17. 17)
    18. 18)
      • 22. Everingham, M., Van Gool, L., Williams, C., et al: ‘The Pascal visual object classes challenge 2007 (VOC 2007) results’. 2008, http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html.
    19. 19)
      • 35. Wu, Y., Lim, J., Yang, M.-H.: ‘Online object tracking: a benchmark’. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Portland, OR, USA, June 2013.
    20. 20)
    21. 21)
      • 15. Henriques, J.F., Caseiro, R., Martins, P., et al: ‘Exploiting the circulant structure of tracking-by-detection with kernels’. Proc. 8th European Conf. on Computer Vision (ECCV), Florence, Italy, October 2012, pp. 702715.
    22. 22)
    23. 23)
      • 23. Chan, T.-H., Jia, K., Gao, S., et al: ‘PCANet: a simple deep learning baseline for image classification?arXiv preprint, arXiv:1404.3606, 2014.
    24. 24)
      • 17. Hong, S., Han, B.: ‘Visual tracking by sampling tree-structured graphical models’. Computer Vision – Proc. 13th European Conf. on Computer Vision (ECCV), Zurich, Switzerland, September 2014, pp. 116.
    25. 25)
      • 33. Kwon, J., Lee, K.M.: ‘Visual tracking decomposition’. Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, June 2010, pp. 12691276.
    26. 26)
      • 5. Li, H., Shen, C., Shi, Q.: ‘Real-time visual tracking using compressive sensing’. Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, June 2011, pp. 13051312.
    27. 27)
      • 11. Leistner, C., Saffari, A., Roth, P.M., et al: ‘On robustness of on-line boosting – a competitive study’. IEEE 12th Int. Conf. on Computer Vision Workshops (ICCV Workshops), Kyoto, Japan, September 2009, pp. 13621369.
    28. 28)
    29. 29)
      • 19. Mei, X., Ling, H.: ‘Robust visual tracking using ℓ1 minimization’. 12th IEEE Int. Conf. on Computer Vision, Kyoto, Japan, September 2009, pp. 14361443.
    30. 30)
      • 16. Hare, S., Saffari, A., Torr, P.H.: ‘Struck: Structured output tracking with kernels’. IEEE Int. Conf. on Computer Vision (ICCV), Barcelona, Spain, November 2011, pp. 263270.
    31. 31)
    32. 32)
      • 32. Zhang, K., Zhang, L., Yang, M.-H.: ‘Real-time compressive tracking’. Proc. 8th European Conf. on Computer Vision (ECCV), Florence, Italy, October 2012, pp. 864877.
    33. 33)
    34. 34)
      • 12. Grabner, H., Leistner, C., Bischof, H.: ‘Semi-supervised on-line boosting for robust tracking’. Proc. 10th European Conf. on Computer Vision (ECCV), Marseille, France, October 2008, pp. 234247.
    35. 35)
      • 28. Yu, Q., Dinh, T.B., Medioni, G.: ‘Online tracking and reacquisition using co-trained generative and discriminative trackers’. Proc. 10th European Conf. on Computer Vision (ECCV), Marseille, France, October 2008, pp. 678691.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cvi.2014.0388
Loading

Related content

content/journals/10.1049/iet-cvi.2014.0388
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading