Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free HPILN: a feature learning framework for cross-modality person re-identification

Most video surveillance systems use both RGB and infrared cameras, making it a vital technique to re-identify a person cross the RGB and infrared modalities. This task can be challenging due to both the cross-modality variations caused by heterogeneous images in RGB and infrared, and the intra-modality variations caused by the heterogeneous human poses, camera position, light brightness etc. To meet these challenges, a novel feature learning framework, hard pentaplet and identity loss network (HPILN), is proposed. In the framework existing single-modality re-identification models are modified to fit for the cross-modality scenario, following which specifically designed hard pentaplet loss and identity loss are used to increase the accuracy of the modified cross-modality re-identification models. Based on the benchmark of the SYSU-MM01 dataset, extensive experiments have been conducted, showing that the authors’ method outperforms all existing ones in terms of cumulative match characteristic curve and mean average precision.

References

    1. 1)
      • 2. Yu, Q., Chang, X., Song, Y.Z., et al: ‘The devil is in the middle: exploiting mid-level representations for cross-domain instance matching’. arXiv preprint arXiv:1711.08106, 2017.
    2. 2)
      • 6. Wang, G., Yuan, Y., Chen, X., et al: ‘Learning discriminative features with multiple granularities for person re-identification’. ACM Int. Conf. on Multimedia, Seoul, South Korea, 2018, pp. 274282.
    3. 3)
      • 39. Wen, Y., Zhang, K., Li, Z., et al: ‘A discriminative feature learning approach for deep face recognition’. Proc. European Conf. on Computer Vision, Amsterdam, Netherlands, 2016, pp. 499515.
    4. 4)
      • 19. Qian, X., Fu, Y., Jiang, Y.G., et al: ‘Multi-scale deep learning architectures for person re-identification’. Proc. 2017 IEEE Int. Conf. on Computer Vision (ICCV), Venice, Italy, 2017, pp. 54095418.
    5. 5)
      • 32. Chen, Y.C., Zhu, X., Zheng, W.S., et al: ‘Person re-identification by camera correlation aware feature augmentation’, IEEE Trans. Pattern Anal. Mach. Intell., 2018, 40, (2), pp. 392408.
    6. 6)
      • 24. Kniaz, V.V., Knyaz, V.A., Hladuvka, J., et al: ‘Thermalgan: multimodal color-to-thermal image translation for person re-identification in multispectral dataset’. Proc. European Conf. on Computer Vision (ECCV), Munich, Germany, 2018.
    7. 7)
      • 23. Chattopadhyay, P., Sural, S., Mukherjee, J.: ‘Information fusion from multiple cameras for gait-based re-identification and recognition’, IET Image Process., 2015, 9, (11), pp. 969976.
    8. 8)
      • 14. Schroff, F., Kalenichenko, D., Philbin, J.: ‘FaceNet: a unified embedding for face recognition and clustering’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Boston, USA, 2015, pp. 815823.
    9. 9)
      • 12. Farenzena, M., Bazzani, L., Perina, A., et al: ‘Person re-identification by symmetry-driven accumulation of local features’. Proc. IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, San Fransisco, USA, 2010, pp. 23602367.
    10. 10)
      • 11. Wang, Z., Wang, Z., Zheng, Y., et al: ‘Learning to reduce dual-level discrepancy for infrared-visible person re-identification’. Proc. IEEE Int. Conf. on Artificial Intelligence Organisation/IEEE Computer Society Conf. on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, 2019, pp. 618626.
    11. 11)
      • 15. Hermans, A., Beyer, L., Leibe, B.: ‘In defense of the triplet loss for person re-identification’. arXiv preprint arXiv:1703.07737, 2017.
    12. 12)
      • 13. Varior, R.R., Haloi, M., Wang, G.: ‘Gated Siamese convolutional neural network architecture for human re-identification’. Proc. European Conf. on Computer Vision, Amsterdam, Netherlands, 2016, pp. 791808.
    13. 13)
      • 38. Yoon, K., Song, Y.M., Jeon, M.: ‘Multiple hypothesis tracking algorithm for multi-target multi-camera tracking with disjoint views’, IET Image Process., 2018, 12, (7), pp. 11751184.
    14. 14)
      • 20. Li, W., Zhu, X., Gong, S.: ‘Harmonious attention network for person re-identification’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 22852294.
    15. 15)
      • 28. Hao, Y., Wang, N., Li, J., et al: ‘HSME: hypersphere manifold embedding for visible thermal person re-identification’. Proc. AAAI Conf. on Artificial Intelligence, Honolulu, Hawaii, USA, 2019, vol. 33, pp. 83858392.
    16. 16)
      • 25. Moon, H., Phillips, P.J.: ‘Computational and performance aspects of PCA-based face-recognition algorithms’, Perception, 2001, 30, (3), pp. 303321.
    17. 17)
      • 26. Kingma, D.P., Ba, J.: ‘Adam: a method for stochastic optimization’. arXiv preprint arXiv:1412.6980, 2014.
    18. 18)
      • 41. Huang, G., Liu, Z., Van Der Maaten, L., et al: ‘Densely connected convolutional networks’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 2017, pp. 47004708.
    19. 19)
      • 17. Xiao, T., Li, H., Ouyang, W., et al: ‘Learning deep feature representations with domain guided dropout for person re-identification’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 12491258.
    20. 20)
      • 37. Ristani, E., Solera, F., Zou, R., et al: ‘Performance measures and a data set for multi-target, multi-camera tracking’. Proc. European Conf. on Computer Vision, Amsterdam, Netherlands, 2016, pp. 1735.
    21. 21)
      • 40. He, K., Zhang, X., Ren, S., et al: ‘Deep residual learning for image recognition’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 770778.
    22. 22)
      • 27. Ye, M., Lan, X., Wang, Z., et al: ‘Bi-directional center-constrained top-ranking for visible thermal person re-identification’, IEEE Trans. Inf. Forensics Sec., 2020, 15, pp. 407419.
    23. 23)
      • 4. Chang, X., Hospedales, T.M., Xiang, T.: ‘Multi-level factorisation net for person re-identification’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 21092118.
    24. 24)
      • 36. Li, W., Zhao, R., Xiao, T., et al: ‘DeepReID: deep filter pairing neural network for person re-identification’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 152159.
    25. 25)
      • 3. Sun, Y., Zheng, L., Yang, Y., et al: ‘Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline)’. European Conf. on Computer Vision (ECCV), Munich, Germany, 2018, pp. 480496.
    26. 26)
      • 33. Rasiwasia, N., Costa Pereira, J., Coviello, E., et al: ‘A new approach to cross-modal multimedia retrieval’. Proc. 18th ACM Int. Conf. on Multimedia, Firenze, Italy, 2010, pp. 251260.
    27. 27)
      • 21. Wu, A., Zheng, W.S., Lai, J.H.: ‘Robust depth-based person re-identification’, IEEE Trans. Image Process., 2017, 26, (6), pp. 25882603.
    28. 28)
      • 18. Chen, H., Wang, Y., Shi, Y., et al: ‘Deep transfer learning for person re-identification’. Proc. 2018 IEEE Fourth Int. Conf. on Multimedia Big Data (BigMM), Xi'an, China, 2018, pp. 15.
    29. 29)
      • 29. Dalal, N., Triggs, B.: ‘Histograms of oriented gradients for human detection’. Proc. IEEE Int. Conf. on Computer Vision & Pattern Recognition (CVPR'05), San Diego, CA, USA, 2005, vol. 1, pp. 886893.
    30. 30)
      • 34. Pedagadi, S., Orwell, J., Velastin, S., et al: ‘Local fisher discriminant analysis for pedestrian re-identification’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 33183325.
    31. 31)
      • 16. Chen, W., Chen, X., Zhang, J., et al: ‘Beyond triplet loss: a deep quadruplet network for person re-identification’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 2017, pp. 403412.
    32. 32)
      • 1. Zheng, L., Yang, Y., Hauptmann, A.G.: ‘Person re-identification: past, present and future’. arXiv preprint arXiv:1610.02984, 2016.
    33. 33)
      • 5. Dai, Z., Chen, M., Zhu, S., et al: ‘Batch feature erasing for person re-identification and beyond’. arXiv preprint arXiv:1811.07130, 2018.
    34. 34)
      • 10. Kang, J.K., Hoang, T.M., Park, K.R., et al: ‘Person re-identification between visible and thermal camera images based on deep residual CNN using single input’, IEEE Access, 2019, 7, pp. 5797257984.
    35. 35)
      • 8. Ye, M., Wang, Z., Lan, X., et al: ‘Visible thermal person re-identification via dual-constrained top-ranking’. Proc. Int. Joint Conf. on Artificial Intelligence, Stockholm, Sweden, 2018, pp. 10921099.
    36. 36)
      • 22. Barbosa, I.B., Cristani, M., Del Bue, A., et al: ‘Re-identification with RGB-D sensors’. Proc. European Conf. on Computer Vision, Florence, Italy, 2012, pp. 433442.
    37. 37)
      • 9. Dai, P., Ji, R., Wang, H., et al: ‘Cross-modality person re-identification with generative adversarial training’. Proc. Int. Joint Conf. on Artificial Intelligence, Stockholm, Sweden, 2018, pp. 677683.
    38. 38)
      • 42. Selvaraju, R.R., Cogswell, M., Das, A., et al: ‘Grad-CAM: visual explanations from deep networks via gradient-based localization’. Proc. IEEE Int. Conf. on Computer Vision, Venice, Italy, 2017, pp. 618626.
    39. 39)
      • 7. Wu, A., Zheng, W.S., Yu, H.X., et al: ‘RGB-infrared cross-modality person re-identification’. Proc. IEEE Int. Conf. on Computer Vision, Venice, Italy, 2017, pp. 53805389.
    40. 40)
      • 30. Liao, S., Hu, Y., Zhu, X., et al: ‘Person re-identification by local maximal occurrence representation and metric learning’. Proc. IEEE Int. Computer Society Conf. on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 21972206.
    41. 41)
      • 31. Lin, D., Tang, X.: ‘Inter-modality face recognition’. Proc. European Conf. on Computer Vision, Graz, Austria, 2006, pp. 1326.
    42. 42)
      • 35. Zheng, L., Shen, L., Tian, L., et al: ‘Scalable person re-identification: a benchmark’. Proc. IEEE Int. Conf. on Computer Vision, Santiago, Chile, 2015, pp. 11161124.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-ipr.2019.0699
Loading

Related content

content/journals/10.1049/iet-ipr.2019.0699
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address