This is an open access article published by the IET under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/)
In recent decades, the local pattern descriptor has achieved tremendous success in the field of face recognition, pedestrian detection, and image texture analysis. This study presents a generic approach, called the filtered local pattern descriptor (FLPD), which expands the traditional local pattern descriptor (TLPD) by using multi-scale and multi-type filter banks. The FLPD encodes the local information of an image based on the convolutional sum of the sub-image blocks and the filter banks, instead of the original pixel values in the TLPD. This design can effectively increase the diversity of the TLPD feature extraction, thereby enhancing the ability of feature representation and its reliability. Two FLPD-based feature representation methods are proposed for the face image and the pedestrian image. To evaluate the performance of the proposed FLPD, extensive experiments on face recognition and infrared pedestrian detection are conducted using several benchmark image datasets. The experimental results illustrate that the FLPD has a significant advantage in the discrimination and stability of feature extraction, and is able to achieve a satisfactory accuracy in comparison with state-of-the-art methods. It is demonstrated that the FLPD is a powerful and convenient extension of the TLPD by filter banks, and suitable to be implemented as feature extraction into approaches to solve the binary or multi-class image classification problems.
References
-
-
1)
-
9. Dubey, S.R., Singh, S.K., Singh, R.K.: ‘Local wavelet pattern: a new feature descriptor for image retrieval in medical CT databases’, IEEE Trans. Image Process., 2015, 24, (12), pp. 5892–5903 (doi: 10.1109/TIP.2015.2493446).
-
2)
-
20. Lecun, Y., Bottou, L., Bengio, Y., et al: ‘Gradient-based learning applied to document recognition’, Proc. IEEE, 1998, 86, (11), pp. 2278–2324 (doi: 10.1109/5.726791).
-
3)
-
11. Ojala, T., Pietikäinen, M., Mäenpää, T.: ‘Gray Scale and Rotation Invariant Texture Classification with Local Binary Patterns’. Computer Vision – ECCV 2000, Springer Berlin Heidelberg, 2000, pp. 404–420.
-
4)
-
2. Guo, Z., Zhang, D.: ‘A completed modeling of local binary pattern operator for texture classification’, IEEE Trans. Image Process., 2010, 19, (6), pp. 1657–1663 (doi: 10.1109/TIP.2010.2044957).
-
5)
-
42. Varma, M., Zisserman, A.: ‘A statistical approach to texture classification from single images’, Int. J. Comput. Vis., 2005, 62, (1–2), pp. 61–81 (doi: 10.1007/s11263-005-4635-4).
-
6)
-
5. Papakostas, G.A., Koulouriotis, D.E., Karakasis, E.G., et al: ‘Moment-based local binary patterns: a novel descriptor for invariant pattern recognition applications’, Neurocomputing, 2013, 99, (1), pp. 358–371 (doi: 10.1016/j.neucom.2012.06.031).
-
7)
-
8)
-
2. Ojala, T., Pietikainen, M., Harwood, D.: ‘Performance evaluation of texture measures with classification based on Kullback discrimination of distributions’. IAPR Int. Conf. on Pattern Recognition, 1994. Vol. 1 – Conf. A: Computer Vision and Image Processing IEEE, 1994, vol. 1, pp. 582–585.
-
9)
-
27. Zhang, S., Benenson, R., Schiele, B.: ‘Filtered channel features for pedestrian detection’. IEEE Conf. on Computer Vision and Pattern Recognition IEEE Computer Society, 2015, pp. 1751–1760.
-
10)
-
28. Ahonen, T., Hadid, A., Pietikainen, M.: ‘Face description with local binary patterns: application to face recognition’, IEEE Trans. Pattern Anal. Mach. Intell., 2006, 28, (12), pp. 2037–2041 (doi: 10.1109/TPAMI.2006.244).
-
11)
-
10. Viola, P., Jones, M.: ‘Robust real-time face detection’, Int. J. Comput. Vis., 2004, 2, (57), pp. 137–154 (doi: 10.1023/B:VISI.0000013087.49260.fb).
-
12)
-
13. Tan, X., Triggs, B.: ‘Enhanced local texture feature sets for face recognition under difficult lighting conditions’, IEEE Trans. Image Process., 2010, 19, (6), pp. 168–182.
-
13)
-
37. Felzenszwalb, P.F., Girshick, R.B., Mcallester, D., et al: ‘Object detection with discriminatively trained part-based models’, IEEE Trans. Softw. Eng., 2010, 32, (9), pp. 1627–1645.
-
14)
-
8. Verma, M., Raman, B.: ‘Local tri-directional patterns: a new texture feature descriptor for image retrieval’, Digit. Signal Process., 2016, 51, pp. 62–72 (doi: 10.1016/j.dsp.2016.02.002).
-
15)
-
7. Dollár, P., Wojek, C., Schiele, B., Perona, P.: ‘Pedestrian detection: an evaluation of the state of the art’, IEEE Trans. Pattern Anal. Mach. Intell., 2012, 34, (4), pp. 743–761 (doi: 10.1109/TPAMI.2011.155).
-
16)
-
32. Nam, W., Dollár, P., Han, J.H.: ‘Local decorrelation for improved detection’, Adv. Neural Inf. Process. Syst., 2014, 1, pp. 424–432.
-
17)
-
30. Murala, S., Maheshwari, R.P., Balasubramanian, R.: ‘Local maximum edge binary patterns: a new descriptor for image retrieval and object tracking’, Signal Process., 2012, 92, pp. 1467–1479 (doi: 10.1016/j.sigpro.2011.12.005).
-
18)
-
14. Liao, S., Zhu, X., Lei, Z., et al: ‘Learning multi-scale block local binary patterns for face recognition’, Adv. Biometrics., 2007, 4642, pp. 828–837 (doi: 10.1007/978-3-540-74549-5_87).
-
19)
-
9. Ojala, T., Pietikäinen, M., Harwood, D.: ‘A comparative study of texture measures with classification based on feature distributions’, Pattern Recognit., 1996, 29, (1), pp. 51–59 (doi: 10.1016/0031-3203(95)00067-4).
-
20)
-
4. Pietikäinen, M., Nurmela, T., Mäenpää, T., et al: ‘View-based recognition of real-world textures’, Pattern Recognit., 2004, 37, (2), pp. 313–323 (doi: 10.1016/S0031-3203(03)00231-0).
-
21)
-
24. Dalal, N., Triggs, B.: ‘Histograms of oriented gradients for human detection’. IEEE Conf. on Computer Vision and Pattern Recognition, 2013, pp. 886–893.
-
22)
-
16. Jabid, T., Kabir, M.H., Chae, O.: ‘Local Directional Pattern (LDP) for face recognition’, Int. J. Innov. Comput. Inf. Control, 2010, 8, (4), pp. 329–330.
-
23)
-
22. Kim, D.S., Kim, M., Kim, B.S., et al: ‘Histograms of local intensity differences for pedestrian classification in far-infrared images’, Electron. Lett., 2013, 49, (4), pp. 258–260 (doi: 10.1049/el.2012.4261).
-
24)
-
9. Dubey, S.R., Singh, S.K., Singh, R.K.: ‘Local diagonal extrema pattern: a new and efficient feature descriptor for CT image retrieval’, Signal Process. Lett., 2015, 22, (9), pp. 1215–1219 (doi: 10.1109/LSP.2015.2392623).
-
25)
-
3. Casasent, D.P.: ‘Experiments with two industrial problems using texture classification based on feature distributions’. , 1994.
-
26)
-
30. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ‘ImageNet classification with deep convolutional neural networks’, Adv. Neural Inf. Process. Syst., 2012, 25, (2), p. 2012.
-
27)
-
21. Wu, J., Rehg, J.M.: ‘CENTRIST: a visual descriptor for scene categorization’, IEEE Trans. Pattern Anal. Mach. Intell., 2011, 33, (8), pp. 1489–1501 (doi: 10.1109/TPAMI.2010.224).
-
28)
-
17. Jun, B., Kim, D.: ‘Robust face detection using local gradient patterns and evidence accumulation’, Pattern Recognit., 2012, 45, (9), pp. 3304–3316 (doi: 10.1016/j.patcog.2012.02.031).
-
29)
-
20. Sun, J., Fan, G., Wu, X.: ‘New local edge binary patterns for image retrieval’. IEEE Int. Conf. on Image Processing, 2013, pp. 4014–4018.
-
30)
-
12. Zhang, W., Shan, S., Gao, W., et al: ‘Local Gabor Binary Pattern Histogram Sequence (LGBPHS): a novel non-statistical model for face representation and recognition’. Tenth IEEE Int. Conf. on Computer Vision IEEE Computer Society, 2005, pp. 786–791.
-
31)
-
22. Ojala, T., Pietikainen, M., Maenpaa, T.: ‘Multiresolution gray-scale and rotation invariant texture classification with local binary patterns’, Trans. Pattern Anal. Mach. Intell., 2002, 24, (7), pp. 971–987 (doi: 10.1109/TPAMI.2002.1017623).
-
32)
-
31. Girshick, R., Donahue, J., Darrell, T., et al: ‘Region-based convolutional networks for accurate object detection and segmentation’, IEEE Trans. Pattern Anal. Mach. Intell., 2015, 38, (1), pp. 1–1.
-
33)
-
33. Zhang, S., Beneson, R., Omran, M., et al: , 2016.
-
34)
-
15. Liao, S., Chung, A.C.S.: ‘Face recognition by using elongated local binary patterns with average maximum distance gradient magnitude’. Computer Vision - ACCV 2007, Asian Conf. on Computer Vision, Proc. 2007, Tokyo, Japan, 18–22 November 2007, pp. 672–679.
-
35)
-
26. Varma, M., Zisserman, A.: ‘Texture classification: are filter banks necessary?’. IEEE Conf. on Computer Vision and Pattern Recognition, 2003, p. 691.
-
36)
-
23. Wu, J., Liu, N., Geyer, C., et al: ‘C4: a real-time object detection framework’, IEEE Trans. Image Process., 2013, 22, (10), pp. 4096–4107 (doi: 10.1109/TIP.2013.2270111).
-
37)
http://iet.metastore.ingenta.com/content/journals/10.1049/joe.2016.0307
Related content
content/journals/10.1049/joe.2016.0307
pub_keyword,iet_inspecKeyword,pub_concept
6
6