http://iet.metastore.ingenta.com
1887

Real-time text tracking in natural scenes

Real-time text tracking in natural scenes

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Computer Vision — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

The authors present a system that automatically detects, recognises and tracks text in natural scenes in real-time. The focus of the author's method is on large text found in outdoor environments, such as shop signs, street names, billboards and so on. Built on top of their previously developed techniques for scene text detection and orientation estimation, the main contribution of this work is to present a complete end-to-end scene text reading system based on text tracking. They propose to use a set of unscented Kalman filters to maintain each text region's identity and to continuously track the homography transformation of the text into a fronto-parallel view, thereby being resilient to erratic camera motion and wide baseline changes in orientation. The system is designed for continuous, unsupervised operation in a handheld or wearable system over long periods of time. It is completely automatic and features quick failure recovery and interactive text reading. It is also highly parallelised to maximise usage of available processing power and achieve real-time operation. They demonstrate the performance of the system on sequences recorded in outdoor scenarios.

References

    1. 1)
      • B. Epshtein , E. Ofek , Y. Wexler .
        1. Epshtein, B., Ofek, E., Wexler, Y.: ‘Detecting text in natural scenes with stroke width transform’. Computer Vision and Pattern Recognition, 2010, pp. 29632970.
        . Computer Vision and Pattern Recognition , 2963 - 2970
    2. 2)
    3. 3)
      • L. Neumann , J. Matas .
        3. Neumann, L., Matas, J.: ‘Real-time scene text localization and recognition’. Computer Vision and Pattern Recognition, 2012, pp. 35383545.
        . Computer Vision and Pattern Recognition , 3538 - 3545
    4. 4)
      • H. Chen , S. Tsai , G. Schroth , D. Chen , R. Grzeszczuk , B. Girod .
        4. Chen, H., Tsai, S., Schroth, G., Chen, D., Grzeszczuk, R., Girod, B.: ‘Robust text detection in natural images with edge-enhanced maximally stable external regions’. Int. Conf. on Image Processing, 2011, pp. 26092612.
        . Int. Conf. on Image Processing , 2609 - 2612
    5. 5)
      • S.M. Lucas , A. Panaretos , L. Sosa , A. Tang , S. Wong , R. Young .
        5. Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ‘ICDAR 2003 robust reading competitions’. Int. Conf. on Document Analysis and Recognition, 2003, pp. 682687.
        . Int. Conf. on Document Analysis and Recognition , 682 - 687
    6. 6)
    7. 7)
    8. 8)
      • C. Merino , M. Mirmehdi .
        8. Merino, C., Mirmehdi, M.: ‘A framework towards realtime detection and tracking of text’. Camera Based Document Analysis and Recognition, 2007, pp. 1017.
        . Camera Based Document Analysis and Recognition , 10 - 17
    9. 9)
    10. 10)
    11. 11)
      • J. Gllavata , R. Ewerth , B. Freisleben .
        11. Gllavata, J., Ewerth, R., Freisleben, B.: ‘Tracking text in MPEG videos’. Int. Conf. on Multimedia, 2004, pp. 240243.
        . Int. Conf. on Multimedia , 240 - 243
    12. 12)
      • G.K. Myers , B. Burns .
        12. Myers, G.K., Burns, B.: ‘A robust method for tracking scene text in video imagery’. Camera Based Document Analysis and Recognition, 2005, vol. 1.
        . Camera Based Document Analysis and Recognition
    13. 13)
      • H. Shiratori , H. Goto , H. Kobayashi .
        13. Shiratori, H., Goto, H., Kobayashi, H.: ‘An efficient text capture method for moving robots using DCT feature and text tracking’. Int. Conf. on Pattern Recognition, 2006, pp. 10501053.
        . Int. Conf. on Pattern Recognition , 1050 - 1053
    14. 14)
      • M. Tanaka , H. Goto .
        14. Tanaka, M., Goto, H.: ‘Autonomous text capturing robot using improved DCT feature and text tracking’. Int. Conf. on Document Analysis and Recognition, 2007, 2, pp. 11781182.
        . Int. Conf. on Document Analysis and Recognition , 1178 - 1182
    15. 15)
      • M. Tanaka , H. Goto .
        15. Tanaka, M., Goto, H.: ‘Text-tracking wearable camera system for visually-impaired people’. Int. Conf. on Pattern Recognition, 2008, pp. 14.
        . Int. Conf. on Pattern Recognition , 1 - 4
    16. 16)
      • H. Goto , M. Tanaka .
        16. Goto, H., Tanaka, M.: ‘Text-tracking wearable camera system for the blind’. Int. Conf. on Document Analysis and Recognition, 2009, pp. 141145.
        . Int. Conf. on Document Analysis and Recognition , 141 - 145
    17. 17)
      • Y. Na , D. Wen .
        17. Na, Y., Wen, D.: ‘An effective video text tracking algorithm based on sift feature and geometric constraint’. Advances in Multimedia Information Processing, 2010, pp. 392403.
        . Advances in Multimedia Information Processing , 392 - 403
    18. 18)
      • R. Minetto , N. Thome , M. Cord , N.J. Leite , J. Stolfi .
        18. Minetto, R., Thome, N., Cord, M., Leite, N.J., Stolfi, J.: ‘Snoopertrack: text detection and tracking for outdoor videos’. Int. Conf. on Image Processing, 2011, pp. 505508.
        . Int. Conf. on Image Processing , 505 - 508
    19. 19)
    20. 20)
      • R.I. Hartley , A. Zisserman . (2004)
        20. Hartley, R.I., Zisserman, A.: ‘Multiple view geometry in computer vision’ (Cambridge University Press, 2004, 2nd edn.).
        .
    21. 21)
      • T.Q. Phan , P. Shivakumara , T. Lu , C.L. Tan .
        21. Phan, T.Q., Shivakumara, P., Lu, T., Tan, C.L.: ‘Recognition of video text through temporal integration’. Int. Conf. on Document Analysis and Recognition, 2013, pp. 589593.
        . Int. Conf. on Document Analysis and Recognition , 589 - 593
    22. 22)
    23. 23)
      • K. Wang , B. Babenko , S. Belongie .
        23. Wang, K., Babenko, B., Belongie, S.: ‘End-to-end scene text recognition’. Int. Conf. on Computer Vision, 2011, pp. 14571464.
        . Int. Conf. on Computer Vision , 1457 - 1464
    24. 24)
      • A. Mishra , K. Alahari , C. Jawahar .
        24. Mishra, A., Alahari, K., Jawahar, C.: ‘Top-down and bottom-up cues for scene text recognition’. Computer Vision and Pattern Recognition, 2012, pp. 26872694.
        . Computer Vision and Pattern Recognition , 2687 - 2694
    25. 25)
    26. 26)
      • C. Merino-Gracia , K. Lenc , M. Mirmehdi .
        26. Merino-Gracia, C., Lenc, K., Mirmehdi, M.: ‘A head-mounted device for recognizing text in natural scenes’, in Iwamura, M., Shafait, F.(Eds.): ‘Camera based document analysis and recognition’, 2012, (LNCS, 7139), pp. 2941.
        .
    27. 27)
      • A. Targhi , E. Hayman , J. Eklundh , M. Shahshahani .
        27. Targhi, A., Hayman, E., Eklundh, J., Shahshahani, M.: ‘The Eigen-transform & applications’. Asian Conf. of Computer Vision, I, 2006, pp. 7079.
        . Asian Conf. of Computer Vision, I , 70 - 79
    28. 28)
      • M. de Berg , M. van Kreveld , M. Overmars , O. Schwarzkopf . (2000)
        28. de Berg, M., van Kreveld, M., Overmars, M., Schwarzkopf, O.: ‘Computational geometry: algorithms and applications’ (Springer-Verlag, 2000, 2nd edn.).
        .
    29. 29)
      • M. Pilu .
        29. Pilu, M.: ‘Extraction of illusory linear clues in perspectively skewed documents’. Computer Vision and Pattern Recognition, 2001, pp. 363368.
        . Computer Vision and Pattern Recognition , 363 - 368
    30. 30)
      • G. Toussaint .
        30. Toussaint, G.: ‘Solving geometric problems with the rotating calipers’. Mediterranean Electrotechnical Conf., 1983, pp. 1017.
        . Mediterranean Electrotechnical Conf , 10 - 17
    31. 31)
      • A. Doucet , J. de Freitas , N. Gordon . (2001)
        31. Doucet, A., de Freitas, J., Gordon, N.: ‘Sequential Monte Carlo methods in practice’ (Springer-Verlag, 2001).
        .
    32. 32)
      • E. Wan , R. Van Der Merwe .
        32. Wan, E., Van Der Merwe, R.: ‘The unscented Kalman filter for nonlinear estimation’. Adaptive Systems for Signal Processing, Communications, and Control, 2000, pp. 153158.
        . Adaptive Systems for Signal Processing, Communications, and Control , 153 - 158
    33. 33)
      • G. Klein , D. Murray .
        33. Klein, G., Murray, D.: ‘Parallel tracking and mapping for small AR workspaces’. ISMAR, 2007, pp. 225234.
        . ISMAR , 225 - 234
    34. 34)
    35. 35)
      • K. Bernardin , R. Stiefelhagen .
        35. Bernardin, K., Stiefelhagen, R.: ‘Evaluating multiple object tracking performance: the CLEAR MOT metrics’, J. Image Video Process., 2008, 2008, pp. 1:11:10.
        . J. Image Video Process. , 1:1 - 1:10
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cvi.2013.0217
Loading

Related content

content/journals/10.1049/iet-cvi.2013.0217
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address