Vision-based hand pose estimation through similarity search using the earth mover's distance

Access Full Text

Vision-based hand pose estimation through similarity search using the earth mover's distance

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Computer Vision — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Vision-based hand pose estimation presents unique challenges, particularly if high-fidelity reconstruction is desired. Searching large databases of synthetic pose candidates for items similar to the input offers an attractive means of attaining this goal. The earth mover's distance is a perceptually meaningful measure of dissimilarity that has shown great promise in content-based image retrieval. It is in general, however, a computationally expensive operation and must be used sparingly. The authors investigate a way of economising on its use while preserving much of its accuracy when applied naively in the context of searching for hand pose candidates in large synthetic databases. In particular, a two-tier search method is proposed which achieves similar accuracy with a speed increase of two orders of magnitude. The system performance is evaluated using real input and the results obtained using the different approaches are compared.

Inspec keywords: computer vision; image retrieval; performance evaluation; pose estimation; content-based retrieval; image reconstruction

Other keywords: large synthetic databases; vision-based hand pose estimation; contour-based similarity search; computationally expensive operation; system performance evaluation; two-tier search method; earth mover distance; large database searching; high-fidelity reconstruction; content-based image retrieval; synthetic pose candidates

Subjects: Computer vision and image processing techniques; Image recognition; Information retrieval techniques

References

    1. 1)
      • Yianilos, P.N.: `Data structures and algorithms for nearest neighbor search in general metric spaces', Proc. ACM-SIAM Symp. Discrete algorithms, 1993, Austin, Texas, USA, p. 311–321.
    2. 2)
      • G.R. Bradski . Computer vision face tracking for use in a perceptual user interface. Intel Technol. J. , 2 , 12 - 21
    3. 3)
    4. 4)
    5. 5)
      • P. Zezula , G. Amato , V. Dohnal , M. Batko . (2006) Similarity search: the metric space approach.
    6. 6)
      • Athitsos, V., Alon, J., Sclaroff, S., Kollios, G.: `Boostmap: a method for efficient approximate similarity rankings', Proc. IEEE Computer Society Conf. Computer Vision Pattern Recognition, 2004, Washington, DC, USA, 2, p. 268–275.
    7. 7)
      • Indyk, P., Thaper, N.: `Fast image retrieval via embeddings', Int. Workshop Statistics and Computer Theories of Vision (ICCV), 2003, Nice, France.
    8. 8)
      • Zieren, J.: `Visuelle erkennung von handposituren für einen interaktiven gebärdensprachtutor', 2007, PhD, RWTH Aachen.
    9. 9)
      • Rubner, Y., Tomasi, C., Guibas, L.J.: `A metric for distributions with applications to image databases', Proc. Intl. Conf. Computer Vision, 1998, Bombay, India, p. 59–66.
    10. 10)
      • V. Athitsos , S. Sclaroff , A. Camurri , G. Volpe . (2004) Database indexing methods for 3D hand pose estimation, Gesture-based communication in human-computer interaction.
    11. 11)
      • G.R. Bradski . The OpenCV library. Dr Dobb's J. Softw. Tools , 11 , 120 - 125
    12. 12)
      • Athitsos, V., Sclaroff, S.: `Estimating 3D hand pose from a cluttered image', Proc. IEEE Conf. Computer Vision Pattern Recognition, 2003, Madison, WI, USA, 2, p. 432–439.
    13. 13)
    14. 14)
      • Weber, R., Schek, H., Blott, S.: `A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces', Proc. Int. Conf. Very Large Data Bases, 1998, New York, USA, p. 194–205.
    15. 15)
    16. 16)
      • Dick, T., Zieren, J., Kraiss, K.: `Visual hand posture recognition in monocular image sequences', Proc. Symp. German Association Pattern Recognition, 2006, Berlin, Germany, p. 566–575.
    17. 17)
    18. 18)
      • Grauman, K., Darrell, T.: `Fast contour matching using approximate earth mover's distance', Proc. IEEE Conf. Computer Vision Pattern Recognition, 2004, Washington, DC, USA, 1, p. 220–227.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cvi.2011.0128
Loading

Related content

content/journals/10.1049/iet-cvi.2011.0128
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading