© The Institution of Engineering and Technology
Vision-based hand pose estimation presents unique challenges, particularly when high-fidelity reconstruction is desired. Searching a large database of synthetic pose candidates for items similar to the input is an attractive means of attaining this goal. The earth mover's distance is a perceptually meaningful measure of dissimilarity that has shown great promise in content-based image retrieval. It is, however, computationally expensive in general and must be used sparingly. The authors investigate a way of economising on its use, in the context of searching for hand pose candidates in large synthetic databases, while preserving much of the accuracy it achieves when applied naively. In particular, a two-tier search method is proposed that achieves similar accuracy with a speed increase of two orders of magnitude. System performance is evaluated on real input, and the results obtained with the different approaches are compared.
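The abstract does not say what each tier of the search computes, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each database item is summarised by a one-dimensional histogram, that the first tier is a cheap L1 comparison of coarsened histograms, and that the second tier applies the earth mover's distance only to the resulting shortlist. All function names and parameters (`emd_1d`, `coarse`, `two_tier_search`, `shortlist_size`, `factor`) are hypothetical.

```python
from typing import List, Sequence, Tuple


def normalise(hist: Sequence[float]) -> List[float]:
    """Scale a histogram so that its bins sum to 1."""
    total = float(sum(hist))
    return [v / total for v in hist]


def emd_1d(p: Sequence[float], q: Sequence[float]) -> float:
    """Earth mover's distance between two normalised 1-D histograms.

    Assumes equal bin counts and unit ground distance between adjacent
    bins; in that special case the EMD is the sum of absolute differences
    of the cumulative histograms.
    """
    carried = 0.0  # mass that still has to be moved to later bins
    work = 0.0
    for a, b in zip(p, q):
        carried += a - b
        work += abs(carried)
    return work


def coarse(hist: Sequence[float], factor: int) -> List[float]:
    """Merge every `factor` adjacent bins into one (cheap descriptor)."""
    return [sum(hist[i:i + factor]) for i in range(0, len(hist), factor)]


def l1(p: Sequence[float], q: Sequence[float]) -> float:
    """L1 distance, used here as the assumed inexpensive first-tier measure."""
    return sum(abs(a - b) for a, b in zip(p, q))


def two_tier_search(query: Sequence[float],
                    database: Sequence[Sequence[float]],
                    shortlist_size: int = 3,
                    factor: int = 4) -> Tuple[int, float]:
    """Return (index, EMD) of the best match found by a two-tier search."""
    q = normalise(query)
    q_coarse = coarse(q, factor)
    items = [normalise(h) for h in database]

    # Tier 1: rank the whole database with the cheap measure.
    ranked = sorted(range(len(items)),
                    key=lambda i: l1(q_coarse, coarse(items[i], factor)))
    shortlist = ranked[:shortlist_size]

    # Tier 2: spend the expensive measure only on the shortlist.
    best = min(shortlist, key=lambda i: emd_1d(q, items[i]))
    return best, emd_1d(q, items[best])


if __name__ == "__main__":
    db = [
        [1, 0, 0, 0, 0, 0, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 1],
        [0, 1, 0, 0, 0, 0, 0, 0],
        [0, 0, 1, 0, 0, 0, 0, 0],
    ]
    print(two_tier_search([0, 0, 4, 0, 0, 0, 0, 0], db))
```

In a sketch like this the shortlist size sets the trade-off: a larger shortlist costs more second-tier evaluations but makes it less likely that the first tier discards the true nearest neighbour.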
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-cvi.2011.0128