Online ISSN
1751-9640
Print ISSN
1751-9632
IET Computer Vision
Volume 4, Issue 3, September 2010
Volumes & issues:
Volume 4, Issue 3
September 2010
-
- Author(s): J. Cai and R. Walker
- Source: IET Computer Vision, Volume 4, Issue 3, p. 149 –161
- DOI: 10.1049/iet-cvi.2009.0063
- Type: Article
- + Show details - Hide details
-
p.
149
–161
(13)
In this study, the authors propose a novel algorithm to estimate the heights of objects from monocular aerial images taken from mobile platforms such as unmanned aerial vehicles and small airplanes. Sequential images captured by a single camera mounted on a mobile platform contain 3D information of objects. In this study, the authors propose to use illumination normalisation to reduce illumination variations and to use at least two objects with known distances to accurately estimate the camera focal length. The authors also propose a novel stereo matching algorithm using dynamic programming with explicit occlusion modelling to recover depth information in occluded regions and to preserve depth discontinuity. As a result, the authors are able to reliably estimate the heights of objects in or close to power line corridors. Our experiments show that the proposed algorithm can estimate the heights of trees and power poles from aerial images with average errors of 1.8 and 1.1 m, respectively, when the flight height is in the range between 230 and 280 m above ground level. - Author(s): N.A. Thacker ; J.V. Manjon ; P.A. Bromiley
- Source: IET Computer Vision, Volume 4, Issue 3, p. 162 –172
- DOI: 10.1049/iet-cvi.2008.0076
- Type: Article
- + Show details - Hide details
-
p.
162
–172
(11)
Noise filtering is a common step in image processing, and is particularly effective in improving the subjective quality of images. A large number of techniques have been developed, many of which concentrate on the problem of removing noise without damaging small structures such as edges. One recent approach that demonstrates empirical merit is the non-local means (NLM) algorithm. However, in order to use noise filtering algorithms in quantitative or clinical image analysis tasks an understanding of their behaviour that goes beyond subjective appearance must be developed. The purpose of this study is to investigate the statistical basis of NLM in order to attempt to understand the conditions required for its use. The theory is illustrated on synthetic data and clinical magnetic resonance images of the brain. - Author(s): K. Ruba Soundar and K. Murugesan
- Source: IET Computer Vision, Volume 4, Issue 3, p. 173 –182
- DOI: 10.1049/iet-cvi.2008.0065
- Type: Article
- + Show details - Hide details
-
p.
173
–182
(10)
Face recognition can significantly impact authentication, monitoring and indexing applications. Much research on face recognition using global and local information has been done earlier. By using global feature preservation techniques like principal component analysis (PCA) and linear discriminant analysis (LDA), the authors can effectively preserve only the Euclidean structure of face space that suffers lack of local features, but which may play a major role in some applications. On the other hand, the local feature preservation technique namely locality preserving projections (LPP) preserves local information and obtains a face subspace that best detects the essential face manifold structure; however, it also suffers loss in global features which may also be important in some of the applications. A new combined approach for recognising faces that integrates the advantages of the global feature extraction technique LDA and the local feature extraction technique LPP has been introduced here. Xiaofei He et al. in their work used PCA to extract similarity features from a given set of images followed by LPP. But in the proposed method, the authors use LDA (instead of PCA) to extract discriminating features that yields improved facial image recognition results. This has been verified by making a fair comparison with the existing methods. - Author(s): G. Passino ; I. Patras ; E. Izquierdo
- Source: IET Computer Vision, Volume 4, Issue 3, p. 183 –194
- DOI: 10.1049/iet-cvi.2008.0093
- Type: Article
- + Show details - Hide details
-
p.
183
–194
(12)
In image semantic segmentation a semantic category label is associated to each image pixel. This classification problem is characterised by pixel dependencies at different scales. On a small-scale pixel correlation is related to object instance sharing, whereas on a middle- and large scale to category co-presence and relative location constraints. The contribution of this study is two-fold. First, the authors present a framework that jointly learns category appearances and pixel dependencies at different scales. Small-scale dependencies are accounted by clustering pixels into larger patches via image oversegmentation. To tackle middle-scale dependencies a conditional random field (CRF) is built over the patches. A novel strategy to exploit local patch aspect coherence is used to impose an optimised structure in the graph to have exact and efficient inference. The second contribution is a method to account for full patch neighbourhoods without introducing loops in the graphical structures. ‘Weak neighbours’ are introduced, which are patches connected in the image but not in the inference graph. They are pre-classified according to their visual appearance and their category distribution probability is then used in the CRF inference step. Experimental evidence of the validity of the method shows improvements in comparison to other works in the field. - Author(s): H. Hu ; P. Zhang ; F. De la Torre
- Source: IET Computer Vision, Volume 4, Issue 3, p. 195 –208
- DOI: 10.1049/iet-cvi.2009.0024
- Type: Article
- + Show details - Hide details
-
p.
195
–208
(14)
There are two fundamental problems with the linear discriminant analysis (LDA) for face recognition. First one is LDA is not stable because of the small training sample size problem. The other is that it would collapse the data samples of different classes into one single cluster when the class distributions are multimodal. An enhanced LDA method is proposed to overcome these two problems. The between- and within-class scatters are reformulated by introducing two different weighted matrices in respective. The enhanced Fisher criterion is then presented, which can preserve the local structure of different class in the reduced subspace. Moreover, maximum margin criterion is adopted to avoid the singularity problem of the within-class scatter matrix. Extensive experiments show encouraging recognition performance of the proposed algorithm. - Author(s): B.J. Kang and K.R. Park
- Source: IET Computer Vision, Volume 4, Issue 3, p. 209 –217
- DOI: 10.1049/iet-cvi.2009.0081
- Type: Article
- + Show details - Hide details
-
p.
209
–217
(9)
There are limits to a single biometric observation such as variation in an individual biometric feature due to the condition of a sensor, the health condition of a human, illumination variation and so on. To overcome such limitations, the authors propose a new multimodal biometric approach integrating finger vein recognition and finger geometry recognition at the score level. The method presents three advantages compared to previous works: (i) the proposed multimodal biometric system can be constructed as a tiny device, which uses a finger vein and finger geometry features acquired from a single finger; (ii) the proposed finger geometry recognition, based on Fourier descriptors, is robust to the translation and rotation of a finger; and (iii) the authors obtained better recognition accuracy using the score-level fusion method based on a support vector machine than by other score-level fusion methods such as the MAX, MIN and SUM rules. The results showed that the equal error rate of the proposed method decreased by as much as 1.089 and 1.627% compared with finger vein recognition and finger geometry recognition methods, respectively. - Author(s): R. Pal ; A. Mukherjee ; P. Mitra ; J. Mukherjee
- Source: IET Computer Vision, Volume 4, Issue 3, p. 218 –229
- DOI: 10.1049/iet-cvi.2009.0067
- Type: Article
- + Show details - Hide details
-
p.
218
–229
(12)
Visual attention is an indispensable component of complex vision tasks. A multi-scale, complex network-based approach for determining visual saliency is described. It uses degree centrality (conceptually and computationally the simplest among all the centrality measures) over a network of image regions to form a saliency map. The regions used in the network are multiscale in nature with scale selected automatically. Experimental evaluation establishes the superiority of the method over existing saliency methods, even in noisy environments.
Height estimation from monocular image sequences using dynamic programming with explicit occlusions
Statistical interpretation of non-local means
Preserving global and local information – a combined approach for recognising face images
Aspect coherence for graph-based semantic image labelling
Face recognition using enhanced linear discriminant analysis
Multimodal biometric method based on vein and geometry of a single finger
Modelling visual saliency using degree centrality
Most viewed content for this Journal
Article
content/journals/iet-cvi
Journal
5
Most cited content for this Journal
-
Brain tumour classification using two-tier classifier with adaptive segmentation technique
- Author(s): V. Anitha and S. Murugavalli
- Type: Article
-
Driving posture recognition by convolutional neural networks
- Author(s): Chao Yan ; Frans Coenen ; Bailing Zhang
- Type: Article
-
Local directional mask maximum edge patterns for image retrieval and face recognition
- Author(s): Santosh Kumar Vipparthi ; Subrahmanyam Murala ; Anil Balaji Gonde ; Q.M. Jonathan Wu
- Type: Article
-
Fast and accurate algorithm for eye localisation for gaze tracking in low-resolution images
- Author(s): Anjith George and Aurobinda Routray
- Type: Article
-
‘Owl’ and ‘Lizard’: patterns of head pose and eye pose in driver gaze classification
- Author(s): Lex Fridman ; Joonbum Lee ; Bryan Reimer ; Trent Victor
- Type: Article