Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free CNN-based UGS method using Cartesian-to-polar coordinate transformation

The main concern of user-guided segmentation (UGS) is to achieve high segmentation accuracy with minimal user interaction. A novel convolutional neural network (CNN)-based UGS method is proposed, which employs a single click as the user interaction. In the proposed method, the input image in the Cartesian coordinate system is first converted into the polar transformed image with the user-guided point (UGP) as the origin of the polar coordinate system. The transformed image not only effectively delivers the UGP to the CNN, but also enables a single-scale convolution kernel to act as a multi-scale kernel, whose receptive field in the Cartesian coordinate system is altered based on the UGP without any extra parameters. In addition, a feature selection module (FSM) is introduced and utilised to additionally extract radial and angular features from the polar transformed image. Experimental results demonstrate that the proposed CNN using the polar transformed image improves the segmentation accuracy (mean intersection over union) by 3.69% on PASCAL VOC 2012 dataset compared with the CNN using the Cartesian coordinate image. The FSM achieves additional performance improvement of 1.32%. Moreover, the proposed method outperforms the conventional non-CNN-based UGS methods by 12.61% on average.

References

    1. 1)
    2. 2)
    3. 3)
      • 6. Mishra, A., Aloimonos, Y., Fah, C.L.: ‘Active segmentation with fixation’. IEEE 12th Int. Conf. Computer Vision, Kyoto, Japan, September 2009, pp. 468475.
    4. 4)
    5. 5)
      • 7. Long, J., Shelhamer, E., Darrell, T.: ‘Fully convolutional networks for semantic segmentation’. IEEE Conf. Computer Vision and Pattern Recognition, Boston, MA, USA, June 2015, pp. 34313440.
    6. 6)
    7. 7)
      • 5. Peng, C., Zhang, Z., Yu, G., et al: ‘Large kernel matters – improve semantic segmentation by global convolution network’. IEEE Conf. Computer Vision and Pattern Recognition, Honolulu, HI, USA, July 2017, pp. 43534361.
    8. 8)
      • 2. Jain, S.D., Grauman, K.: ‘Click carving: segmenting objects in video with point clicks’, arXiv preprint arXiv:1607.0115, 2016.
http://iet.metastore.ingenta.com/content/journals/10.1049/el.2018.5051
Loading

Related content

content/journals/10.1049/el.2018.5051
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address