Multimodal supervised image translation

Multimodal supervised image translation

For access to this article, please select a purchase option:

Buy article PDF
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Your details
Why are you recommending this title?
Select reason:
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Multimodal image-to-image translation is a class of vision and graphics problems where the goal is to learn a one-to-many mapping between the source domain and target domain. Given an image in the source domain, the model aims to produce as many diverse results as possible. It is an important and challenging problem in the task of image translation. To this end, recent works utilise Gaussian vectors to produce diverse results but with a small difference. It is because of the special probabilistic nature of Gaussian distribution. In this work, the authors propose linearly distributed latent codes instead of conventional Gaussian vectors, which control the style of generated images. Taking advantage of linear distribution, their model can produce much more diverse results and outperform the state-of-the-art baselines in terms of diversity. Qualitative and quantitative comparisons against baselines demonstrate the effectiveness and superiority of their method.

Related content

This is a required field
Please enter a valid email address