Photo-realistic image super-resolution via variational autoencoders

Siu, Wan Chi; Liu, Zhisong

Please use this identifier to cite or link to this item: https://repository.cihe.edu.hk/jspui/handle/cihe/1239

DC Field	Value	Language
dc.contributor.author	Siu, Wan Chi	en_US
dc.contributor.author	Liu, Zhisong	-
dc.contributor.other	Chan, Y.-L.	-
dc.date.accessioned	2021-08-11T05:54:32Z	-
dc.date.available	2021-08-11T05:54:32Z	-
dc.date.issued	2021	-
dc.identifier.uri	https://repository.cihe.edu.hk/jspui/handle/cihe/1239	-
dc.description.abstract	There is a great leap in objective accuracy on image super-resolution, which recently brings a new challenge on image super-resolution with larger up-scaling (e.g. 4× ) using pixel based distortion for measurement. This causes over-smooth effect which cannot grasp well the perceptual similarity. The advent of generative adversarial networks makes it possible super-resolve a low-resolution image to generate photo-realistic images sharing distribution with the high-resolution images. However, generative networks suffer from problems of mode-collapse and unrealistic sample generation. We propose to perform Image Super-Resolution via Variational AutoEncoders (SR-VAE) learning according to the conditional distribution of the high-resolution images induced by the low-resolution images. Given that the Conditional Variational Autoencoders tend to generate blur images, we add the conditional sampling mechanism to narrow down the latent subspace for reconstruction. To evaluate the model generalization, we use KL loss to measure the divergence between latent vectors and standard Gaussian distribution. Eventually, in order to balance the trade-off between super-resolution distortion and perception, not only that we use pixel based loss, we also use the modified deep feature loss between SR and HR images to estimate the reconstruction. In experiments, we evaluated a large number of datasets to make comparison with other state-of-the-art super-resolution approaches. Results on both objective and subjective measurements show that our proposed SR-VAE can achieve good photo-realistic perceptual quality closer to the natural image manifold while maintain low distortion.	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.relation.ispartof	IEEE Transactions on Circuits and Systems for Video Technology	en_US
dc.title	Photo-realistic image super-resolution via variational autoencoders	en_US
dc.type	journal article	en_US
dc.identifier.doi	10.1109/TCSVT.2020.3003832	-
dc.contributor.affiliation	School of Computing and Information Sciences	en_US
dc.relation.issn	1558-2205	en_US
dc.description.volume	31	en_US
dc.description.issue	4	en_US
dc.description.startpage	1351	en_US
dc.description.endpage	1365	en_US
dc.cihe.affiliated	No	-
item.grantfulltext	none	-
item.languageiso639-1	en	-
item.fulltext	No Fulltext	-
item.openairecristype	http://purl.org/coar/resource_type/c_6501	-
item.openairetype	journal article	-
item.cerifentitytype	Publications	-
crisitem.author.dept	Yam Pak Charitable Foundation School of Computing and Information Sciences	-
crisitem.author.dept	Yam Pak Charitable Foundation School of Computing and Information Sciences	-
crisitem.author.orcid	0000-0001-8280-0367	-
crisitem.author.orcid	0000-0003-4507-3097	-
Appears in Collections:	CIS Publication

Show simple item record

Google Scholar^TM

Check

Google Scholar^TM

Altmetric

Altmetric

Google ScholarTM

Altmetric

Altmetric

Google Scholar^TM