Please use this identifier to cite or link to this item:
https://repository.cihe.edu.hk/jspui/handle/cihe/4663
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chan, Anthony Hing-Hung | en_US |
dc.contributor.author | Siu, Wan Chi | en_US |
dc.contributor.author | Cheng, Eric Wing Ho | en_US |
dc.date.accessioned | 2025-04-24T10:25:53Z | - |
dc.date.available | 2025-04-24T10:25:53Z | - |
dc.date.issued | 2024 | - |
dc.identifier.uri | https://repository.cihe.edu.hk/jspui/handle/cihe/4663 | - |
dc.description.abstract | Manipulating facial poses is challenging, especially when addressing significant pose variations. While extensive research has been dedicated to address large poses and manipulate various facial expressions, this frequently results in compromised image quality. The challenge may arise from nonlinearity of the latent space. We must navigate a complex path along the high-quality image manifold and determine the optimal direction for the face rotation task, which may secure the most effective disentanglement. Moreover, the regularity of the latent space also affects directly the quality of the resulting image. In this paper, we have made a careful study of the latent space, and deliberately crafted our model to identify the complicated trajectory of rotating facial manipulation with exceptional disentanglement. Our facial pose generative model, aims at enhancing the quality of generated images while preserving the identity and fidelity and achieving better disentanglement. Data acquisition is another challenging aspect, requiring extensive preparation and meticulous setup. To address this, we suggest a flipping technique to mitigate dataset limitations. Ultimately, we strive to strike a balance between image quality and pose generation, ensuring that our results are both visually pleasing and accurately representing the desired facial pose. | en_US |
dc.language.iso | en | en_US |
dc.title | High-quality facial pose generation with latent space processing | en_US |
dc.type | conference paper | en_US |
dc.relation.conference | 2024 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) | en_US |
dc.contributor.affiliation | Yam Pak Charitable Foundation School of Computing and Information Sciences | en_US |
dc.contributor.affiliation | Yam Pak Charitable Foundation School of Computing and Information Sciences | en_US |
dc.contributor.affiliation | Yam Pak Charitable Foundation School of Computing and Information Sciences | en_US |
dc.cihe.affiliated | Yes | - |
item.languageiso639-1 | en | - |
item.grantfulltext | open | - |
item.fulltext | With Fulltext | - |
item.cerifentitytype | Publications | - |
item.openairecristype | http://purl.org/coar/resource_type/c_5794 | - |
item.openairetype | conference paper | - |
crisitem.author.dept | Yam Pak Charitable Foundation School of Computing and Information Sciences | - |
crisitem.author.dept | Yam Pak Charitable Foundation School of Computing and Information Sciences | - |
crisitem.author.dept | Yam Pak Charitable Foundation School of Computing and Information Sciences | - |
crisitem.author.orcid | 0000-0001-7479-0787 | - |
crisitem.author.orcid | 0000-0001-8280-0367 | - |
Appears in Collections: | CIS Publication |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
View Online | 95 B | HTML | View/Open |

Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.