A Hybrid Deep Learning Approach for Color Face Generation Using Mask R-CNN and GAN
| dc.contributor.advisor | Mesfin Abebe (PhD) | |
| dc.contributor.author | Mekonnen Bayisa | |
| dc.date.accessioned | 2026-04-07T10:48:20Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Generating realistic, high-resolution color face images remains a challenging task in computer vision because it requires both a precise structural representation of facial components and fine grained texture generation. Existing GAN-based techniques tend to fail in preserving semantic consistency between facial areas and generating high-quality textures, thus resulting in unnatural or blurry images. To solve these issues, this research presents a hybrid deep learning approach that combines Mask R-CNN for semantic facial region segmentation and a GAN-based generator for realistic face image synthesis. Mask R-CNN is used to precisely segment dominant facial features, including eyes, nose, and mouth, which are then employed to guide an attention-enhanced U-Net generator. The generator uses self-attention modules to model long-range spatial dependencies and channel attention to dynamically highlight informative feature channels, thereby preserving local and global image details. A multiscale PatchGAN discriminator enforces image realism at various scales, while training process employs a combination of pixel-wise, perceptual, feature-matching, and structural similarity losses to enhance overall image realism. The introduced method is compared on the CelebA-HQ dataset, with PSNR of 28.33, SSIM of 0.9207, and FID of 21.77, outperforming baseline models and state-of-the-art methods. In addition, the employment of semantic guidance and attention mechanisms enables the model to generalize well even with small or diverse datasets, making it practical for real-world usage scenarios. The results show the framework's promise for high-fidelity facial synthesis in virtual reality, digital media, and other computer vision applications. | |
| dc.identifier.uri | https://etd.astu.edu.et/handle/123456789/3049 | |
| dc.language.iso | en | |
| dc.publisher | ASTU | |
| dc.subject | Color Face Synthesis | |
| dc.subject | GAN | |
| dc.subject | Mask R-CNN | |
| dc.subject | Facial Features Segmentation | |
| dc.subject | Hybrid Deep Learning | |
| dc.title | A Hybrid Deep Learning Approach for Color Face Generation Using Mask R-CNN and GAN | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Final Thesis by Mekonnen Bayisa for print with similarity.pdf
- Size:
- 2.7 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed to upon submission
- Description:
