Generative Adversarial Network Based Visual Saliency Prediction with Cascaded Hierarchical Atrous Spatial Pyramid Pooling

dc.contributor.advisorProf. Yun Koo Chung
dc.contributor.authorDaniel, Dufera
dc.date.accessioned2025-12-17T10:54:11Z
dc.date.issued2022-09
dc.description.abstractVisual saliency refers to an area of an image that attracts human attention. The Human Visual System (HVS) can focus on specific parts of a scene, rather than the whole image. Visual attention describes a set of cognitive procedures that choose important information and filter out unnecessary information from cluttered visual scenes. Images become a soul in computer vision since it contains plenty of information and human being receives 80% of information through vision. In processing the whole image while only a certain part of an image is needed, more resources are consumed. Instead of processing the whole pixels of an image, specifying only the needed pixel is computationally efficient to minimize the efforts. This is achieved by using GAN with CHASPP module and EfficientNet-B7 which uniformly scales up all dimensions of the image (depth, width, and resolution) is selected as feature extractor in this study which improves the way of extracting features in visual saliency prediction. Different datasets like CAT2000, MIT1003, DUTOMRON, and PASCALS are used in this study to illustrate the efficiency of the selected models and techniques. Human attention modeling focuses on a bottom-up approach that computes the impact of visual stimuli popping from its surrounding. However different models and algorithms have different results in the prediction of the attention area of an image. In this study, we developed effective visual saliency prediction using GAN with CHASPP and other factors like edge loss and perceptual loss. CHASPP module scored the best result on the same datasets measured by different evaluation metrics. It improved the baseline work of SalGAN+ASPP from 3.356 ± 0.04 to 3.851 ± 0.01 (SalGAN+CHASPP+e). This study concluded that the CHASPP module, edge loss, and perceptual loss have a great influence on visual saliency prediction using a generative model.en_US
dc.description.sponsorshipASTUen_US
dc.identifier.urihttp://10.240.1.28:4000/handle/123456789/1552
dc.language.isoen_USen_US
dc.publisherASTUen_US
dc.subjectVisual saliency prediction, Attention area, Generative Adversarial Network, Cascaded Hierarchical Atrous Spatial Pyramid Pooling, Low-Level Features, High-Level Featuresen_US
dc.titleGenerative Adversarial Network Based Visual Saliency Prediction with Cascaded Hierarchical Atrous Spatial Pyramid Poolingen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Daniel Dufera.pdf
Size:
2.57 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description:

Collections