ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Saliency Prediction on Omnidirectional Images with Brain-Like Shallow Neural Network

Zhu Dandan, Chen Yongqing, Min Xiongkuo, Zhao Defang, Zhu Yucheng, Zhou Qiangqiang, Yang Xiaokang, Tian Han

Auto-TLDR; A Brain-like Neural Network for Saliency Prediction of Head Fixations on Omnidirectional Images

Deep feedforward convolutional neural networks (CNNs) perform well in the saliency prediction of omnidirectional images (ODIs), and have become the leading class of candidate models of the visual processing mechanism in the primate ventral stream. These CNNs have evolved from shallow architectures to extremely deep and branching ones to achieve superb performance in various vision tasks, yet it is unclear how brain-like they are. In particular, these deep feedforward CNNs are difficult to map onto the ventral stream structure of the brain's visual system because of their vast number of layers and their lack of biologically important connections, such as recurrence. To tackle this issue, some brain-like shallow neural networks have been introduced. In this paper, we propose a novel brain-like network model for saliency prediction of head fixations on ODIs. Specifically, our proposed model consists of three modules: a CORnet-S module, a template feature extraction module and a ranking attention module (RAM). The CORnet-S module is a lightweight artificial neural network (ANN) with four anatomically mapped areas (V1, V2, V4 and IT), and it can simulate the visual processing mechanism of the ventral visual stream in the human brain. The template feature extraction module is introduced to extract attention maps of ODIs and provide guidance for the feature ranking in the subsequent RAM. The RAM is used to rank and select features that are important for fine-grained saliency prediction. Extensive experiments have validated the effectiveness of the proposed model in predicting saliency maps of ODIs, and the proposed model outperforms other state-of-the-art methods of similar scale.
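
The abstract does not say how the RAM scores or selects features, so the following is only an illustrative sketch of the general "rank, then select" idea under stated assumptions, not the authors' method. It assumes each feature channel is a small 2D map, that the template module yields a single attention map of the same size, and that a channel's relevance is its element-wise overlap with that map. The names `channel_score` and `rank_and_select` and the parameter `k` are hypothetical.

```python
from typing import List, Tuple

Map2D = List[List[float]]


def channel_score(feature: Map2D, template: Map2D) -> float:
    """Assumed relevance score: sum of element-wise products of one
    feature channel and the template attention map."""
    return sum(
        f * t
        for f_row, t_row in zip(feature, template)
        for f, t in zip(f_row, t_row)
    )


def rank_and_select(
    features: List[Map2D], template: Map2D, k: int
) -> Tuple[List[int], List[Map2D]]:
    """Rank channels by descending score and keep the top k.

    Returns the kept channel indices (best first) and the kept channels.
    """
    scores = [channel_score(ch, template) for ch in features]
    order = sorted(range(len(features)), key=lambda i: scores[i], reverse=True)
    kept = order[:k]
    return kept, [features[i] for i in kept]


if __name__ == "__main__":
    # Toy data: three 2x2 channels and a template that attends to the top-left cell.
    template = [[1.0, 0.0], [0.0, 0.0]]
    features = [
        [[0.1, 0.9], [0.9, 0.9]],
        [[0.8, 0.0], [0.0, 0.0]],
        [[0.5, 0.5], [0.5, 0.5]],
    ]
    kept, _ = rank_and_select(features, template, k=2)
    print(kept)
```

In a trained network the scoring would presumably be learned rather than a fixed dot product; the sketch only shows how an attention map can guide which channels are retained.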

Similar papers

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Feiyan Hu, Kevin Mcguinness

Auto-TLDR; MobileNetV2: A Convolutional Neural Network for Saliency Prediction

Classifying Eye-Tracking Data Using Saliency Maps

TSMSAN: A Three-Stream Multi-Scale Attentive Network for Video Saliency Detection

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets

Translating Adult's Focus of Attention to Elderly's

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Progressive Scene Segmentation Based on Self-Attention Mechanism

Attention Pyramid Module for Scene Recognition

Utilising Visual Attention Cues for Vehicle Detection and Tracking

Collaborative Human Machine Attention Module for Character Recognition

RSAN: Residual Subtraction and Attention Network for Single Image Super-Resolution

From Early Biological Models to CNNs: Do They Look Where Humans Look?

Attention Stereo Matching Network

Aggregating Object Features Based on Attention Weights for Fine-Grained Image Retrieval

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Two-Level Attention-Based Fusion Learning for RGB-D Face Recognition

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Second-Order Attention Guided Convolutional Activations for Visual Recognition

Arbitrary Style Transfer with Parallel Self-Attention

HANet: Hybrid Attention-Aware Network for Crowd Counting

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

Directed Variational Cross-encoder Network for Few-Shot Multi-image Co-segmentation

Context-Aware Residual Module for Image Classification

Selective Kernel and Motion-Emphasized Loss Based Attention-Guided Network for HDR Imaging of Dynamic Scenes

Pose-Aware Multi-Feature Fusion Network for Driver Distraction Recognition

Self and Channel Attention Network for Person Re-Identification

Single Image Deblurring Using Bi-Attention Network

Free-Form Image Inpainting Via Contrastive Attention Network

Flow-Guided Spatial Attention Tracking for Egocentric Activity Recognition

CSpA-DN: Channel and Spatial Attention Dense Network for Fusing PET and MRI Images

BCAU-Net: A Novel Architecture with Binary Channel Attention Module for MRI Brain Segmentation

Multi-Scale and Attention Based ResNet for Heartbeat Classification

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Deep Residual Attention Network for Hyperspectral Image Reconstruction

Two-Stream Temporal Convolutional Network for Dynamic Facial Attractiveness Prediction

Attentive Part-Aware Networks for Partial Person Re-Identification

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Multi-Order Feature Statistical Model for Fine-Grained Visual Categorization

Spatial-Related and Scale-Aware Network for Crowd Counting

Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

MFI: Multi-Range Feature Interchange for Video Action Recognition

Adaptive Image Compression Using GAN Based Semantic-Perceptual Residual Compensation

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection