ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

Ihsan Ullah, Sean Reilly, Michael Madden

Auto-TLDR; Lateral Inhibition in Deep Neural Networks for Object Recognition and Semantic Segmentation

In a Convolutional Neural Network, each neuron in the output feature map takes input from the neurons in its receptive field. This receptive field concept plays a vital role in today's deep neural networks. However, inspired by neuro-biological research, it has been proposed to add inhibitory neurons outside the receptive field, which may enhance the performance of neural network models. In this paper, we begin with deep network architectures such as VGG and ResNet, and propose an approach that adds lateral inhibition to each output neuron to reduce its impact on its neighbours, both when fine-tuning pre-trained models and when training from scratch. Our experiments show notable improvements over prior baseline deep models. A key feature of our approach is that it is easy to add to baseline models: it can be adopted in any model containing convolution layers. We demonstrate its value in applications including object recognition and semantic segmentation of aerial images, where we show state-of-the-art results on the Aeroscape dataset. On semantic segmentation tasks, our enhancement shows 17.43% higher mIoU than a single baseline model on a single source (the Aeroscape dataset), 13.43% higher performance than an ensemble model on the same single source, and 7.03% higher than an ensemble model on multiple sources (segmentation datasets). Our experiments illustrate the potential impact of using inhibitory neurons in deep learning models, and they also show better results than baseline models that use standard convolutional layers.
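
The abstract reports its segmentation gains in mIoU (mean intersection-over-union). As a reminder of what that metric measures, here is a minimal sketch of one common way to compute it over flattened label maps. The function name `mean_iou`, the toy labels, and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration; the paper's exact evaluation protocol is not stated on this page and may differ.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes.

    pred and target are equal-length sequences of integer class labels
    (a segmentation map flattened to one label per pixel).
    Classes that appear in neither sequence are skipped (an assumption;
    other conventions exist).
    """
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six pixels, three classes (hypothetical labels)
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(score)
```

In this toy case the per-class IoU values are 1/3, 2/3, and 1/2, so the mean is 0.5 up to floating-point rounding.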

Similar papers

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Leonel Rosas-Arias, Gibran Benitez-Garcia, Jose Portillo-Portillo, Gabriel Sanchez-Perez, Keiji Yanai

Auto-TLDR; FASSD-Net: Dilated Asymmetric Pyramidal Fusion for Real-Time Semantic Segmentation

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Boundary-Aware Graph Convolution for Semantic Segmentation

Multi-Direction Convolution for Semantic Segmentation

CQNN: Convolutional Quadratic Neural Networks

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Multiple Document Datasets Pre-Training Improves Text Line Detection with Deep Neural Networks

Enhanced Feature Pyramid Network for Semantic Segmentation

Automatic Semantic Segmentation of Structural Elements related to the Spinal Cord in the Lumbar Region by Using Convolutional Neural Networks

WeightAlign: Normalizing Activations by Weight Alignment

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Semantic Segmentation Refinement Using Entropy and Boundary-guided Monte Carlo Sampling and Directed Regional Search

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Attention Based Coupled Framework for Road and Pothole Segmentation

Rethinking of Deep Models Parameters with Respect to Data Distribution

EdgeNet: Semantic Scene Completion from a Single RGB-D Image

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Planar 3D Transfer Learning for End to End Unimodal MRI Unbalanced Data Segmentation

Multiscale Attention-Based Prototypical Network for Few-Shot Semantic Segmentation

Early Wildfire Smoke Detection in Videos

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels

Improving Batch Normalization with Skewness Reduction for Deep Neural Networks

FatNet: A Feature-Attentive Network for 3D Point Cloud Processing

RescueNet: Joint Building Segmentation and Damage Assessment from Satellite Imagery

Fine-Tuning Convolutional Neural Networks: A Comprehensive Guide and Benchmark Analysis for Glaucoma Screening

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Dual Encoder Fusion U-Net (DEFU-Net) for Cross-manufacturer Chest X-Ray Segmentation

Progressive Gradient Pruning for Classification, Detection and Domain Adaptation

Not All Domains Are Equally Complex: Adaptive Multi-Domain Learning

Building Computationally Efficient and Well-Generalizing Person Re-Identification Models with Metric Learning

Bridging the Gap between Natural and Medical Images through Deep Colorization

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

A Close Look at Deep Learning with Small Data

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values

Segmenting Kidney on Multiple Phase CT Images Using ULBNet

Cross-Domain Semantic Segmentation of Urban Scenes Via Multi-Level Feature Alignment

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Semantic Object Segmentation in Cultural Sites Using Real and Synthetic Data

Neuron-Based Network Pruning Based on Majority Voting

Unsupervised Domain Adaptation with Multiple Domain Discriminators and Adaptive Self-Training

Improved Residual Networks for Image and Video Recognition

CT-UNet: An Improved Neural Network Based on U-Net for Building Segmentation in Remote Sensing Images

ESResNet: Environmental Sound Classification Based on Visual Domain Models

DE-Net: Dilated Encoder Network for Automated Tongue Segmentation