ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Multiscale Attention-Based Prototypical Network for Few-Shot Semantic Segmentation

Yifei Zhang, Desire Sidibe, Olivier Morel, Fabrice Meriaudeau

Auto-TLDR; Few-shot Semantic Segmentation with Multiscale Feature Attention

Abstract Slides

Deep learning-based image understanding techniques require a large number of labeled images for training. Few-shot semantic segmentation, on the contrary, aims at generalizing the segmentation ability of the model to new categories given only a few labeled samples. To tackle this problem, we propose a novel prototypical network (MAPnet) with multiscale feature attention. To fully exploit the representative features of target classes, we firstly extract rich contextual information of labeled support images via a multiscale feature enhancement module. The learned prototypes from support features provide further semantic guidance on the query image. Then we adaptively integrate multiple similarity-guided probability maps by attention mechanism, yielding an optimal pixel-wise prediction. Furthermore, the proposed method was validated on the PASCAL-5i dataset in terms of 1-way N-shot evaluation. We also test the model with weak annotations, including scribble and bounding box annotations. Both the qualitative and quantitative results demonstrate the advantages of our approach over other state-of-the-art methods.

Similar papers

Incorporating Depth Information into Few-Shot Semantic Segmentation

Yifei Zhang, Desire Sidibe, Olivier Morel, Fabrice Meriaudeau

Auto-TLDR; RDNet: A Deep Neural Network for Few-shot Segmentation Using Depth Information

Multiscale Attention-Based Prototypical Network for Few-Shot Semantic Segmentation

Similar papers

Incorporating Depth Information into Few-Shot Semantic Segmentation

Multi-Direction Convolution for Semantic Segmentation

Boundary-Aware Graph Convolution for Semantic Segmentation

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Enhanced Feature Pyramid Network for Semantic Segmentation

Semantic Segmentation Refinement Using Entropy and Boundary-guided Monte Carlo Sampling and Directed Regional Search

TAAN: Task-Aware Attention Network for Few-Shot Classification

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Local Propagation for Few-Shot Learning

Directed Variational Cross-encoder Network for Few-Shot Multi-image Co-segmentation

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

DARN: Deep Attentive Refinement Network for Liver Tumor Segmentation from 3D CT Volume

Explanation-Guided Training for Cross-Domain Few-Shot Classification

Few-Shot Few-Shot Learning and the Role of Spatial Attention

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Augmented Bi-Path Network for Few-Shot Learning

Video Semantic Segmentation Using Deep Multi-View Representation Learning

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

Automatic Semantic Segmentation of Structural Elements related to the Spinal Cord in the Lumbar Region by Using Convolutional Neural Networks

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation

Joint Semantic-Instance Segmentation of 3D Point Clouds: Instance Separation and Semantic Fusion

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Ordinal Depth Classification Using Region-Based Self-Attention

Mutually Guided Dual-Task Network for Scene Text Detection

Point In: Counting Trees with Weakly Supervised Segmentation Network

SFPN: Semantic Feature Pyramid Network for Object Detection

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Semantics to Space(S2S): Embedding Semantics into Spatial Space for Zero-Shot Verb-Object Query Inferencing

RescueNet: Joint Building Segmentation and Damage Assessment from Satellite Imagery

Attention Based Coupled Framework for Road and Pothole Segmentation

Dynamic Guided Network for Monocular Depth Estimation

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Few-Shot Learning Based on Metric Learning Using Class Augmentation

Forground-Guided Vehicle Perception Framework

FatNet: A Feature-Attentive Network for 3D Point Cloud Processing

CAggNet: Crossing Aggregation Network for Medical Image Segmentation

Meta Generalized Network for Few-Shot Classification

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

Graph-Based Interpolation of Feature Vectors for Accurate Few-Shot Classification

Complementing Representation Deficiency in Few-Shot Image Classification: A Meta-Learning Approach

DA-RefineNet: Dual-Inputs Attention RefineNet for Whole Slide Image Segmentation

Semantic Object Segmentation in Cultural Sites Using Real and Synthetic Data

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels