ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Rotation Invariant Aerial Image Retrieval with Group Convolutional Metric Learning

Hyunseung Chung, Woo-Jeoung Nam, Seong-Whan Lee

Auto-TLDR; Robust Remote Sensing Image Retrieval Using Group Convolution with Attention Mechanism and Metric Learning

Abstract Slides Poster

Remote sensing image retrieval (RSIR) is the process of ranking database images depending on the degree of similarity compared to the query image. As the complexity of RSIR increases due to the diversity in shooting range, angle, and location of remote sensors, there is an increasing demand for methods to address these issues and improve retrieval performance. In this work, we introduce a novel method for retrieving aerial images by merging group convolution with attention mechanism and metric learning, resulting in robustness to rotational variations. For refinement and emphasis on important features, we applied channel attention in each group convolution stage. By utilizing the characteristics of group convolution and channel-wise attention, it is possible to acknowledge the equality among rotated but identically located images. The training procedure has two main steps: (i) training the network with Aerial Image Dataset (AID) for classification, (ii) fine-tuning the network with triplet-loss for retrieval with Google Earth South Korea and NWPU-RESISC45 datasets. Results show that the proposed method performance exceeds other state-of-the-art retrieval methods in both rotated and original environments. Furthermore, we utilize class activation maps (CAM) to visualize the distinct difference of main features between our method and baseline, resulting in better adaptability in rotated environments.

Similar papers

Attentive Part-Aware Networks for Partial Person Re-Identification

Lijuan Huo, Chunfeng Song, Zhengyi Liu, Zhaoxiang Zhang

Auto-TLDR; Part-Aware Learning for Partial Person Re-identification

Rotation Invariant Aerial Image Retrieval with Group Convolutional Metric Learning

Similar papers

Attentive Part-Aware Networks for Partial Person Re-Identification

Loop-closure detection by LiDAR scan re-identification

Generalized Local Attention Pooling for Deep Metric Learning

Self and Channel Attention Network for Person Re-Identification

Multi-Level Deep Learning Vehicle Re-Identification Using Ranked-Based Loss Functions

Aggregating Object Features Based on Attention Weights for Fine-Grained Image Retrieval

CT-UNet: An Improved Neural Network Based on U-Net for Building Segmentation in Remote Sensing Images

Pose-Robust Face Recognition by Deep Meta Capsule Network-Based Equivariant Embedding

Two-Level Attention-Based Fusion Learning for RGB-D Face Recognition

Supporting Skin Lesion Diagnosis with Content-Based Image Retrieval

Attention-Based Deep Metric Learning for Near-Duplicate Video Retrieval

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

Building Computationally Efficient and Well-Generalizing Person Re-Identification Models with Metric Learning

Top-DB-Net: Top DropBlock for Activation Enhancement in Person Re-Identification

DAIL: Dataset-Aware and Invariant Learning for Face Recognition

Audio-Based Near-Duplicate Video Retrieval with Audio Similarity Learning

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Multi-Label Contrastive Focal Loss for Pedestrian Attribute Recognition

The Color Out of Space: Learning Self-Supervised Representations for Earth Observation Imagery

Free-Form Image Inpainting Via Contrastive Attention Network

Nonlinear Ranking Loss on Riemannian Potato Embedding

Deep Top-Rank Counter Metric for Person Re-Identification

Attention-Based Selection Strategy for Weakly Supervised Object Localization

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

Adaptive L2 Regularization in Person Re-Identification

Siamese Graph Convolution Network for Face Sketch Recognition

Not 3D Re-ID: Simple Single Stream 2D Convolution for Robust Video Re-Identification

Attention Pyramid Module for Scene Recognition

An Improved Bilinear Pooling Method for Image-Based Action Recognition

Progressive Learning Algorithm for Efficient Person Re-Identification

SSDL: Self-Supervised Domain Learning for Improved Face Recognition

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

TAAN: Task-Aware Attention Network for Few-Shot Classification

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches

One-Shot Representational Learning for Joint Biometric and Device Authentication

Rethinking ReID：Multi-Feature Fusion Person Re-Identification Based on Orientation Constraints

Face Anti-Spoofing Using Spatial Pyramid Pooling

Large-Scale Historical Watermark Recognition: Dataset and a New Consistency-Based Approach

Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets

Attentive Hybrid Feature Based a Two-Step Fusion for Facial Expression Recognition

Self-Supervised Learning with Graph Neural Networks for Region of Interest Retrieval in Histopathology

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Augmented Bi-Path Network for Few-Shot Learning

DFH-GAN: A Deep Face Hashing with Generative Adversarial Network

Deeply-Fused Attentive Network for Stereo Matching

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

G-FAN: Graph-Based Feature Aggregation Network for Video Face Recognition