ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Convolutional STN for Weakly Supervised Object Localization

Akhil Meethal, Marco Pedersoli, Soufiane Belharbi, Eric Granger

Auto-TLDR; Spatial Localization for Weakly Supervised Object Localization

Abstract Slides

Weakly-supervised object localization is a challenging task in which the object of interest should be localized while learning its appearance. State-of-the-art methods recycle the architecture of a standard CNN by using the activation maps of the last layer for localizing the object. While this approach is simple and works relatively well, object localization relies on different features than classification, thus, a specialized localization mechanism is required during training to improve performance. In this paper, we propose a convolutional, multi-scale spatial localization network that provides accurate localization for the object of interest. Experimental results on CUB-200-2011 and ImageNet datasets show competitive performance of our proposed approach on Weakly supervised localization.

Similar papers

Attention-Based Selection Strategy for Weakly Supervised Object Localization

Zhenfei Zhang

Auto-TLDR; An Attention-based Selection Strategy for Weakly Supervised Object Localization

Convolutional STN for Weakly Supervised Object Localization

Similar papers

Attention-Based Selection Strategy for Weakly Supervised Object Localization

Dual-Attention Guided Dropblock Module for Weakly Supervised Object Localization

PRF-Ped: Multi-Scale Pedestrian Detector with Prior-Based Receptive Field

Detecting Objects with High Object Region Percentage

Understanding When Spatial Transformer Networks Do Not Support Invariance, and What to Do about It

Detective: An Attentive Recurrent Model for Sparse Object Detection

Bidirectional Matrix Feature Pyramid Network for Object Detection

Point In: Counting Trees with Weakly Supervised Segmentation Network

Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Scene Text Detection with Selected Anchors

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

SyNet: An Ensemble Network for Object Detection in UAV Images

SFPN: Semantic Feature Pyramid Network for Object Detection

CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

Construction Worker Hardhat-Wearing Detection Based on an Improved BiFPN

Attention Pyramid Module for Scene Recognition

Aggregating Object Features Based on Attention Weights for Fine-Grained Image Retrieval

Nighttime Pedestrian Detection Based on Feature Attention and Transformation

Few-Shot Few-Shot Learning and the Role of Spatial Attention

Cascade Saliency Attention Network for Object Detection in Remote Sensing Images

A Novel Region of Interest Extraction Layer for Instance Segmentation

CenterRepp: Predict Central Representative Point Set's Distribution for Detection

VTT: Long-Term Visual Tracking with Transformers

Hierarchical Head Design for Object Detectors

Tiny Object Detection in Aerial Images

Activity and Relationship Modeling Driven Weakly Supervised Object Detection

EAGLE: Large-Scale Vehicle Detection Dataset in Real-World Scenarios Using Aerial Imagery

Which Airline Is This? Airline Logo Detection in Real-World Weather Conditions

HPERL: 3D Human Pose Estimastion from RGB and LiDAR

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Foreground-Focused Domain Adaption for Object Detection

Small Object Detection by Generative and Discriminative Learning

Efficient-Receptive Field Block with Group Spatial Attention Mechanism for Object Detection

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

Local Attention and Global Representation Collaborating for Fine-Grained Classification

Zoom-CAM: Generating Fine-Grained Pixel Annotations from Image Labels

Text Recognition - Real World Data and Where to Find Them

Object Detection Model Based on Scene-Level Region Proposal Self-Attention

Forground-Guided Vehicle Perception Framework

Superpixel-Based Refinement for Object Proposal Generation

ScarfNet: Multi-Scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection

Robust Localization of Retinal Lesions Via Weakly-Supervised Learning

StrongPose: Bottom-up and Strong Keypoint Heat Map Based Pose Estimation

HANet: Hybrid Attention-Aware Network for Crowd Counting

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

A Modified Single-Shot Multibox Detector for Beyond Real-Time Object Detection

One-Stage Multi-Task Detector for 3D Cardiac MR Imaging

FeatureNMS: Non-Maximum Suppression by Learning Feature Embeddings