ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Modeling Extent-Of-Texture Information for Ground Terrain Recognition

Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

Auto-TLDR; Extent-of-Texture Guided Inter-domain Message Passing for Ground Terrain Recognition

Abstract Slides Poster

Ground Terrain Recognition is a difficult task as the context information varies significantly over the regions of a ground terrain image. In this paper, we propose a novel approach towards ground-terrain recognition via modeling the Extent-of-Texture information to establish a balance between the order-less texture component and ordered-spatial information locally. At first, the proposed method uses a CNN backbone feature extractor network to capture meaningful information of a ground terrain image, and model the extent of texture and shape information locally. Then, the order-less texture information and ordered shape information are encoded in a patch-wise manner, which is utilized by intra-domain message passing module to make every patch aware of each other for rich feature learning. Next, the Extent-of-Texture (EoT) Guided Inter-domain Message Passing module combines the extent of texture and shape information with the encoded texture and shape information in a patch-wise fashion for sharing knowledge to balance out the order-less texture information with ordered shape information. Further, Bilinear model generates a pairwise correlation between the order-less texture information and ordered shape information. Finally, the ground-terrain image classification is performed by a fully connected layer. The experimental results indicate superior performance of the proposed model over existing state-of-the-art techniques on publicly available datasets like DTD, MINC and GTOS-mobile.

Similar papers

Classification of Intestinal Gland Cell-Graphs Using Graph Neural Networks

Linda Studer, Jannis Wallau, Heather Dawson, Inti Zlobec, Andreas Fischer

Auto-TLDR; Graph Neural Networks for Classification of Dysplastic Gland Glands using Graph Neural Networks

Modeling Extent-Of-Texture Information for Ground Terrain Recognition

Similar papers

Classification of Intestinal Gland Cell-Graphs Using Graph Neural Networks

Region and Relations Based Multi Attention Network for Graph Classification

Surface IR Reflectance Estimation and Material Recognition Using ToF Camera

Privacy Attributes-Aware Message Passing Neural Network for Visual Privacy Attributes Classification

Ordinal Depth Classification Using Region-Based Self-Attention

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Two-Level Attention-Based Fusion Learning for RGB-D Face Recognition

What Nodes Vote To? Graph Classification without Readout Phase

A Multi-Head Self-Relation Network for Scene Text Recognition

Single-Modal Incremental Terrain Clustering from Self-Supervised Audio-Visual Feature Learning

Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation

Attention Pyramid Module for Scene Recognition

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

Unconstrained Vision Guided UAV Based Safe Helicopter Landing

Stratified Multi-Task Learning for Robust Spotting of Scene Texts

An Improved Bilinear Pooling Method for Image-Based Action Recognition

A Novel Deep-Learning Pipeline for Light Field Image Based Material Recognition

TAAN: Task-Aware Attention Network for Few-Shot Classification

Human-Centric Parsing Network for Human-Object Interaction Detection

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

Self and Channel Attention Network for Person Re-Identification

MFI: Multi-Range Feature Interchange for Video Action Recognition

Using Scene Graphs for Detecting Visual Relationships

ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection

UDBNET: Unsupervised Document Binarization Network Via Adversarial Game

Attention-Driven Body Pose Encoding for Human Activity Recognition

Attention Based Coupled Framework for Road and Pothole Segmentation

Cross-Media Hash Retrieval Using Multi-head Attention Network

Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

Self-Supervised Learning with Graph Neural Networks for Region of Interest Retrieval in Histopathology

A Grid-Based Representation for Human Action Recognition

Free-Form Image Inpainting Via Contrastive Attention Network

PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation

Context Matters: Self-Attention for Sign Language Recognition

DmifNet:3D Shape Reconstruction Based on Dynamic Multi-Branch Information Fusion

Context for Object Detection Via Lightweight Global and Mid-Level Representations

Joint Learning Multiple Curvature Descriptor for 3D Palmprint Recognition

Force Banner for the Recognition of Spatial Relations

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

SAT-Net: Self-Attention and Temporal Fusion for Facial Action Unit Detection

Global Context-Based Network with Transformer for Image2latex

Domain Siamese CNNs for Sparse Multispectral Disparity Estimation

Aggregating Object Features Based on Attention Weights for Fine-Grained Image Retrieval

DA-RefineNet: Dual-Inputs Attention RefineNet for Whole Slide Image Segmentation

Automated Whiteboard Lecture Video Summarization by Content Region Detection and Representation

Edge-Aware Graph Attention Network for Ratio of Edge-User Estimation in Mobile Networks