ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Sketch-SNet: Deeper Subdivision of Temporal Cues for Sketch Recognition

Yizhou Tan, Lan Yang, Honggang Zhang

Auto-TLDR; Sketch Recognition using Invariable Structural Feature and Drawing Habits Feature

Sketch recognition is a central task in sketch-related research. Unlike natural images, sketches have a sparse pixel distribution that destroys visual texture, which encourages researchers to explore the temporal information of sketches. With the release of million-scale datasets, we explore the invariable structure of a sketch and the specific order of its strokes. Prior works based on Recurrent Neural Networks (RNNs) tend to output different features when the stroke order changes. In contrast, we adopt a novel method that employs a Graph Convolutional Network (GCN) to extract an invariable structural feature under any stroke order. Compared with the traditional treatment of sketches, we further split the temporal information into two types of feature, the invariable structural feature (ISF) and the drawing habits feature (DHF), to reduce the confusion in temporal information. We propose a two-branch GCN-RNN network, termed Sketch-SNet, to extract the two types of feature respectively. The GCN branch is encouraged to extract the ISF by receiving variously shuffled strokes of an input sketch. The RNN branch takes the original input and extracts the DHF by learning the pattern of stroke order. In addition, we introduce semantic information to generate soft labels, owing to the high abstractness of sketches. Extensive experiments on the Quick-Draw dataset demonstrate that this further subdivision of temporal information improves sketch recognition performance, surpassing the state of the art by a large margin.
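
The difference between order-invariant and order-dependent cues described in the abstract can be illustrated with a toy sketch. This is not the authors' Sketch-SNet: mean pooling over per-stroke features merely stands in for an order-invariant (GCN-like) branch, and a one-line tanh recurrence stands in for an order-sensitive (RNN-like) branch. The function names, the `decay` parameter, and the example stroke features are all assumptions made for illustration.

```python
import math


def order_invariant_summary(strokes):
    """Mean-pool per-stroke feature vectors (toy stand-in for the ISF branch).

    math.fsum returns a correctly rounded sum, so the result does not
    depend on the order in which the strokes are listed.
    """
    n = len(strokes)
    dim = len(strokes[0])
    return [math.fsum(s[d] for s in strokes) / n for d in range(dim)]


def order_sensitive_summary(strokes, decay=0.5):
    """Toy recurrence h <- tanh(decay * h + x) (stand-in for the DHF branch).

    Each step mixes the previous state with the current stroke, so the
    final state depends on the order in which strokes are presented.
    """
    h = [0.0] * len(strokes[0])
    for s in strokes:
        h = [math.tanh(decay * hi + xi) for hi, xi in zip(h, s)]
    return h


# Hypothetical 2-D features for three strokes, and the same strokes reversed.
strokes = [[1.0, 0.0], [0.0, 2.0], [3.0, 1.0]]
shuffled = list(reversed(strokes))

print(order_invariant_summary(strokes) == order_invariant_summary(shuffled))
print(order_sensitive_summary(strokes) == order_sensitive_summary(shuffled))
```

Under these assumptions, the pooled summary is identical for both stroke orders while the recurrent summary differs, which is the property the two-branch design relies on.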

Similar papers

Label Incorporated Graph Neural Networks for Text Classification

Yuan Xin, Linli Xu, Junliang Guo, Jiquan Li, Xin Sheng, Yuanyuan Zhou

Auto-TLDR; Graph Neural Networks for Semi-supervised Text Classification

Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network

GCNs-Based Context-Aware Short Text Similarity Model

PICK: Processing Key Information Extraction from Documents Using Improved Graph Learning-Convolutional Networks

More Correlations Better Performance: Fully Associative Networks for Multi-Label Image Classification

GuCNet: A Guided Clustering-Based Network for Improved Classification

Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting

A Two-Stream Recurrent Network for Skeleton-Based Human Interaction Recognition

Adaptive Word Embedding Module for Semantic Reasoning in Large-Scale Detection

VSB^2-Net: Visual-Semantic Bi-Branch Network for Zero-Shot Hashing

Reinforcement Learning with Dual Attention Guided Graph Convolution for Relation Extraction

Open Set Domain Recognition Via Attention-Based GCN and Semantic Matching Optimization

Boundary-Aware Graph Convolution for Semantic Segmentation

Siamese Graph Convolution Network for Face Sketch Recognition

Semantic Bilinear Pooling for Fine-Grained Recognition

Channel-Wise Dense Connection Graph Convolutional Network for Skeleton-Based Action Recognition

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

Exploiting Knowledge Embedded Soft Labels for Image Recognition

Visual Oriented Encoder: Integrating Multimodal and Multi-Scale Contexts for Video Captioning

Kernel-based Graph Convolutional Networks

Context for Object Detection Via Lightweight Global and Mid-Level Representations

Object Detection Using Dual Graph Network

3D Attention Mechanism for Fine-Grained Classification of Table Tennis Strokes Using a Twin Spatio-Temporal Convolutional Neural Networks

What Nodes Vote To? Graph Classification without Readout Phase

Learning Connectivity with Graph Convolutional Networks

Let's Play Music: Audio-Driven Performance Video Generation

Automatic Student Network Search for Knowledge Distillation

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

Equation Attention Relationship Network (EARN) : A Geometric Deep Metric Framework for Learning Similar Math Expression Embedding

Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition

Cross-Lingual Text Image Recognition Via Multi-Task Sequence to Sequence Learning

Road Network Metric Learning for Estimated Time of Arrival

TreeRNN: Topology-Preserving Deep Graph Embedding and Learning

A Prototype-Based Generalized Zero-Shot Learning Framework for Hand Gesture Recognition

Multi-Label Contrastive Focal Loss for Pedestrian Attribute Recognition

Extracting Action Hierarchies from Action Labels and their Use in Deep Action Recognition

Recognizing Bengali Word Images - A Zero-Shot Learning Perspective

PIN: A Novel Parallel Interactive Network for Spoken Language Understanding

Cross-Media Hash Retrieval Using Multi-head Attention Network

Audio-Visual Speech Recognition Using a Two-Step Feature Fusion Strategy

Progressive Scene Segmentation Based on Self-Attention Mechanism

Semantics to Space(S2S): Embedding Semantics into Spatial Space for Zero-Shot Verb-Object Query Inferencing

Prior Knowledge about Attributes: Learning a More Effective Potential Space for Zero-Shot Recognition

Attentive Part-Aware Networks for Partial Person Re-Identification

Stroke Based Posterior Attention for Online Handwritten Mathematical Expression Recognition

You Ought to Look Around: Precise, Large Span Action Detection

TAAN: Task-Aware Attention Network for Few-Shot Classification

Deep Convolutional Embedding for Digitized Painting Clustering