ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

A Boundary-Aware Distillation Network for Compressed Video Semantic Segmentation

Hongchao Lu

Auto-TLDR; A Boundary-Aware Distillation Network for Video Semantic Segmentation

Abstract Slides Poster

In recent years optical flow is often estimated to reuse features so as to accelerate video semantic segmentation. With addition of optical flow network, however, extra cost may incur and accuracy may thus be degraded because of repeated warping operation. In this paper, we propose a boundary-aware distillation network (BDNet) that replaces optical flow network with block motion vectors encoded in compressed video, resulting in negligible computational complexity. In order to make salient features, an auxiliary boundary-aware stream is added to the main stream to jointly estimate silhouette and segmentation of objects. To further correct warped features, a well-trained teacher network is employed to transfer knowledge to the main stream. Both boundary-aware stream and the teacher network are neglected during inference stage, so that video segmentation network enables to get faster without increasing any computational burden. By splitting the task into three components, our BDNet shows almost 10% time saving as well as 1.6% accuracy improvement over baseline on the Cityscapes dataset.

Similar papers

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Zhuoying Wang, Yongtao Wang, Zhi Tang, Yangyan Li, Ying Chen, Haibin Ling, Weisi Lin

Auto-TLDR; Gated Scale-Transfer Operation for Semantic Segmentation

A Boundary-Aware Distillation Network for Compressed Video Semantic Segmentation

Similar papers

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation

Boundary-Aware Graph Convolution for Semantic Segmentation

FastSal: A Computationally Efficient Network for Visual Saliency Prediction

Global-Local Attention Network for Semantic Segmentation in Aerial Images

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation

Feature-Supervised Action Modality Transfer

Semantic Segmentation Refinement Using Entropy and Boundary-guided Monte Carlo Sampling and Directed Regional Search

Real-Time Semantic Segmentation Via Region and Pixel Context Network

Exploiting Distilled Learning for Deep Siamese Tracking

Enhanced Feature Pyramid Network for Semantic Segmentation

Knowledge Distillation with a Precise Teacher and Prediction with Abstention

CASNet: Common Attribute Support Network for Image Instance and Panoptic Segmentation

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

PSDNet: A Balanced Architecture of Accuracy and Parameters for Semantic Segmentation

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

Automatic Student Network Search for Knowledge Distillation

Unsupervised Domain Adaptation with Multiple Domain Discriminators and Adaptive Self-Training

Joint Semantic-Instance Segmentation of 3D Point Clouds: Instance Separation and Semantic Fusion

Feature Fusion for Online Mutual Knowledge Distillation

Do Not Treat Boundaries and Regions Differently: An Example on Heart Left Atrial Segmentation

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Object Segmentation Tracking from Generic Video Cues

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Teacher-Student Training and Triplet Loss for Facial Expression Recognition under Occlusion

Towards Low-Bit Quantization of Deep Neural Networks with Limited Data

Compact CNN Structure Learning by Knowledge Distillation

Temporal Feature Enhancement Network with External Memory for Object Detection in Surveillance Video

Towards Practical Compressed Video Action Recognition: A Temporal Enhanced Multi-Stream Network

Distilling Spikes: Knowledge Distillation in Spiking Neural Networks

Utilising Visual Attention Cues for Vehicle Detection and Tracking

Channel Planting for Deep Neural Networks Using Knowledge Distillation

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Cross-Domain Semantic Segmentation of Urban Scenes Via Multi-Level Feature Alignment

MFI: Multi-Range Feature Interchange for Video Action Recognition

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

Multi-Order Feature Statistical Model for Fine-Grained Visual Categorization

Triplet-Path Dilated Network for Detection and Segmentation of General Pathological Images

Multi-Direction Convolution for Semantic Segmentation

Knowledge Distillation Beyond Model Compression

Human Segmentation with Dynamic LiDAR Data

Efficient Online Subclass Knowledge Distillation for Image Classification

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

CAggNet: Crossing Aggregation Network for Medical Image Segmentation

FOANet: A Focus of Attention Network with Application to Myocardium Segmentation