ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Revisiting Optical Flow Estimation in 360 Videos

Keshav Bhandari, Ziliang Zong, Yan Yan

Auto-TLDR; LiteFlowNet360: A Domain Adaptation Framework for 360 Video Optical Flow Estimation

Abstract Slides

Nowadays 360 video analysis has become a significant research topic in the field since the appearance of high-quality and low-cost 360 wearable devices. In this paper, we propose a novel LiteFlowNet360 architecture for 360 videos optical flow estimation. We design LiteFlowNet360 as a domain adaptation framework from perspective video domain to 360 video domain. We adapt it from simple kernel transformation techniques inspired by Kernel Transformer Network (KTN) to cope with inherent distortion in 360 videos caused by the sphere-to-plane projection. First, we apply an incremental transformation of convolution layers in feature pyramid network and show that further transformation in inference and regularization layers are not important, hence reducing the network growth in terms of size and computation cost. Second, we refine the network by training with augmented data in a supervised manner. We perform data augmentation by projecting the images in a sphere and re-projecting to a plane. Third, we train LiteFlowNet360 in a self-supervised manner using target domain 360 videos. Experimental results show the promising results of 360 video optical flow estimation using the proposed novel architecture.

Similar papers

OmniFlowNet: A Perspective Neural Network Adaptation for Optical Flow Estimation in Omnidirectional Images

Charles-Olivier Artizzu, Haozhou Zhang, Guillaume Allibert, Cédric Demonceaux

Auto-TLDR; OmniFlowNet: A Convolutional Neural Network for Omnidirectional Optical Flow Estimation

Revisiting Optical Flow Estimation in 360 Videos

Similar papers

OmniFlowNet: A Perspective Neural Network Adaptation for Optical Flow Estimation in Omnidirectional Images

PA-FlowNet: Pose-Auxiliary Optical Flow Network for Spacecraft Relative Pose Estimation

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

A Lightweight Network to Learn Optical Flow from Event Data

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

Residual Learning of Video Frame Interpolation Using Convolutional LSTM

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Object Segmentation Tracking from Generic Video Cues

Siamese Fully Convolutional Tracker with Motion Correction

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Motion-Supervised Co-Part Segmentation

Towards Practical Compressed Video Action Recognition: A Temporal Enhanced Multi-Stream Network

Movement-Induced Priors for Deep Stereo

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Deep Homography-Based Video Stabilization

5D Light Field Synthesis from a Monocular Video

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

MFI: Multi-Range Feature Interchange for Video Action Recognition

Suppressing Features That Contain Disparity Edge for Stereo Matching

Estimating Gaze Points from Facial Landmarks by a Remote Spherical Camera

TinyVIRAT: Low-Resolution Video Action Recognition

Learning Group Activities from Skeletons without Individual Action Labels

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

Shape Consistent 2D Keypoint Estimation under Domain Shift

Better Prior Knowledge Improves Human-Pose-Based Extrinsic Camera Calibration

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Partially Supervised Multi-Task Network for Single-View Dietary Assessment

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Learning Stereo Matchability in Disparity Regression Networks

Motion Complementary Network for Efficient Action Recognition

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Learning Knowledge-Rich Sequential Model for Planar Homography Estimation in Aerial Video

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

RWF-2000: An Open Large Scale Video Database for Violence Detection

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

AttendAffectNet: Self-Attention Based Networks for Predicting Affective Responses from Movies

Future Urban Scenes Generation through Vehicles Synthesis

Semi-Supervised Deep Learning Techniques for Spectrum Reconstruction

Joint Face Alignment and 3D Face Reconstruction with Efficient Convolution Neural Networks

Human Segmentation with Dynamic LiDAR Data

Attention Stereo Matching Network

PointSpherical: Deep Shape Context for Point Cloud Learning in Spherical Coordinates

Not All Domains Are Equally Complex: Adaptive Multi-Domain Learning

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

Visual Saliency Oriented Vehicle Scale Estimation

Generalized Shortest Path-Based Superpixels for Accurate Segmentation of Spherical Images