ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Human Segmentation with Dynamic LiDAR Data

Tao Zhong, Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi

Auto-TLDR; Spatiotemporal Neural Network for Human Segmentation with Dynamic Point Clouds

Abstract Slides

Consecutive LiDAR scans and depth images compose dynamic 3D sequences, which contain more abundant spatiotemporal information than a single frame. Similar to the development history of image and video perception, dynamic 3D sequence perception starts to come into sight after inspiring research on static 3D data perception. This work proposes a spatiotemporal neural network for human segmentation with the dynamic LiDAR point clouds. It takes a sequence of depth images as input. It has a two-branch structure, i.e., the spatial segmentation branch and the temporal velocity estimation branch. The velocity estimation branch is designed to capture motion cues from the input sequence and then propagates them to the other branch. So that the segmentation branch segments humans according to both spatial and temporal features. These two branches are jointly learned on a generated dynamic point cloud data set for human recognition. Our works fill in the blank of dynamic point cloud perception with the spherical representation of point cloud and achieves high accuracy. The experiments indicate that the introduction of temporal feature benefits the segmentation of dynamic point cloud perception.

Similar papers

PC-Net: A Deep Network for 3D Point Clouds Analysis

Zhuo Chen, Tao Guan, Yawei Luo, Yuesong Wang

Auto-TLDR; PC-Net: A Hierarchical Neural Network for 3D Point Clouds Analysis

Human Segmentation with Dynamic LiDAR Data

Similar papers

PC-Net: A Deep Network for 3D Point Clouds Analysis

Video Semantic Segmentation Using Deep Multi-View Representation Learning

Ghost Target Detection in 3D Radar Data Using Point Cloud Based Deep Neural Network

Early Wildfire Smoke Detection in Videos

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

Sensor-Independent Pedestrian Detection for Personal Mobility Vehicles in Walking Space Using Dataset Generated by Simulation

Progressive Scene Segmentation Based on Self-Attention Mechanism

Object Segmentation Tracking from Generic Video Cues

Residual Learning of Video Frame Interpolation Using Convolutional LSTM

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

ACCLVOS: Atrous Convolution with Spatial-Temporal ConvLSTM for Video Object Segmentation

MixedFusion: 6D Object Pose Estimation from Decoupled RGB-Depth Features

Cross-Regional Attention Network for Point Cloud Completion

PointDrop: Improving Object Detection from Sparse Point Clouds Via Adversarial Data Augmentation

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

Revisiting Sequence-To-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory

Motion U-Net: Multi-Cue Encoder-Decoder Network for Motion Segmentation

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

PointSpherical: Deep Shape Context for Point Cloud Learning in Spherical Coordinates

Learning Object Deformation and Motion Adaption for Semi-Supervised Video Object Segmentation

FatNet: A Feature-Attentive Network for 3D Point Cloud Processing

Holistic Grid Fusion Based Stop Line Estimation

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation

CARRADA Dataset: Camera and Automotive Radar with Range-Angle-Doppler Annotations

PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

PHNet: Parasite-Host Network for Video Crowd Counting

Self-Supervised Detection and Pose Estimation of Logistical Objects in 3D Sensor Data

Attention Based Coupled Framework for Road and Pothole Segmentation

Manual-Label Free 3D Detection Via an Open-Source Simulator

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

Deep Space Probing for Point Cloud Analysis

A Lightweight Network to Learn Optical Flow from Event Data

MFI: Multi-Range Feature Interchange for Video Action Recognition

Towards Efficient 3D Point Cloud Scene Completion Via Novel Depth View Synthesis

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

Incorporating Depth Information into Few-Shot Semantic Segmentation

S-VoteNet: Deep Hough Voting with Spherical Proposal for 3D Object Detection

Video Object Detection Using Object's Motion Context and Spatio-Temporal Feature Aggregation

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

What and How? Jointly Forecasting Human Action and Pose

Movement-Induced Priors for Deep Stereo

HPERL: 3D Human Pose Estimastion from RGB and LiDAR

A Grid-Based Representation for Human Action Recognition

Vehicle Classification from Profile Measures

Delivering Meaningful Representation for Monocular Depth Estimation