ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subject to the IEEE Copyright Policy. Any other use is strongly forbidden.

Distinctive 3D Local Deep Descriptors

Fabio Poiesi, Davide Boscaini

Auto-TLDR; DIPs: Local Deep Descriptors for Point Cloud Registration

We present a simple yet effective method for learning distinctive 3D local deep descriptors (DIPs) that can be used to register point clouds without requiring an initial alignment. Point cloud patches are extracted, canonicalised with respect to their estimated local reference frame and encoded into rotation-invariant compact descriptors by a PointNet-based deep neural network. DIPs can effectively generalise across different sensor modalities because they are learnt end-to-end from locally and randomly sampled points. Moreover, because DIPs encode only local geometric information, they are robust to clutter, occlusions and missing regions. We evaluate and compare DIPs against alternative hand-crafted and deep descriptors on several indoor and outdoor datasets reconstructed using different sensors. Results show that DIPs (i) achieve results comparable to the state of the art on RGB-D indoor scenes (3DMatch dataset), (ii) outperform the state of the art by a large margin on laser-scanner outdoor scenes (ETH dataset), and (iii) generalise to indoor scenes reconstructed with the Visual-SLAM system of Android ARCore.
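
The patch extraction and canonicalisation steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the PCA-based local reference frame, the sign-disambiguation rule, and the names `extract_patch`, `estimate_lrf` and `canonicalise` are assumptions made for illustration, and the PointNet-based encoder is omitted entirely. The sketch only shows why expressing a patch in its own local frame makes the encoder's input independent of the cloud's pose.

```python
import numpy as np


def extract_patch(points, center, radius, n_samples, rng):
    """Randomly sample n_samples points lying within radius of center.

    Assumes at least one point falls inside the sphere.
    """
    dists = np.linalg.norm(points - center, axis=1)
    idx = np.flatnonzero(dists <= radius)
    # Sample with replacement only if the neighbourhood is too small.
    chosen = rng.choice(idx, size=n_samples, replace=idx.size < n_samples)
    return points[chosen]


def estimate_lrf(patch, center):
    """Estimate a local reference frame by PCA (illustrative assumption).

    Returns a 3x3 matrix whose columns are the x, y, z axes.
    """
    centered = patch - center
    cov = centered.T @ centered / len(patch)
    _, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    z = eigvecs[:, 0]  # least-variance direction, roughly the surface normal
    x = eigvecs[:, 2]  # largest-variance direction
    # Eigenvectors are defined up to sign; fix each sign with a rule that
    # depends only on the geometry, so it is unaffected by rigid motion.
    if np.sum(centered @ z) < 0:
        z = -z
    if np.sum(centered @ x) < 0:
        x = -x
    y = np.cross(z, x)  # completes a right-handed frame
    return np.stack([x, y, z], axis=1)


def canonicalise(patch, center, radius):
    """Express the patch in its local frame, scaled by the patch radius."""
    lrf = estimate_lrf(patch, center)
    return (patch - center) @ lrf / radius
```

Under these assumptions, applying the same rigid transformation to both the patch and its centre leaves the output of `canonicalise` unchanged (up to floating-point error), which is the property a downstream encoder would rely on to produce rotation-invariant descriptors. The frame becomes unstable when two eigenvalues of the covariance are nearly equal or when the sign test is close to zero, so a practical estimator would need a more robust rule.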

Similar papers

A Plane-Based Approach for Indoor Point Clouds Registration

Ketty Favre, Muriel Pressigout, Luce Morin, Eric Marchand

Auto-TLDR; A plane-based registration approach for indoor environments based on LiDAR data

PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation

FatNet: A Feature-Attentive Network for 3D Point Cloud Processing

PointSpherical: Deep Shape Context for Point Cloud Learning in Spherical Coordinates

3D Point Cloud Registration Based on Cascaded Mutual Information Attention Network

Cross-Regional Attention Network for Point Cloud Completion

Joint Supervised and Self-Supervised Learning for 3D Real World Challenges

MixedFusion: 6D Object Pose Estimation from Decoupled RGB-Depth Features

Deep Space Probing for Point Cloud Analysis

RISEdb: A Novel Indoor Localization Dataset

PointDrop: Improving Object Detection from Sparse Point Clouds Via Adversarial Data Augmentation

PC-Net: A Deep Network for 3D Point Clouds Analysis

Directional Graph Networks with Hard Weight Assignments

Multi-Scale Keypoint Matching

A New Geodesic-Based Feature for Characterization of 3D Shapes: Application to Soft Tissue Organ Temporal Deformations

Vehicle Classification from Profile Measures

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

RefiNet: 3D Human Pose Refinement with Depth Maps

Self-Supervised Detection and Pose Estimation of Logistical Objects in 3D Sensor Data

Can You Trust Your Pose? Confidence Estimation in Visual Localization

Enhanced Vote Network for 3D Object Detection in Point Clouds

Progressive Scene Segmentation Based on Self-Attention Mechanism

Ghost Target Detection in 3D Radar Data Using Point Cloud Based Deep Neural Network

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

A Two-Step Approach to Lidar-Camera Calibration

Learning Interpretable Representation for 3D Point Clouds

MANet: Multimodal Attention Network Based Point-View Fusion for 3D Shape Recognition

S-VoteNet: Deep Hough Voting with Spherical Proposal for 3D Object Detection

3D Pots Configuration System by Optimizing Over Geometric Constraints

Sensor-Independent Pedestrian Detection for Personal Mobility Vehicles in Walking Space Using Dataset Generated by Simulation

Surface IR Reflectance Estimation and Material Recognition Using ToF Camera

Learning to Implicitly Represent 3D Human Body from Multi-Scale Features and Multi-View Images

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Attention-Based Deep Metric Learning for Near-Duplicate Video Retrieval

Domain Siamese CNNs for Sparse Multispectral Disparity Estimation

Do We Really Need Scene-Specific Pose Encoders?

Learning Knowledge-Rich Sequential Model for Planar Homography Estimation in Aerial Video

Generic Merging of Structure from Motion Maps with a Low Memory Footprint

Partially Supervised Multi-Task Network for Single-View Dietary Assessment

Hybrid Approach for 3D Head Reconstruction: Using Neural Networks and Visual Geometry

Deep Realistic Novel View Generation for City-Scale Aerial Images

Exploiting Local Indexing and Deep Feature Confidence Scores for Fast Image-To-Video Search

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

3D Facial Matching by Spiral Convolutional Metric Learning and a Biometric Fusion-Net of Demographic Properties

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

HPERL: 3D Human Pose Estimation from RGB and LiDAR

Loop-closure detection by LiDAR scan re-identification