ICPR2020 Paper Browser

Manual-Label Free 3D Detection Via an Open-Source Simulator

Zhen Yang, Chi Zhang, Zhaoxiang Zhang, Huiming Guo

Auto-TLDR; DA-VoxelNet: A Novel Domain Adaptive VoxelNet for LIDAR-based 3D Object Detection

LiDAR-based 3D object detectors typically need a large amount of point cloud data with detailed labels for training, but such labels are expensive to acquire. In this paper, we propose a manual-label free 3D detection algorithm that leverages the CARLA simulator to generate a large number of self-labeled training samples and introduces a novel Domain Adaptive VoxelNet (DA-VoxelNet) that can bridge the distribution gap between synthetic data and real scenarios. The self-labeled training samples are generated by a set of high-quality 3D models embedded in the CARLA simulator and a proposed LiDAR-guided sampling algorithm. A DA-VoxelNet that integrates both a sample-level DA module and an anchor-level DA module is then proposed to enable a detector trained on synthetic data to adapt to real scenarios. Experimental results show that the proposed unsupervised DA 3D detector achieves 76.66% and 56.64% mAP on the KITTI evaluation set in BEV mode and 3D mode, respectively. The results suggest that training a LiDAR-based 3D detector without any hand-labeled data is a promising direction.
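
The abstract does not describe DA-VoxelNet's internals, so the following is only a rough sketch of the voxel grouping step that VoxelNet-style detectors generally start from, not the authors' code. The function name `voxelize`, the voxel size, and the per-voxel cap are all hypothetical choices made for illustration, and the cap here keeps the first points seen rather than sampling randomly.

```python
from collections import defaultdict
from math import floor


def voxelize(points, voxel_size, max_points_per_voxel=35):
    """Group (x, y, z) points by the integer index of the voxel that contains them.

    voxel_size is a (vx, vy, vz) tuple in the same units as the points.
    max_points_per_voxel is an illustrative cap (an assumption, not a value
    from the source); points beyond the cap are simply dropped.
    """
    vx, vy, vz = voxel_size
    voxels = defaultdict(list)
    for x, y, z in points:
        # floor() keeps negative coordinates in the correct voxel.
        key = (floor(x / vx), floor(y / vy), floor(z / vz))
        if len(voxels[key]) < max_points_per_voxel:
            voxels[key].append((x, y, z))
    return dict(voxels)


# Toy point cloud: two points share a voxel, two fall elsewhere.
points = [(0.1, 0.1, 0.1), (0.15, 0.12, 0.3), (1.3, 0.2, 0.1), (-0.1, 0.0, 0.0)]
grid = voxelize(points, voxel_size=(0.2, 0.2, 0.4))
```

A real pipeline would then encode each non-empty voxel into a feature vector; that step, and the two DA modules, are outside what this sketch covers.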

Similar papers

Foreground-Focused Domain Adaption for Object Detection

Yuchen Yang, Nilanjan Ray

Auto-TLDR; Unsupervised Domain Adaptation for Unsupervised Object Detection

Object detectors suffer from accuracy loss caused by domain shift from a source to a target domain. Unsupervised domain adaptation (UDA) approaches mitigate this loss by training with unlabeled target domain images. A popular processing pipeline applies adversarial training that aligns the distributions of the features from the two domains. We argue that aligning the full image-level features is not ideal for UDA object detection due to the presence of varied background areas during inference. Thus, we propose a novel foreground-focused domain adaptation (FFDA) framework which mines the loss of the domain discriminators to concentrate on the backpropagation of foreground loss. We obtain mining masks by collecting target predictions and source labels to outline foreground regions, and apply the masks to image-level and instance-level domain discriminators to allow backpropagation only on the mined regions. By reinforcing this foreground-focused adaptation throughout multiple layers in the detector model, we gain a significant accuracy boost on the target domain prediction. Compared to previous works, our method reaches new state-of-the-art accuracy on adapting Cityscapes to the Foggy Cityscapes dataset and demonstrates competitive accuracy on other datasets that include various scenarios for autonomous driving applications.
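
To make the masking idea above concrete, here is a minimal sketch of a domain-discriminator loss restricted to a foreground mask. It is not the FFDA implementation: the function name `masked_domain_loss`, the per-location probability map, and the choice of binary cross-entropy are assumptions for illustration. Only masked locations contribute to the loss, so in an autograd framework only those locations would receive gradient.

```python
from math import log


def masked_domain_loss(probs, domain_label, mask, eps=1e-7):
    """Binary cross-entropy of a per-location domain discriminator,
    averaged over the locations where mask is non-zero.

    probs: 2D list of predicted probabilities that a location is "target".
    domain_label: 1 for target-domain input, 0 for source-domain input.
    mask: 2D list of 0/1 flags marking (hypothetical) foreground locations.
    """
    total, count = 0.0, 0
    for row_p, row_m in zip(probs, mask):
        for p, m in zip(row_p, row_m):
            if m:
                p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
                total += -(domain_label * log(p)
                           + (1 - domain_label) * log(1.0 - p))
                count += 1
    # An empty mask contributes no loss.
    return total / count if count else 0.0


probs = [[0.5, 0.9], [0.1, 0.5]]
mask = [[1, 0], [0, 1]]  # only the two 0.5 entries are "foreground"
loss = masked_domain_loss(probs, domain_label=1, mask=mask)
```

With this toy input the background entries (0.9 and 0.1) are ignored, so the loss depends only on the two foreground locations.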

Cross-Domain Semantic Segmentation of Urban Scenes Via Multi-Level Feature Alignment

Bin Zhang, Shengjie Zhao, Rongqing Zhang

Auto-TLDR; Cross-Domain Semantic Segmentation Using Generative Adversarial Networks

Similar papers

Foreground-Focused Domain Adaption for Object Detection

Cross-Domain Semantic Segmentation of Urban Scenes Via Multi-Level Feature Alignment

PointDrop: Improving Object Detection from Sparse Point Clouds Via Adversarial Data Augmentation

Sensor-Independent Pedestrian Detection for Personal Mobility Vehicles in Walking Space Using Dataset Generated by Simulation

Yolo+FPN: 2D and 3D Fused Object Detection with an RGB-D Camera

HPERL: 3D Human Pose Estimation from RGB and LiDAR

Joint Supervised and Self-Supervised Learning for 3D Real World Challenges

Shape Consistent 2D Keypoint Estimation under Domain Shift

Unsupervised Domain Adaptation with Multiple Domain Discriminators and Adaptive Self-Training

Multiple Future Prediction Leveraging Synthetic Trajectories

Unsupervised Multi-Task Domain Adaptation

DAPC: Domain Adaptation People Counting Via Style-Level Transfer Learning and Scene-Aware Estimation

DeepBEV: A Conditional Adversarial Network for Bird’s Eye View Generation

Unsupervised Domain Adaptation for Object Detection in Cultural Sites

Vehicle Lane Merge Visual Benchmark

S-VoteNet: Deep Hough Voting with Spherical Proposal for 3D Object Detection

MagnifierNet: Learning Efficient Small-Scale Pedestrian Detector towards Multiple Dense Regions

Human Segmentation with Dynamic LiDAR Data

Self-Training for Domain Adaptive Scene Text Detection

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

Teacher-Student Competition for Unsupervised Domain Adaptation

A Unified Framework for Distance-Aware Domain Adaptation

Unsupervised Domain Adaptation for Person Re-Identification through Source-Guided Pseudo-Labeling

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving

Open Set Domain Recognition Via Attention-Based GCN and Semantic Matching Optimization

PointSpherical: Deep Shape Context for Point Cloud Learning in Spherical Coordinates

Domain Generalized Person Re-Identification Via Cross-Domain Episodic Learning

Ghost Target Detection in 3D Radar Data Using Point Cloud Based Deep Neural Network

Enhanced Vote Network for 3D Object Detection in Point Clouds

Forground-Guided Vehicle Perception Framework

Efficient Shadow Detection and Removal Using Synthetic Data with Domain Adaptation

Energy-Constrained Self-Training for Unsupervised Domain Adaptation

A Simple Domain Shifting Network for Generating Low Quality Images

Self-Supervised Detection and Pose Estimation of Logistical Objects in 3D Sensor Data

Enlarging Discriminative Power by Adding an Extra Class in Unsupervised Domain Adaptation

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Class Conditional Alignment for Partial Domain Adaptation

MixedFusion: 6D Object Pose Estimation from Decoupled RGB-Depth Features

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

Self-Supervised Domain Adaptation with Consistency Training

Cross-Regional Attention Network for Point Cloud Completion

Rethinking Domain Generalization Baselines

Semi-Supervised Domain Adaptation Via Selective Pseudo Labeling and Progressive Self-Training

Progressive Unsupervised Domain Adaptation for Image-Based Person Re-Identification

Learning Low-Shot Generative Networks for Cross-Domain Data

Supervised Domain Adaptation Using Graph Embedding

Quantifying the Use of Domain Randomization