ICPR2020 Paper Browser

Paper download is intended for registered attendees only and is subject to the IEEE Copyright Policy. Any other use is strictly forbidden.

Edge-Aware Monocular Dense Depth Estimation with Morphology

Zhi Li, Xiaoyang Zhu, Haitao Yu, Qi Zhang, Yongshi Jiang

Auto-TLDR; Spatio-Temporally Smooth Dense Depth Maps Using Only a CPU

Dense depth maps play an important role in computer vision (CV) and augmented reality (AR). In CV applications, a dense depth map is the cornerstone of 3D reconstruction, allowing real objects to be displayed precisely on a computer. In AR, dense depth maps make it possible to resolve occlusion relationships between virtual content and real objects correctly, which improves the user experience. However, the computational cost of dense depth estimation has limited its adoption. We present a novel algorithm that produces low-latency, spatio-temporally smooth dense depth maps using only a CPU. The depth maps exhibit sharp discontinuities at depth edges while keeping computational complexity low. Our algorithm first obtains a sparse SLAM reconstruction, then extracts coarse depth edges from a down-sampled RGB image using morphological operations. Next, we thin the depth edges and align them with image edges. Finally, a warm-start initialization scheme and an improved optimization solver are adopted to accelerate convergence. We evaluate our proposal quantitatively, and the results show improved depth map accuracy relative to other state-of-the-art and baseline techniques.
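
The abstract does not say which morphological operators the authors use, so the following is only a minimal sketch of one common choice, the morphological gradient (dilation minus erosion), applied to a toy grayscale image. The function names, the 3x3 window, and the threshold are illustrative assumptions, not the paper's method.

```python
def morphological_gradient(img, radius=1):
    """Dilation minus erosion over a (2*radius+1)^2 square window."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Clamp the window to the image bounds.
            window = [
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            out[y][x] = max(window) - min(window)
    return out


def coarse_edges(img, threshold):
    """Binary mask of pixels whose morphological gradient exceeds threshold."""
    grad = morphological_gradient(img)
    return [[1 if g > threshold else 0 for g in row] for row in grad]


# Toy 4x6 grayscale image (assumed data): dark on the left, bright on the right.
image = [[10, 10, 10, 200, 200, 200] for _ in range(4)]
mask = coarse_edges(image, threshold=50)
```

On this toy input the mask marks a band two pixels wide around the intensity step, which illustrates why a thinning step, as the abstract describes, would follow a coarse extraction like this one.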

Similar papers

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Yawen Lu, Yuxing Wang, Devarth Parikh, Guoyu Lu

Auto-TLDR; Self-supervised LIDAR for Low-Cost Depth Estimation

Depth estimation plays an important role in indoor and outdoor scene understanding, autonomous driving, augmented reality, and many other tasks. Vehicles and robots can use active illumination sensors such as LIDAR to obtain high-precision depth estimates. However, high-resolution LIDARs are usually too expensive, which limits their mass production for many applications. Although single-beam LIDAR has the benefit of low cost, one-beam depth sensing is often insufficient to perceive the surrounding environment. In this paper, we propose a learning-based framework that aims to match or even exceed the performance of costly LIDARs using our self-supervised network and a low-cost single-beam LIDAR. After accurate calibration with a visible camera, the single-beam LIDAR can resolve the scale uncertainty of the depth map estimated from the visible camera. The adjusted depth map offers the high resolution and sensing accuracy of a high-beam LIDAR while retaining the low cost of a single-beam LIDAR. We can thus achieve a sensing effect similar to that of a high-beam LIDAR at a price 50 to 100 times lower (e.g., a $80,000 Velodyne HDL-64E LIDAR vs. a $1,000 SICK TIM-781 2D LIDAR plus a normal camera). The proposed approach is verified on our collected dataset and a public dataset, showing superior depth-sensing performance.
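
The abstract says the single-beam LIDAR resolves the scale uncertainty of the camera-based depth map but gives no formula. As a rough illustration only, one simple way to do this is a global scale factor taken as the median ratio of LIDAR range to predicted depth along the scan line. The function names and the toy data below are assumptions, and this is not the paper's learned framework.

```python
from statistics import median


def scale_from_lidar(pred_depth, lidar_depth):
    """Median ratio of LIDAR range to predicted depth over valid samples.

    Both arguments are equal-length lists for the pixels hit by the beam;
    a LIDAR value of None marks a missing return.
    """
    ratios = [
        l / p
        for p, l in zip(pred_depth, lidar_depth)
        if l is not None and p > 0
    ]
    if not ratios:
        raise ValueError("no valid LIDAR samples to estimate scale")
    return median(ratios)


def rescale(depth_map, scale):
    """Multiply every predicted depth by the estimated scale factor."""
    return [[d * scale for d in row] for row in depth_map]


# Hypothetical data: the predicted depth is correct up to a factor of 2.
pred_map = [[1.0, 2.0, 3.0], [1.5, 2.5, 3.5]]
beam_pred = pred_map[1]        # pixels along the assumed LIDAR scan line
beam_lidar = [3.0, None, 7.0]  # metric ranges; one missing return
scale = scale_from_lidar(beam_pred, beam_lidar)
metric_map = rescale(pred_map, scale)
```

The median is used here rather than the mean so that a few bad returns along the beam do not skew the estimate.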

Dynamic Guided Network for Monocular Depth Estimation

Xiaoxia Xing, Yinghao Cai, Yiping Yang, Dayong Wen

Auto-TLDR; DGNet: Dynamic Guidance Upsampling for Self-attention-Decoding for Monocular Depth Estimation

Learning Non-Rigid Surface Reconstruction from Spatio-Temporal Image Patches

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

DEN: Disentangling and Exchanging Network for Depth Completion

Cost Volume Refinement for Depth Prediction

Towards Efficient 3D Point Cloud Scene Completion Via Novel Depth View Synthesis

Benchmarking Cameras for OpenVSLAM Indoors

Learning to Segment Dynamic Objects Using SLAM Outliers

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Facetwise Mesh Refinement for Multi-View Stereo

Dynamic Resource-Aware Corner Detection for Bio-Inspired Vision Sensors

Partially Supervised Multi-Task Network for Single-View Dietary Assessment

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

SECI-GAN: Semantic and Edge Completion for Dynamic Objects Removal

Hybrid Approach for 3D Head Reconstruction: Using Neural Networks and Visual Geometry

Mobile Augmented Reality: Fast, Precise, and Smooth Planar Object Tracking

AV-SLAM: Autonomous Vehicle SLAM with Gravity Direction Initialization

RISEdb: A Novel Indoor Localization Dataset

Future Urban Scenes Generation through Vehicles Synthesis

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

Deep Homography-Based Video Stabilization

Object Segmentation Tracking from Generic Video Cues

MixedFusion: 6D Object Pose Estimation from Decoupled RGB-Depth Features

Derivation of Geometrically and Semantically Annotated UAV Datasets at Large Scales from 3D City Models

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

Rotational Adjoint Methods for Learning-Free 3D Human Pose Estimation from IMU Data

FastCompletion: A Cascade Network with Multiscale Group-Fused Inputs for Real-Time Depth Completion

Light3DPose: Real-Time Multi-Person 3D Pose Estimation from Multiple Views

5D Light Field Synthesis from a Monocular Video

Delivering Meaningful Representation for Monocular Depth Estimation

DAL: A Deep Depth-Aware Long-Term Tracker

Visual Saliency Oriented Vehicle Scale Estimation

Edge-Guided CNN for Denoising Images from Portable Ultrasound Devices

User-Independent Gaze Estimation by Extracting Pupil Parameter and Its Mapping to the Gaze Angle

Ordinal Depth Classification Using Region-Based Self-Attention

Video Reconstruction by Spatio-Temporal Fusion of Blurred-Coded Image Pair

D3Net: Joint Demosaicking, Deblurring and Deringing

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Free-Form Image Inpainting Via Contrastive Attention Network

Enhancing Depth Quality of Stereo Vision Using Deep Learning-Based Prior Information of the Driving Environment

A Multi-Task Neural Network for Action Recognition with 3D Key-Points

Learning Knowledge-Rich Sequential Model for Planar Homography Estimation in Aerial Video

Movement-Induced Priors for Deep Stereo

A Two-Step Approach to Lidar-Camera Calibration