ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Enhancing Depth Quality of Stereo Vision Using Deep Learning-Based Prior Information of the Driving Environment

Weifu Li, Vijay John, Seiichi Mita

Auto-TLDR; A Novel Post-processing Mathematical Framework for Stereo Vision

Abstract Slides Poster

Generation of high density depth values of the driving environment is indispensable for autonomous driving. Stereo vision is one of the practical and effective methods to generate these depth values. However, the accuracy of the stereo vision is limited by texture-less regions, such as sky and road areas, and repeated patterns in the image. To overcome these problems, we propose to enhance the stereo generated depth by incorporating prior information of the driving environment. Prior information, generated by deep learning-based U-Net model, is utilized in a novel post-processing mathematical framework to refine the stereo generated depth. The proposed mathematical framework is formulated as an optimization problem, which refines the errors due to texture-less regions and repeated patterns. Owing to its mathematical formulation, the post-processing framework is not a black-box and is explainable, and can be readily utilized for depth maps generated by any stereo vision algorithm. The proposed framework is qualitatively validated on the acquired dataset and KITTI dataset. The results obtained show that the proposed framework improves the stereo depth generation accuracy

Similar papers

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Yawen Lu, Yuxing Wang, Devarth Parikh, Guoyu Lu

Auto-TLDR; Self-supervised LIDAR for Low-Cost Depth Estimation

Abstract Slides Similar

Depth estimation is playing an important role in indoor and outdoor scene understanding, autonomous driving, augmented reality and many other tasks. Vehicles and robotics are able to use active illumination sensors such as LIDAR to receive high precision depth estimation. However, high-resolution Lidars are usually too expensive, which limits its massive production on various applications. Though single beam LIDAR enjoys the benefits of low cost, one beam depth sensing is not usually sufficient to perceive the surrounding environment in many scenarios. In this paper, we propose a learning-based framework to explore to replicate similar or even higher performance as costly LIDARs with our designed self-supervised network and a low-cost single-beam LIDAR. After the accurate calibration with a visible camera, the single beam LIDAR can adjust the scale uncertainty of the depth map estimated by the visible camera. The adjusted depth map enjoys the benefits of high resolution and sensing accuracy as high beam LIDAR and maintains low-cost as single beam LIDAR. Thus we can achieve similar sensing effect of high beam LIDAR with more than a 50-100 times cheaper price (e.g., \$80000 Velodyne HDL-64E LIDAR v.s. \$1000 SICK TIM-781 2D LIDAR and normal camera). The proposed approach is verified on our collected dataset and public dataset with superior depth-sensing performance.

Learning Stereo Matchability in Disparity Regression Networks

Jingyang Zhang, Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

Auto-TLDR; Deep Stereo Matchability for Weakly Matchable Regions

Abstract Slides Similar

Learning-based stereo matching has recently achieved promising results, yet still suffers difficulties in establishing reliable matches in weakly matchable regions that are textureless, non-Lambertian, or occluded. In this paper, we address this challenge by proposing a stereo matching network that considers pixel-wise matchability. Specifically, the network jointly regresses disparity and matchability maps from 3D probability volume through expectation and entropy operations. Next, a learned attenuation is applied as the robust loss function to alleviate the influence of weakly matchable pixels in the training. Finally, a matchability-aware disparity refinement is introduced to improve the depth inference in weakly matchable regions. The proposed deep stereo matchability (DSM) framework can improve the matching result or accelerate the computation while still guaranteeing the quality. Moreover, the DSM framework is portable to many recent stereo networks. Extensive experiments are conducted on Scene Flow and KITTI stereo datasets to demonstrate the effectiveness of the proposed framework over the state-of-the-art learning-based stereo methods.

Movement-Induced Priors for Deep Stereo

Yuxin Hou, Muhammad Kamran Janjua, Juho Kannala, Arno Solin

Auto-TLDR; Fusing Stereo Disparity Estimation with Movement-induced Prior Information

Enhancing Depth Quality of Stereo Vision Using Deep Learning-Based Prior Information of the Driving Environment

Similar papers

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Learning Stereo Matchability in Disparity Regression Networks

Movement-Induced Priors for Deep Stereo

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

Deeply-Fused Attentive Network for Stereo Matching

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Attention Based Coupled Framework for Road and Pothole Segmentation

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

Attention Stereo Matching Network

Suppressing Features That Contain Disparity Edge for Stereo Matching

Holistic Grid Fusion Based Stop Line Estimation

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

Lane Detection Based on Object Detection and Image-To-Image Translation

Semantic Segmentation for Pedestrian Detection from Motion in Temporal Domain

Delivering Meaningful Representation for Monocular Depth Estimation

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

5D Light Field Synthesis from a Monocular Video

Edge-Aware Monocular Dense Depth Estimation with Morphology

Semantic Segmentation Refinement Using Entropy and Boundary-guided Monte Carlo Sampling and Directed Regional Search

Feature Point Matching in Cross-Spectral Images with Cycle Consistency Learning

ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

Calibration and Absolute Pose Estimation of Trinocular Linear Camera Array for Smart City Applications

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

Cost Volume Refinement for Depth Prediction

Anomaly Detection, Localization and Classification for Railway Inspection

Visual Saliency Oriented Vehicle Scale Estimation

Street-Map Based Validation of Semantic Segmentation in Autonomous Driving

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

Polarimetric Image Augmentation

Adaptive Image Compression Using GAN Based Semantic-Perceptual Residual Compensation

Deep Realistic Novel View Generation for City-Scale Aerial Images

Temporal Pulses Driven Spiking Neural Network for Time and Power Efficient Object Recognition in Autonomous Driving

SIDGAN: Single Image Dehazing without Paired Supervision

Transitional Asymmetric Non-Local Neural Networks for Real-World Dirt Road Segmentation

Towards Efficient 3D Point Cloud Scene Completion Via Novel Depth View Synthesis

Human Segmentation with Dynamic LiDAR Data

DSPNet: Deep Learning-Enabled Blind Reduction of Speckle Noise

RONELD: Robust Neural Network Output Enhancement for Active Lane Detection

Breast Anatomy Enriched Tumor Saliency Estimation

Fast Region-Adaptive Defogging and Enhancement for Outdoor Images Containing Sky

DA-RefineNet: Dual-Inputs Attention RefineNet for Whole Slide Image Segmentation

Object Detection on Monocular Images with Two-Dimensional Canonical Correlation Analysis

A Two-Step Approach to Lidar-Camera Calibration

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Dynamic Guided Network for Monocular Depth Estimation