ICPR2020 Paper Browser

Paper download is intended for registered attendees only, and is subjected to the IEEE Copyright Policy. Any other use is strongly forbidden.

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

Matteo Poggi, Fabio Tosi, Filippo Aleotti, Stefano Mattoccia

Auto-TLDR; Joint Training of Deep-Networks for Outlier Detection from Stereo Images

Abstract Slides Poster

Deep architectures represent the state-of-the-art for perceiving depth from stereo images. Although these methods are highly accurate, it is crucial to effectively detect any outlier through confidence measures since a wrong perception of even small portions of the sensed scene might lead to catastrophic consequences, for instance, in autonomous driving. Purposely, state-of-the-art confidence estimation methods rely on deep-networks as well. In this paper, arguing that these tasks are two sides of the same coin, we propose a novel paradigm for their joint training. Specifically, inspired by the successful deployment of GANs in other fields, we design two deep architectures: a generator for disparity estimation and a discriminator for distinguishing correct assignments from outliers. The two networks are jointly trained in a new peculiar weakly adversarial manner pushing the former to fix the errors detected by the discriminator while keeping the correct prediction unchanged. Experimental results on standard stereo datasets prove that such joint training paradigm yields significant improvements. Moreover, an additional outcome of our proposal is the ability to detect outliers with better accuracy compared to the state-of-the-art.

Similar papers

Learning Stereo Matchability in Disparity Regression Networks

Jingyang Zhang, Yao Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang, Long Quan

Auto-TLDR; Deep Stereo Matchability for Weakly Matchable Regions

Abstract Slides Similar

Learning-based stereo matching has recently achieved promising results, yet still suffers difficulties in establishing reliable matches in weakly matchable regions that are textureless, non-Lambertian, or occluded. In this paper, we address this challenge by proposing a stereo matching network that considers pixel-wise matchability. Specifically, the network jointly regresses disparity and matchability maps from 3D probability volume through expectation and entropy operations. Next, a learned attenuation is applied as the robust loss function to alleviate the influence of weakly matchable pixels in the training. Finally, a matchability-aware disparity refinement is introduced to improve the depth inference in weakly matchable regions. The proposed deep stereo matchability (DSM) framework can improve the matching result or accelerate the computation while still guaranteeing the quality. Moreover, the DSM framework is portable to many recent stereo networks. Extensive experiments are conducted on Scene Flow and KITTI stereo datasets to demonstrate the effectiveness of the proposed framework over the state-of-the-art learning-based stereo methods.

Movement-Induced Priors for Deep Stereo

Yuxin Hou, Muhammad Kamran Janjua, Juho Kannala, Arno Solin

Auto-TLDR; Fusing Stereo Disparity Estimation with Movement-induced Prior Information

Leveraging a Weakly Adversarial Paradigm for Joint Learning of Disparity and Confidence Estimation

Similar papers

Learning Stereo Matchability in Disparity Regression Networks

Movement-Induced Priors for Deep Stereo

FC-DCNN: A Densely Connected Neural Network for Stereo Estimation

Deeply-Fused Attentive Network for Stereo Matching

Two-Stage Adaptive Object Scene Flow Using Hybrid CNN-CRF Model

Extending Single Beam Lidar to Full Resolution by Fusing with Single Image Depth Estimation

Suppressing Features That Contain Disparity Edge for Stereo Matching

Attention Stereo Matching Network

Real-Time Monocular Depth Estimation with Extremely Light-Weight Neural Network

ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

Domain Siamese CNNs for Sparse Multispectral Disparity Estimation

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

Multi-Scale Residual Pyramid Attention Network for Monocular Depth Estimation

P2D: A Self-Supervised Method for Depth Estimation from Polarimetry

HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects

Efficient Shadow Detection and Removal Using Synthetic Data with Domain Adaptation

Feature Point Matching in Cross-Spectral Images with Cycle Consistency Learning

DeepBEV: A Conditional Adversarial Network for Bird’s Eye View Generation

Shape Consistent 2D Keypoint Estimation under Domain Shift

Can You Trust Your Pose? Confidence Estimation in Visual Localization

Local Facial Attribute Transfer through Inpainting

NetCalib: A Novel Approach for LiDAR-Camera Auto-Calibration Based on Deep Learning

Machine-Learned Regularization and Polygonization of Building Segmentation Masks

Delivering Meaningful Representation for Monocular Depth Estimation

Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

Foreground-Focused Domain Adaption for Object Detection

Enhancing Depth Quality of Stereo Vision Using Deep Learning-Based Prior Information of the Driving Environment

Unsupervised Domain Adaptation with Multiple Domain Discriminators and Adaptive Self-Training

JUMPS: Joints Upsampling Method for Pose Sequences

Combining GANs and AutoEncoders for Efficient Anomaly Detection

Partially Supervised Multi-Task Network for Single-View Dietary Assessment

Unsupervised 3D Human Pose Estimation in Multi-view-multi-pose Video

A Fine-Grained Dataset and Its Efficient Semantic Segmentation for Unstructured Driving Scenarios

FastCompletion: A Cascade Network with Multiscale Group-Fused Inputs for Real-Time Depth Completion

SECI-GAN: Semantic and Edge Completion for Dynamic Objects Removal

Object Segmentation Tracking from Generic Video Cues

Uncertainty-Sensitive Activity Recognition: A Reliability Benchmark and the CARING Models

Holistic Grid Fusion Based Stop Line Estimation

GAP: Quantifying the Generative Adversarial Set and Class Feature Applicability of Deep Neural Networks

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions

Attention Based Coupled Framework for Road and Pothole Segmentation

Coarse to Fine: Progressive and Multi-Task Learning for Salient Object Detection

Future Urban Scenes Generation through Vehicles Synthesis

HPERL: 3D Human Pose Estimastion from RGB and LiDAR

PEAN: 3D Hand Pose Estimation Adversarial Network

Super-Resolution Guided Pore Detection for Fingerprint Recognition

Ordinal Depth Classification Using Region-Based Self-Attention

Robust Pedestrian Detection in Thermal Imagery Using Synthesized Images